METHODS OF DIAGNOSIS OF BREAST CANCER, COMPOSITIONS
AND METHODS OF SCREENING FOR MODULATORS OF BREAST
CANCER



CROSS-REFERENCES TO RELATED APPLICATIONS 


This application claims priority to USSN 60/263,965, filed January 24, 2001;
USSN 60/265,928, filed February 2, 2001; USSN 09/829,472, filed April 9, 2001; USSN
60/282,698, filed April 9, 2001; USSN 60/288,590, filed May 4, 2001; and USSN
60/294,443, filed May 29, 2001, all of which are incorporated herein by reference in their
entirety.



FIELD OF THE INVENTION 

The invention relates to the identification of nucleic acid and protein 
expression profiles and nucleic acids, products, and antibodies thereto that are involved in
breast cancer; and to the use of such expression profiles and compositions in the diagnosis, 
prognosis and therapy of breast cancer. The invention further relates to methods for 
identifying and using agents and/or targets that inhibit breast cancer. 

BACKGROUND OF THE INVENTION

Breast cancer is one of the most frequently diagnosed cancers and the second
leading cause of female cancer death in North America and northern Europe, with lung
cancer being the leading cause. Lifetime incidence of the disease in the United States is
one-in-eight, with a 1-in-29 lifetime risk of dying from breast cancer. Early detection of breast
cancer, using mammography, clinical breast examination, and breast self-examination, has
dramatically improved the treatment of the disease, although sensitivity is still a major concern,
as mammographic sensitivity has been estimated at only 60%-90%. Treatment of breast
cancer consists largely of surgical lumpectomy or mastectomy, radiation therapy,
anti-hormone therapy, and/or chemotherapy. Although many breast cancer patients are effectively
treated, the current therapies can all induce serious side effects which diminish quality of life.
The choice of a particular course of treatment is typically based on a variety of prognostic
parameters and markers (Fitzgibbons et al., 2000, Arch. Pathol. Lab. Med. 124:966-978;
Hamilton and Piccart, 2000, Ann. Oncol. 11:647-663), including the genetic predisposition
markers BRCA-1 and BRCA-2 (Robson, 2000, J. Clin. Oncol. 18:113sup-118sup).

Imaging of breast cancer for diagnosis has been problematic and limited. In
addition, dissemination of tumor cells (metastases) to locoregional lymph nodes is an
important prognostic factor; five-year survival rates drop from 80 percent in patients with no
lymph node metastases to 45 to 50 percent in those patients who do have lymph node
metastases. A recent report showed that micrometastases can be detected from lymph nodes
using reverse transcriptase-PCR methods based on the presence of mRNA for
carcinoembryonic antigen, which has previously been shown to be present in the vast
majority of breast cancers but not in normal tissues. Liefers et al., New England J. of Med.
339(4):223 (1998).

The identification of novel therapeutic targets and diagnostic markers is
essential for improving the current treatment of breast cancer patients. Recent advances in
molecular medicine have increased the interest in tumor-specific cell surface antigens that
could serve as targets for various immunotherapeutic or small molecule strategies. Antigens
suitable for immunotherapeutic strategies should be highly expressed in cancer tissues and
ideally not expressed in normal adult tissues. Expression in tissues that are dispensable for
life, however, may be tolerated. Examples of such antigens include Her2/neu and the B-cell
antigen CD20. Humanized monoclonal antibodies directed to Her2/neu
(Herceptin®/trastuzumab) are currently in use for the treatment of metastatic breast cancer
(Ross and Fletcher, 1998, Stem Cells 16:413-428). Similarly, anti-CD20 monoclonal
antibodies (Rituxan®/rituximab) are used to effectively treat non-Hodgkin's lymphoma
(Maloney et al., 1997, Blood 90:2188-2195; Leget and Czuczman, 1998, Curr. Opin. Oncol.
10:548-551).

Other potential immunotherapeutic targets have been identified for breast
cancer. One such target is polymorphic epithelial mucin (MUC1). MUC1 is a transmembrane
protein, present at the apical surface of glandular epithelial cells. It is often overexpressed in
breast cancer, and typically exhibits an altered glycosylation pattern, resulting in an
antigenically distinct molecule, and is in early clinical trials as a vaccine target (Gilewski et
al., 2000, Clin. Cancer Res. 6:1693-1701; Scholl et al., 2000, J. Immunother. 23:570-580).
The tumor-expressed protein is often cleaved into the circulation, where it is detectable as the
tumor marker CA 15-3 (Bon et al., 1997, Clin. Chem. 43:585-593). However, many patients
have tumors that express neither HER2 nor MUC-1; therefore, it is clear that other targets
need to be identified to manage localized and metastatic disease. Many other genes have been
reported to be overexpressed in breast cancer, such as EGFR (Sainsbury et al., 1987, Lancet
1(8547):1398-1402), c-erbB3 (Naidu et al., 1988, Br. J. Cancer 78:1385-1390), FGFR2
(Penault-Llorca et al., 1991, Int. J. Cancer 61:170-176), PKW (Preiherr et al., 2000,
Anticancer Res. 20:2255-2264), MTA1 (Nawa et al., 2000, J. Cell Biochem. 79:202-212), and
breast cancer associated gene 1 (Kurt et al., 2000, Breast Cancer Res. Treat. 59:41-48).
Although monoclonal antibodies to the protein products of some of these overexpressed
genes have been reported (for review, see Green et al., 2000, Cancer Treat. Rev. 26:269-286),
none are currently approved for breast cancer therapy in the US.

Disclosures of certain genes and ESTs described as being expressed in breast
cancer are found in international patent applications WO-99/33869, WO-97/25426,
WO-97/02280, WO-00/55173, WO-98/45328 and WO-00/22130. Similarly, genes and ESTs
described as being expressed in breast cancer are disclosed in US Patent Nos. 5,759,776 and
5,693,522. The utility of such genes is described in each of these publications, and their
disclosures are incorporated herein in their entirety.

While industry and academia have identified novel sequences, there has not
been an equal effort exerted to identify the function of these novel sequences. The
elucidation of a role for novel proteins and compounds in disease states for identification of
therapeutic targets and diagnostic markers is essential for improving the current treatment of
breast cancer patients. Accordingly, provided herein are molecular targets for therapeutic
intervention in breast and other cancers. Additionally, provided herein are methods that can
be used in diagnosis and prognosis of breast cancer. Further provided are methods that can
be used to screen candidate bioactive agents for the ability to modulate breast cancer.

SUMMARY OF THE INVENTION

The present invention therefore provides nucleotide sequences of genes that
are up- and down-regulated in breast cancer cells. Such genes are useful for diagnostic
purposes, and also as targets for screening for therapeutic compounds that modulate breast
cancer, such as hormones or antibodies. Other aspects of the invention will become apparent
to the skilled artisan by the following description of the invention.

In one aspect, the present invention provides a method of detecting a breast 
cancer-associated transcript in a cell from a patient, the method comprising contacting a 
biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-25. 

In one embodiment, the present invention provides a method of determining 
the level of a breast cancer associated transcript in a cell from a patient. 

In one embodiment, the present invention provides a method of detecting a 
breast cancer-associated transcript in a cell from a patient, the method comprising contacting
a biological sample from the patient with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-25.

In one embodiment, the polynucleotide selectively hybridizes to a sequence at
least 95% identical to a sequence as shown in Tables 1-25.

In one embodiment, the biological sample is a tissue sample. In another
embodiment, the biological sample comprises isolated nucleic acids, e.g., mRNA.

In one embodiment, the polynucleotide is labeled, e.g., with a fluorescent
label.

In one embodiment, the polynucleotide is immobilized on a solid surface.

In one embodiment, the patient is undergoing a therapeutic regimen to treat
breast cancer. In another embodiment, the patient is suspected of having metastatic breast
cancer.

In one embodiment, the patient is a human. 


In one embodiment, the breast cancer associated transcript is mRNA. 




WO 02/059377 PCT/US02/02242 


In one embodiment, the method further comprises the step of amplifying
nucleic acids before the step of contacting the biological sample with the polynucleotide.

In another aspect, the present invention provides a method of monitoring the
efficacy of a therapeutic treatment of breast cancer, the method comprising the steps of: (i)
providing a biological sample from a patient undergoing the therapeutic treatment; and (ii)
determining the level of a breast cancer-associated transcript in the biological sample by
contacting the biological sample with a polynucleotide that selectively hybridizes to a
sequence at least 80% identical to a sequence as shown in Tables 1-25, thereby monitoring
the efficacy of the therapy. In a further embodiment, the patient has metastatic breast cancer.
In a further embodiment, the patient has a drug-resistant form of breast cancer.

In one embodiment, the method further comprises the step of: (iii) comparing 
the level of the breast cancer-associated transcript to a level of the breast cancer-associated 
transcript in a biological sample from the patient prior to, or earlier in, the therapeutic 
treatment. 

Additionally, provided herein is a method of evaluating the effect of a
candidate breast cancer drug comprising administering the drug to a patient and removing a
cell sample from the patient. The expression profile of the cell is then determined. This
method may further comprise comparing the expression profile to an expression profile of a
healthy individual. In a preferred embodiment, said expression profile includes a gene of
Tables 1-25.

In one aspect, the present invention provides an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-25.

In one embodiment, an expression vector or cell comprises the isolated nucleic
acid.

In one aspect, the present invention provides an isolated polypeptide which is
encoded by a nucleic acid molecule having a polynucleotide sequence as shown in Tables 1-25.

In another aspect, the present invention provides an antibody that specifically
binds to an isolated polypeptide which is encoded by a nucleic acid molecule having a
polynucleotide sequence as shown in Tables 1-25.

In one embodiment, the antibody is conjugated to an effector component, e.g.,
a fluorescent label, a radioisotope or a cytotoxic chemical.

In one embodiment, the antibody is an antibody fragment. In another
embodiment, the antibody is humanized.

In one aspect, the present invention provides a method of detecting a breast
cancer cell in a biological sample from a patient, the method comprising contacting the
biological sample with an antibody as described herein.

In another aspect, the present invention provides a method of detecting
antibodies specific to breast cancer in a patient, the method comprising contacting a
biological sample from the patient with a polypeptide encoded by a nucleic acid comprising a
sequence from Tables 1-25.

In another aspect, the present invention provides a method for identifying a
compound that modulates a breast cancer-associated polypeptide, the method comprising the
steps of: (i) contacting the compound with a breast cancer-associated polypeptide, the
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least
80% identical to a sequence as shown in Tables 1-25; and (ii) determining the functional
effect of the compound upon the polypeptide.

In one embodiment, the functional effect is a physical effect, an enzymatic
effect, or a chemical effect.

In one embodiment, the polypeptide is expressed in a eukaryotic host cell or
cell membrane. In another embodiment, the polypeptide is recombinant.

In one embodiment, the functional effect is determined by measuring ligand 
binding to the polypeptide. 

In another aspect, the present invention provides a method of inhibiting
proliferation of a breast cancer-associated cell to treat breast cancer in a patient, the method
comprising the step of administering to the subject a therapeutically effective amount of a
compound identified as described herein.

In one embodiment, the compound is an antibody.

In another aspect, the present invention provides a drug screening assay
comprising the steps of: (i) administering a test compound to a mammal having breast cancer
or to a cell sample isolated therefrom; (ii) comparing the level of gene expression of a
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence
as shown in Tables 1-25 in a treated cell or mammal with the level of gene expression of the
polynucleotide in a control cell sample or mammal, wherein a test compound that modulates
the level of expression of the polynucleotide is a candidate for the treatment of breast cancer.

In one embodiment, the control is a mammal with breast cancer or a cell
sample therefrom that has not been treated with the test compound. In another embodiment,
the control is a normal cell or mammal.

In one embodiment, the test compound is administered in varying amounts or
concentrations. In another embodiment, the test compound is administered for varying time
periods. In another embodiment, the comparison can occur after addition or removal of the
drug candidate.

In one embodiment, the levels of a plurality of polynucleotides that selectively
hybridize to a sequence at least 80% identical to a sequence as shown in Tables 1-25 are
individually compared to their respective levels in a control cell sample or mammal. In a
preferred embodiment, the plurality of polynucleotides is from three to ten.
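
By way of illustration only, the comparison of a small panel of transcripts between a treated sample and a control sample could be sketched as follows. The gene labels, the expression values, and the two-fold cutoff (`min_abs_log2_fc=1.0`) are hypothetical assumptions chosen for the example; the text above does not specify a threshold or a particular measure of modulation.

```python
import math


def modulated_genes(treated, control, min_abs_log2_fc=1.0):
    """Return {gene: log2 fold change} for genes whose level changes enough.

    `treated` and `control` map gene labels to positive expression levels.
    The cutoff of 1.0 (a two-fold change) is an illustrative assumption.
    """
    hits = {}
    for gene, control_level in control.items():
        if gene not in treated:
            continue  # each gene is compared only to its own control level
        treated_level = treated[gene]
        if control_level <= 0 or treated_level <= 0:
            continue  # a log ratio is undefined for non-positive levels
        log2_fc = math.log2(treated_level / control_level)
        if abs(log2_fc) >= min_abs_log2_fc:
            hits[gene] = log2_fc
    return hits


# Hypothetical panel of three transcripts (labels and values are invented).
control_levels = {"geneA": 100.0, "geneB": 110.0, "geneC": 100.0}
treated_levels = {"geneA": 400.0, "geneB": 100.0, "geneC": 25.0}
panel_hits = modulated_genes(treated_levels, control_levels)
```

In this sketch a positive value indicates an increase and a negative value a decrease relative to the control; a test compound producing entries in `panel_hits` would, under these assumptions, be flagged for further study.
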

In another aspect, the present invention provides a method for treating a 
mammal having breast cancer comprising administering a compound identified by the assay 
described herein. 

In another aspect, the present invention provides a pharmaceutical
composition for treating a mammal having breast cancer, the composition comprising a
compound identified by the assay described herein and a physiologically acceptable
excipient.

In one aspect, the present invention provides a method of screening drug
candidates by providing a cell expressing a gene that is up- or down-regulated as in
breast cancer. In one embodiment, a gene is selected from Tables 1-25. The method further
includes adding a drug candidate to the cell and determining the effect of the drug candidate
on the expression of the expression profile gene.

In one embodiment, the method of screening drug candidates includes
comparing the level of expression in the absence of the drug candidate to the level of
expression in the presence of the drug candidate, wherein the concentration of the drug
candidate can vary when present, and wherein the comparison can occur after addition or
removal of the drug candidate. In a preferred embodiment, the cell expresses at least two
expression profile genes. The profile genes may show an increase or decrease.

Also provided is a method of evaluating the effect of a candidate breast cancer
drug comprising administering the drug to a transgenic animal expressing or over-expressing
the breast cancer modulatory protein, or an animal lacking the breast cancer modulatory
protein, for example as a result of a gene knockout.

Moreover, provided herein is a biochip comprising one or more nucleic acid
segments of Tables 1-25, wherein the biochip comprises fewer than 1000 nucleic acid probes.
Preferably, at least two nucleic acid segments are included. More preferably, at least three
nucleic acid segments are included.

Furthermore, a method of diagnosing a disorder associated with breast cancer
is provided. The method comprises determining the expression of a gene of Tables 1-25,
preferably a gene of Table 25, in a first tissue type of a first individual, and comparing it
to the expression of the gene from a second normal tissue type from the first
individual or a second unaffected individual. A difference in the expression indicates that the
first individual has a disorder associated with breast cancer.

In a further embodiment, the biochip also includes a polynucleotide sequence
of a gene that is not up- and down-regulated in breast cancer.

In one embodiment, a method is provided for screening for a bioactive agent capable of
interfering with the binding of a breast cancer modulating protein (breast cancer modulatory
protein) or a fragment thereof and an antibody which binds to said breast cancer modulatory
protein or fragment thereof. In a preferred embodiment, the method comprises combining a
breast cancer modulatory protein or fragment thereof, a candidate bioactive agent and an
antibody which binds to said breast cancer modulatory protein or fragment thereof. The
method further includes determining the binding of said breast cancer modulatory protein or
fragment thereof and said antibody. Where there is a change in binding, an agent is
identified as an interfering agent. The interfering agent can be an agonist or an antagonist.
Preferably, the agent inhibits breast cancer.

Also provided herein are methods of eliciting an immune response in an
individual. In one embodiment a method provided herein comprises administering to an
individual a composition comprising a breast cancer modulating protein, or a fragment
thereof. In another embodiment, the protein is encoded by a nucleic acid selected from those
of Tables 1-25.

Further provided herein are compositions capable of eliciting an immune 
response in an individual. In one embodiment, a composition provided herein comprises a 
breast cancer modulating protein, preferably encoded by a nucleic acid of Tables 1-25, more 
preferably of Table 25, or a fragment thereof, and a pharmaceutically acceptable carrier. In 
another embodiment, said composition comprises a nucleic acid comprising a sequence
encoding a breast cancer modulating protein, preferably selected from the nucleic acids of 
Tables 1-25, and a pharmaceutically acceptable carrier. 

Also provided are methods of neutralizing the effect of a breast cancer protein,
or a fragment thereof, comprising contacting an agent specific for said protein with said
protein in an amount sufficient to effect neutralization. In another embodiment, the protein is
encoded by a nucleic acid selected from those of Tables 1-25.

In another aspect of the invention, a method of treating an individual for breast
cancer is provided. In one embodiment, the method comprises administering to said
individual an inhibitor of a breast cancer modulating protein. In another embodiment, the
method comprises administering to a patient having breast cancer an antibody to a breast
cancer modulating protein conjugated to a therapeutic moiety. Such a therapeutic moiety can
be a cytotoxic agent or a radioisotope.

DETAILED DESCRIPTION OF THE INVENTION

In accordance with the objects outlined above, the present invention provides
novel methods for diagnosis and prognosis evaluation for breast cancer (BC), including
metastatic breast cancer, as well as methods for screening for compositions which modulate
breast cancer. Also provided are methods for treating breast cancer.

Tables 1-24B provide unigene cluster identification numbers for the
nucleotide sequence of genes that exhibit increased or decreased expression in breast cancer
samples. Tables 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 18, 19, 20, 21, and 22 list those genes that are
up-regulated in breast cancer cells. Table 14 lists those genes that are highly up-regulated in
breast cancer cells. Tables 1, 2, 3, 15, and 23 list genes that are down-regulated in breast
cancer cells, and Table 16 lists genes that are highly down-regulated in breast cancer cells.
The Tables also provide an exemplar accession number that provides a nucleotide sequence
that is part of the unigene cluster.

Definitions 

The term "breast cancer protein" or "breast cancer polynucleotide" or "breast
cancer-associated transcript" refers to nucleic acid and polypeptide polymorphic variants,
alleles, mutants, and interspecies homologs that: (1) have a nucleotide sequence that has
greater than about 60% nucleotide sequence identity, 65%, 70%, 75%, 80%, 85%, 90%,
preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater nucleotide
sequence identity, preferably over a region of at least about 25, 50, 100, 200,
500, 1000, or more nucleotides, to a nucleotide sequence of or associated with a gene of
Tables 1-25; (2) bind to antibodies, e.g., polyclonal antibodies, raised against an immunogen
comprising an amino acid sequence encoded by a nucleotide sequence of or associated with a
gene of Tables 1-25, and conservatively modified variants thereof; (3) specifically hybridize
under stringent hybridization conditions to a nucleic acid sequence, or the complement
thereof, of Tables 1-25 and conservatively modified variants thereof; or (4) have an amino acid
sequence that has greater than about 60% amino acid sequence identity, 65%, 70%, 75%,
80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% or greater
amino acid sequence identity, preferably over a region of at least about 25, 50,
100, 200, 500, 1000, or more amino acids, to an amino acid sequence encoded by a nucleotide
sequence of or associated with a gene of Tables 1-25. A polynucleotide or polypeptide
sequence is typically from a mammal including, but not limited to, primate, e.g., human;
rodent, e.g., rat, mouse, hamster; cow, pig, horse, sheep, or other mammal. A "breast cancer
polypeptide" and a "breast cancer polynucleotide" include both naturally occurring and
recombinant forms.

A "full length" breast cancer protein or nucleic acid refers to a breast cancer
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements
normally contained in one or more naturally occurring, wild type breast cancer
polynucleotide or polypeptide sequences. The "full length" may be prior to, or after, various
stages of post-translation processing or splicing, including alternative splicing.

"Biological sample" as used herein is a sample of biological tissue or fluid that
contains nucleic acids or polypeptides, e.g., of a breast cancer protein, polynucleotide or
transcript. Such samples include, but are not limited to, tissue isolated from primates, e.g.,
humans, or rodents, e.g., mice, and rats. Biological samples may also include sections of
tissues such as biopsy and autopsy samples, frozen sections taken for histologic purposes,
blood, plasma, serum, sputum, stool, tears, mucus, hair, skin, etc. Biological samples also
include explants and primary and/or transformed cell cultures derived from patient tissues. A
biological sample is typically obtained from a eukaryotic organism, most preferably a
mammal such as a primate, e.g., chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea
pig, rat, mouse; rabbit; or a bird; reptile; or fish.

"Providing a biological sample" means to obtain a biological sample for use in 
methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g.,
isolated by another person, at another time, and/or for another purpose), or by performing the
methods of the invention in vivo. Archival tissues, having treatment or outcome history, will
be particularly useful.

The terms "identical" or percent "identity," in the context of two or more 
nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 

same (i.e., about 60% identity, preferably 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%,
95%, 96%, 97%, 98%, 99%, or higher identity over a specified region, when compared and
aligned for maximum correspondence over a comparison window or designated region) as
measured using a BLAST or BLAST 2.0 sequence comparison algorithm with default
parameters described below, or by manual alignment and visual inspection (see, e.g., NCBI
web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such sequences are then said to
be "substantially identical." This definition also refers to, or may be applied to, the
complement of a test sequence. The definition also includes sequences that have deletions
and/or additions, as well as those that have substitutions, as well as naturally occurring, e.g.,
polymorphic or allelic variants, and man-made variants. As described below, the preferred
algorithms can account for gaps and the like. Preferably, identity exists over a region that is
at least about 25 amino acids or nucleotides in length, or more preferably over a region that is
50-100 amino acids or nucleotides in length.

For sequence comparison, typically one sequence acts as a reference sequence,
to which test sequences are compared. When using a sequence comparison algorithm, test
and reference sequences are entered into a computer, subsequence coordinates are designated,
if necessary, and sequence algorithm program parameters are designated. Preferably, default
program parameters can be used, or alternative parameters can be designated. The sequence
comparison algorithm then calculates the percent sequence identities for the test sequences
relative to the reference sequence, based on the program parameters.
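
As a minimal sketch of the final calculation only, percent identity for two sequences that have already been aligned might be computed as shown below. The function names, the use of '-' as the gap character, and the choice to count gap columns in the denominator are assumptions made for illustration; alignment programs differ in how they treat gaps, and this is not the calculation performed by any particular program named herein.

```python
def percent_identity(aligned_test: str, aligned_ref: str) -> float:
    """Percent identity over the aligned region of two pre-aligned sequences.

    Assumptions: both strings come from a prior alignment step, have equal
    length, and use '-' for gaps. Columns with a gap in one sequence count
    toward the length but never as a match; all-gap columns are skipped.
    """
    if len(aligned_test) != len(aligned_ref):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_test.upper(), aligned_ref.upper()):
        if a == "-" and b == "-":
            continue
        columns += 1
        if a == b:
            matches += 1
    return 100.0 * matches / columns if columns else 0.0


def meets_identity(aligned_test: str, aligned_ref: str, threshold: float) -> bool:
    """True when the test sequence reaches the stated percent identity."""
    return percent_identity(aligned_test, aligned_ref) >= threshold


# One mismatch in ten aligned positions gives 90% identity.
example_identity = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
```

Under these assumptions, `meets_identity("ACGTACGTAC", "ACGTTCGTAC", 80.0)` is true, while the same pair would not satisfy a 95% threshold.
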

A "comparison window", as used herein, includes reference to a segment of
any one of the number of contiguous positions selected from the group consisting typically of
from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150, in which
a sequence may be compared to a reference sequence of the same number of contiguous
positions after the two sequences are optimally aligned. Methods of alignment of sequences
for comparison are well-known in the art. Optimal alignment of sequences for comparison
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl.
Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol.
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l.
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package,
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and
visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al., eds.
1995 supplement)).
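
For illustration, the local homology approach cited above can be sketched as a small dynamic-programming routine that returns the best local alignment score. The scoring values (match +2, mismatch -1, linear gap -2) and the function name are assumptions chosen for the example and are not parameters taken from the cited references or software packages.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -2) -> int:
    """Best local alignment score between a and b (linear gap penalty).

    Only two rows of the dynamic-programming table are kept, since the
    score (not the traceback) is all this sketch reports.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            # Flooring at zero is what makes the alignment local.
            curr[j] = max(0, diag, up, left)
            best = max(best, curr[j])
        prev = curr
    return best


# The shared core "ACGT" scores 4 matches x 2 = 8 under these parameters.
core_score = smith_waterman_score("TTACGTTT", "GGACGTGG")
```

A global method such as that of Needleman & Wunsch differs mainly in omitting the zero floor and in charging for gaps at the sequence ends.
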

Preferred examples of algorithms that are suitable for determining percent
sequence identity and sequence similarity include the BLAST and BLAST 2.0 algorithms,
which are described in Altschul et al., Nuc. Acids Res. 25:3389-3402 (1997) and Altschul et
al., J. Mol. Biol. 215:403-410 (1990). BLAST and BLAST 2.0 are used, with the parameters
described herein, to determine percent sequence identity for the nucleic acids and proteins of
the invention. Software for performing BLAST analyses is publicly available through the
National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This
algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short
words of length W in the query sequence, which either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence. T is
referred to as the neighborhood word score threshold (Altschul et al., supra). These initial
neighborhood word hits act as seeds for initiating searches to find longer HSPs containing
them. The word hits are extended in both directions along each sequence for as far as the
cumulative alignment score can be increased. Cumulative scores are calculated using, e.g.,
for nucleotide sequences, the parameters M (reward score for a pair of matching residues;
always > 0) and N (penalty score for mismatching residues; always < 0). For amino acid
sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word
hits in each direction is halted when: the cumulative alignment score falls off by the
quantity X from its maximum achieved value; the cumulative score goes to zero or below,
due to the accumulation of one or more negative-scoring residue alignments; or the end of
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences)
uses as defaults a wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a
comparison of both strands. For amino acid sequences, the BLASTP program uses as
defaults a wordlength of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1992)) alignments (B) of
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
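
The extension step described above can be sketched, for nucleotide sequences and without gaps, as follows. This is a simplified illustration and not the BLAST implementation: the function names are hypothetical, the seed is assumed to be an exact word match, and the X value of 20 is an assumed figure, whereas the reward M=5 and penalty N=-4 are the values recited in the text.

```python
def _extend(query, subject, q, s, step, start_score, reward, penalty, x_drop):
    """Walk from (q, s) in direction `step`; return (best score, residues kept)."""
    score = best = start_score
    kept = 0    # residues included at the best-scoring point
    walked = 0  # residues examined so far
    while 0 <= q < len(query) and 0 <= s < len(subject):
        score += reward if query[q] == subject[s] else penalty
        walked += 1
        if score > best:
            best, kept = score, walked
        # Halt on a fall-off of X from the maximum, or a score of zero or below.
        if best - score >= x_drop or score <= 0:
            break
        q += step
        s += step
    return best, kept  # the loop also ends when either sequence runs out


def extend_seed(query, subject, q_start, s_start, word_len,
                reward=5, penalty=-4, x_drop=20):
    """Extend an exact word hit in both directions without gaps.

    Returns (score, query_begin, query_end) with a half-open query span.
    """
    seed_score = word_len * reward  # assumes every seed position matches
    score, right = _extend(query, subject, q_start + word_len,
                           s_start + word_len, 1, seed_score,
                           reward, penalty, x_drop)
    score, left = _extend(query, subject, q_start - 1, s_start - 1, -1,
                          score, reward, penalty, x_drop)
    return score, q_start - left, q_start + word_len + right


# Hypothetical sequences sharing the 8-residue core "ACGTACGT".
query_seq = "GGGACGTACGTTTT"
subject_seq = "CCCACGTACGTAAA"
hsp = extend_seed(query_seq, subject_seq, 3, 3, 4)
```

Here the four-residue seed grows to the full eight-residue match, and the flanking mismatches are examined but not retained because they only lower the cumulative score.
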

The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA
90:5873-5877 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest
sum probability (P(N)), which provides an indication of the probability by which a match
between two nucleotide or amino acid sequences would occur by chance. For example, a
nucleic acid is considered similar to a reference sequence if the smallest sum probability in a
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more
preferably less than about 0.01, and most preferably less than about 0.001. Log values may
be large negative numbers, e.g., 5, 10, 20, 30, 40, 40, 70, 90, 110, 150, 170, etc.
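
The probability thresholds recited above might be applied as in the following sketch. The function name and tier labels are hypothetical, and because the text qualifies each cutoff with "about", the exact boundaries used here are an illustrative assumption.

```python
def similarity_tier(smallest_sum_probability: float) -> str:
    """Classify a smallest sum probability P(N) against the stated cutoffs."""
    p = smallest_sum_probability
    if not 0.0 <= p <= 1.0:
        raise ValueError("a probability must lie between 0 and 1")
    if p < 0.001:
        return "most preferred"
    if p < 0.01:
        return "more preferred"
    if p < 0.2:
        return "similar"
    return "not similar"


# A P(N) of 1e-30 falls well under the strictest cutoff of about 0.001.
tier = similarity_tier(1e-30)
```
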
An indication that two nucleic acid sequences or polypeptides are substantially
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross
reactive with the antibodies raised against the polypeptide encoded by the second nucleic
acid, as described below. Thus, a polypeptide is typically substantially identical to a second
polypeptide, e.g., where the two peptides differ only by conservative substitutions. Another
indication that two nucleic acid sequences are substantially identical is that the two molecules
or their complements hybridize to each other under stringent conditions, as described below.
Yet another indication that two nucleic acid sequences are substantially identical is that the
same primers can be used to amplify the sequences.

A "host cell" is a naturally occurring cell or a transformed cell that contains an
expression vector and supports the replication or expression of the expression vector. Host
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be
prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture
Collection catalog or web site, www.atcc.org).

The terms "isolated," "purified," or "biologically pure" refer to material that is
substantially or essentially free from components that normally accompany it as found in its
native state. Purity and homogeneity are typically determined using analytical chemistry
techniques such as polyacrylamide gel electrophoresis or high performance liquid
chromatography. A protein or nucleic acid that is the predominant species present in a
preparation is substantially purified. In particular, an isolated nucleic acid is separated from
some open reading frames that naturally flank the gene and encode proteins other than the
protein encoded by the gene. The term "purified" in some embodiments denotes that a nucleic
acid or protein gives rise to essentially one band in an electrophoretic gel. Preferably, it means
that the nucleic acid or protein is at least 85% pure, more preferably at least 95% pure, and
most preferably at least 99% pure. "Purify" or "purification" in other embodiments means
removing at least one contaminant from the composition to be purified. In this sense,
purification does not require that the purified compound be homogenous, e.g., 100% pure.

The terms "polypeptide," "peptide" and "protein" are used interchangeably
herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding
naturally occurring amino acid, as well as to naturally occurring amino acid polymers, those
containing modified residues, and non-naturally occurring amino acid polymers.

The term "amino acid" refers to naturally occurring and synthetic amino acids,
as well as amino acid analogs and amino acid mimetics that function similarly to the naturally
occurring amino acids. Naturally occurring amino acids are those encoded by the genetic
code, as well as those amino acids that are later modified, e.g., hydroxyproline,
γ-carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have
the same basic chemical structure as a naturally occurring amino acid, e.g., an α carbon that is
bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine,
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs may have
modified R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic
chemical structure as a naturally occurring amino acid. Amino acid mimetics refers to
chemical compounds that have a structure that is different from the general chemical
structure of an amino acid, but that function similarly to a naturally occurring amino acid.

Amino acids may be referred to herein by either their commonly known three
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly
accepted single-letter codes.

"Conservatively modified variants" applies to both amino acid and nucleic
acid sequences. With respect to particular nucleic acid sequences, conservatively modified
variants refers to those nucleic acids which encode identical or essentially identical amino
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to
essentially identical or associated, e.g., naturally contiguous, sequences. Because of the
degeneracy of the genetic code, a large number of functionally identical nucleic acids encode
most proteins. For instance, the codons GCA, GCC, GCG and GCU all encode the amino
acid alanine. Thus, at every position where an alanine is specified by a codon, the codon can
be altered to another of the corresponding codons described without altering the encoded
polypeptide. Such nucleic acid variations are "silent variations," which are one species of
conservatively modified variations. Every nucleic acid sequence herein which encodes a
polypeptide also describes silent variations of the nucleic acid. One of skill will recognize
that in certain contexts each codon in a nucleic acid (except AUG, which is ordinarily the
only codon for methionine, and TGG, which is ordinarily the only codon for tryptophan) can
be modified to yield a functionally identical molecule. Accordingly, often silent variations of
a nucleic acid which encodes a polypeptide are implicit in a described sequence with respect to
the expression product, but not with respect to actual probe sequences.
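
As a brief illustration of silent variations, the following Python sketch builds the standard
genetic code table and lists the codons that encode the same amino acid as a given codon. The
names CODON_TABLE, synonymous_codons and translate are hypothetical and chosen for this example;
the sketch uses the RNA alphabet throughout, so the tryptophan codon appears as UGG.

```python
from itertools import product

# Standard genetic code, with codons enumerated in U, C, A, G order.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def synonymous_codons(codon):
    """Return all codons (sorted) encoding the same amino acid as `codon`."""
    aa = CODON_TABLE[codon]
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def translate(rna):
    """Translate an RNA string codon by codon; a trailing partial codon is ignored."""
    usable = len(rna) - len(rna) % 3
    return "".join(CODON_TABLE[rna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(synonymous_codons("GCA"))   # the four alanine codons
    print(translate("AUGGCAUGG"), translate("AUGGCCUGG"))
```

Replacing GCA with GCC in the example changes the nucleic acid but not the encoded polypeptide,
whereas AUG and UGG have no synonymous alternatives in the standard code.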

As to amino acid sequences, one of skill will recognize that individual
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein
sequence which alters, adds or deletes a single amino acid or a small percentage of amino
acids in the encoded sequence is a "conservatively modified variant" where the alteration
results in the substitution of an amino acid with a chemically similar amino acid.
Conservative substitution tables providing functionally similar amino acids are well known in
the art. Such conservatively modified variants are in addition to and do not exclude
polymorphic variants, interspecies homologs, and alleles of the invention. The following
eight groups each contain amino acids that are typically
conservative substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D),
Glutamic acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5)
Isoleucine (I), Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y),
Tryptophan (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see,
e.g., Creighton, Proteins (1984)).
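
The eight groups listed above can be encoded directly, as in the following Python sketch,
which tests whether two peptides of equal length differ only by conservative substitutions.
The names CONSERVATIVE_GROUPS, is_conservative and differs_only_conservatively are hypothetical,
and the equal-length restriction is a simplifying assumption (deletions and additions are not
handled).

```python
# One-letter codes for the eight groups recited above.
# Methionine (M) appears in both group 5 and group 8, as in the text.
CONSERVATIVE_GROUPS = [
    set("AG"), set("DE"), set("NQ"), set("RK"),
    set("ILMV"), set("FYW"), set("ST"), set("CM"),
]


def is_conservative(a, b):
    """True if residues a and b differ and share at least one group."""
    return a != b and any(a in g and b in g for g in CONSERVATIVE_GROUPS)


def differs_only_conservatively(p, q):
    """True if equal-length peptides p and q match at every position
    either exactly or by a conservative substitution."""
    if len(p) != len(q):
        return False
    return all(x == y or is_conservative(x, y) for x, y in zip(p, q))


if __name__ == "__main__":
    print(differs_only_conservatively("MKDL", "MRDI"))
    print(differs_only_conservatively("MKDL", "MKDP"))
```

Because group membership is not transitive across groups, cysteine and leucine are not treated
as conservative substitutions for one another even though each shares a group with methionine.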

Macromolecular structures such as polypeptide structures can be described in
terms of various levels of organization. For a general discussion of this organization, see,
e.g., Alberts et al., Molecular Biology of the Cell (3rd ed., 1994) and Cantor & Schimmel,
Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980).
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary
structure" refers to locally ordered, three dimensional structures within a polypeptide. These
structures are commonly known as domains. Domains are portions of a polypeptide that
often form a compact unit of the polypeptide and are typically 25 to approximately 500
amino acids long. Typical domains are made up of sections of lesser organization such as
stretches of β-sheet and α-helices. "Tertiary structure" refers to the complete three
dimensional structure of a polypeptide monomer. "Quaternary structure" refers to the three
dimensional structure formed, usually by the noncovalent association of independent tertiary
units. Anisotropic terms are also known as energy terms.

"Nucleic acid" or "oligonucleotide" or "polynucleotide" or grammatical
equivalents used herein means at least two nucleotides covalently linked together.
Oligonucleotides are typically from about 5, 6, 7, 8, 9, 10, 12, 15, 25, 30, 40, 50 or more
nucleotides in length, up to about 100 nucleotides in length. Nucleic acids and
polynucleotides are polymers of any length, including longer lengths, e.g., 200, 300, 500,
1000, 2000, 3000, 5000, 7000, 10,000, etc. A nucleic acid of the present invention will
generally contain phosphodiester bonds, although in some cases, nucleic acid analogs are
included that may have alternate backbones, comprising, e.g., phosphoramidate,
phosphorothioate, phosphorodithioate, or O-methylphosphoroamidite linkages (see Eckstein,
Oligonucleotides and Analogues: A Practical Approach, Oxford University Press); and
peptide nucleic acid backbones and linkages. Other analog nucleic acids include those with
positive backbones; non-ionic backbones, and non-ribose backbones, including those
described in U.S. Patent Nos. 5,235,033 and 5,034,506, and Chapters 6 and 7, ACS
Symposium Series 580, Carbohydrate Modifications in Antisense Research, Sanghvi &
Cook, eds. Nucleic acids containing one or more carbocyclic sugars are also included within
one definition of nucleic acids. Modifications of the ribose-phosphate backbone may be done
for a variety of reasons, e.g., to increase the stability and half-life of such molecules in
physiological environments or as probes on a biochip. Mixtures of naturally occurring
nucleic acids and analogs can be made; alternatively, mixtures of different nucleic acid
analogs, and mixtures of naturally occurring nucleic acids and analogs may be made.

A variety of references disclose such nucleic acid analogs, including, for
example, phosphoramidate (Beaucage et al., Tetrahedron 49(10):1925 (1993) and references
therein; Letsinger, J. Org. Chem. 35:3800 (1970); Sprinzl et al., Eur. J. Biochem. 81:579
(1977); Letsinger et al., Nucl. Acids Res. 14:3487 (1986); Sawai et al., Chem. Lett. 805
(1984); Letsinger et al., J. Am. Chem. Soc. 110:4470 (1988); and Pauwels et al., Chemica
Scripta 26:141 (1986)), phosphorothioate (Mag et al., Nucleic Acids Res. 19:1437 (1991);
and U.S. Patent No. 5,644,048), phosphorodithioate (Brill et al., J. Am. Chem. Soc. 111:2321
(1989)), O-methylphosphoroamidite linkages (see Eckstein, Oligonucleotides and Analogues:
A Practical Approach, Oxford University Press), and peptide nucleic acid backbones and
linkages (see Egholm, J. Am. Chem. Soc. 114:1895 (1992); Meier et al., Chem. Int. Ed. Engl.
31:1008 (1992); Nielsen, Nature, 365:566 (1993); Carlsson et al., Nature 380:207 (1996), all
of which are incorporated by reference). Other analog nucleic acids include those with
positive backbones (Dempcy et al., Proc. Natl. Acad. Sci. USA 92:6097 (1995)); non-ionic
backbones (U.S. Patent Nos. 5,386,023, 5,637,684, 5,602,240, 5,216,141 and 4,469,863;
Kiedrowski et al., Angew. Chem. Intl. Ed. English 30:423 (1991); Letsinger et al., J. Am.
Chem. Soc. 110:4470 (1988); Letsinger et al., Nucleoside & Nucleotide 13:1597 (1994);
Chapters 2 and 3, ACS Symposium Series 580, "Carbohydrate Modifications in Antisense
Research", Ed. Y.S. Sanghvi and P. Dan Cook; Mesmaeker et al., Bioorganic & Medicinal
Chem. Lett. 4:395 (1994); Jeffs et al., J. Biomolecular NMR 34:17 (1994); Tetrahedron Lett.
37:743 (1996)) and non-ribose backbones, including those described in U.S. Patent Nos.
5,235,033 and 5,034,506, and Chapters 6 and 7, ACS Symposium Series 580, "Carbohydrate
Modifications in Antisense Research", Ed. Y.S. Sanghvi and P. Dan Cook. Nucleic acids
containing one or more carbocyclic sugars are also included within one definition of nucleic
acids (see Jenkins et al., Chem. Soc. Rev. (1995) pp 169-176). Several nucleic acid analogs
are described in Rawls, C&E News June 2, 1997 page 35. All of these references are hereby
expressly incorporated by reference.

Particularly preferred are peptide nucleic acids (PNA), which include peptide
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in
contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids.
This results in two advantages. First, the PNA backbone exhibits improved hybridization
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in Tm for an
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly,
due to their non-ionic nature, hybridization of the bases attached to these backbones is
relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular
enzymes, and thus can be more stable.

The nucleic acids may be single stranded or double stranded, as specified, or
contain portions of both double stranded or single stranded sequence. As will be appreciated
by those in the art, the depiction of a single strand also defines the sequence of the
complementary strand; thus the sequences described herein also provide the complement of
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid,
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine,
xanthine, hypoxanthine, isocytosine, isoguanine, etc. "Transcript" typically refers to a
naturally occurring RNA, e.g., a pre-mRNA, hnRNA, or mRNA. As used herein, the term
"nucleoside" includes nucleotides and nucleoside and nucleotide analogs, and modified
nucleosides such as amino modified nucleosides. In addition, "nucleoside" includes non-naturally
occurring analog structures. Thus, e.g., the individual units of a peptide nucleic
acid, each containing a base, are referred to herein as a nucleoside.
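
To illustrate how the depiction of a single strand also defines its complementary strand, the
following Python sketch derives the reverse complement of a DNA sequence. The function name
reverse_complement is hypothetical, and the sketch assumes a sequence containing only the four
standard DNA bases A, C, G and T; modified or ambiguous bases are not handled.

```python
# Translation table mapping each standard DNA base to its complement.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the complementary strand of `seq`, written 5' to 3'."""
    return seq.upper().translate(_DNA_COMPLEMENT)[::-1]


if __name__ == "__main__":
    strand = "ATGGCC"
    print(strand, reverse_complement(strand))
```

Applying the function twice returns the original strand, which reflects that either strand is
sufficient to specify the double stranded sequence.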

A "label" or a "detectable moiety" is a composition detectable by
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical
means. For example, useful labels include 32P, fluorescent dyes, electron-dense reagents,
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins
or other entities which can be made detectable, e.g., by incorporating a radiolabel into the
peptide or used to detect antibodies specifically reactive with the peptide. The labels may be
incorporated into the breast cancer nucleic acids, proteins and antibodies at any position.
Any method known in the art for conjugating the antibody to the label may be employed,
including those methods described by Hunter et al., Nature, 144:945 (1962); David et al.,
Biochemistry, 13:1014 (1974); Pain et al., J. Immunol. Meth., 40:219 (1981); and Nygren, J.
Histochem. and Cytochem., 30:407 (1982).

An "effector" or "effector moiety" or "effector component" is a molecule that
is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody.
The "effector" can be a variety of molecules including, e.g., detection moieties including
radioactive compounds, fluorescent compounds, an enzyme or substrate, tags such as epitope
tags, a toxin; activatable moieties, a chemotherapeutic agent; a lipase; an antibiotic; or a
radioisotope emitting "hard" e.g., beta radiation.

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be
detected by detecting the presence of the label bound to the probe. Alternatively, methods
using high affinity interactions may achieve the same results where one of a pair of binding
partners binds to the other, e.g., biotin, streptavidin.

As used herein a "nucleic acid probe or oligonucleotide" is defined as a
nucleic acid capable of binding to a target nucleic acid of complementary sequence through
one or more types of chemical bonds, usually through complementary base pairing, usually
through hydrogen bond formation. As used herein, a probe may include natural (i.e., A, G, C,
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe
may be joined by a linkage other than a phosphodiester bond, so long as it does not
functionally interfere with hybridization. Thus, e.g., probes may be peptide nucleic acids in
which the constituent bases are joined by peptide bonds rather than phosphodiester linkages.
It will be understood by one of skill in the art that probes may bind target sequences lacking
complete complementarity with the probe sequence depending upon the stringency of the
hybridization conditions. The probes are preferably directly labeled as with isotopes,
chromophores, lumiphores, chromogens, or indirectly labeled such as with biotin to which a
streptavidin complex may later bind. By assaying for the presence or absence of the probe,
one can detect the presence or absence of the select sequence or subsequence. Diagnosis or
prognosis may be based at the genomic level, or at the level of RNA or protein expression.

The term "recombinant" when used with reference, e.g., to a cell, or nucleic
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been
modified by the introduction of a heterologous nucleic acid or protein or the alteration of a
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, e.g.,
recombinant cells express genes that are not found within the native (non-recombinant) form
of the cell or express native genes that are otherwise abnormally expressed, under expressed
or not expressed at all. By the term "recombinant nucleic acid" herein is meant nucleic acid,
originally formed in vitro, in general, by the manipulation of nucleic acid, e.g., using
polymerases and endonucleases, in a form not normally found in nature. In this manner,
operable linkage of different sequences is achieved. Thus an isolated nucleic acid, in a linear
form, or an expression vector formed in vitro by ligating DNA molecules that are not
normally joined, are both considered recombinant for the purposes of this invention. It is
understood that once a recombinant nucleic acid is made and reintroduced into a host cell or
organism, it will replicate non-recombinantly, i.e., using the in vivo cellular machinery of the
host cell rather than in vitro manipulations; however, such nucleic acids, once produced
recombinantly, although subsequently replicated non-recombinantly, are still considered
recombinant for the purposes of the invention. Similarly, a "recombinant protein" is a protein
made using recombinant techniques, i.e., through the expression of a recombinant nucleic
acid as depicted above.

The term "heterologous" when used with reference to portions of a nucleic
acid indicates that the nucleic acid comprises two or more subsequences that are not normally
found in the same relationship to each other in nature. For instance, the nucleic acid is
typically recombinantly produced, having two or more sequences, e.g., from unrelated genes
arranged to make a new functional nucleic acid, e.g., a promoter from one source and a
coding region from another source. Similarly, a heterologous protein will often refer to two
or more subsequences that are not found in the same relationship to each other in nature (e.g.,
a fusion protein).

A "promoter" is defined as an array of nucleic acid control sequences that
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic
acid sequences near the start site of transcription, such as, in the case of a polymerase II type
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor
elements, which can be located as much as several thousand base pairs from the start site of
transcription. A "constitutive" promoter is a promoter that is active under most
environmental and developmental conditions. An "inducible" promoter is a promoter that is
active under environmental or developmental regulation. The term "operably linked" refers
to a functional linkage between a nucleic acid expression control sequence (such as a
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence,
wherein the expression control sequence directs transcription of the nucleic acid
corresponding to the second sequence.

An "expression vector" is a nucleic acid construct, generated recombinantly or
synthetically, with a series of specified nucleic acid elements that permit transcription of a
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be
transcribed operably linked to a promoter.

The phrase "selectively (or specifically) hybridizes to" refers to the binding,
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under
stringent hybridization conditions when that sequence is present in a complex mixture (e.g.,
total cellular or library DNA or RNA).

The phrase "stringent hybridization conditions" refers to conditions under
which a probe will hybridize to its target subsequence, typically in a complex mixture of
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and
will be different in different circumstances. Longer sequences hybridize specifically at
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in
Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays"
(1993). Generally, stringent conditions are selected to be about 5-10°C lower than the
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The Tm
is the temperature (under defined ionic strength, pH, and nucleic acid concentration) at which
50% of the probes complementary to the target hybridize to the target sequence at equilibrium
(as the target sequences are present in excess, at Tm, 50% of the probes are occupied at
equilibrium). Stringent conditions will be those in which the salt concentration is less than
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to
50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides).
Stringent conditions may also be achieved with the addition of destabilizing agents such as
formamide. For selective or specific hybridization, a positive signal is at least two times
background, preferably 10 times background hybridization. Exemplary stringent
hybridization conditions can be as follows: 50% formamide, 5x SSC, and 1% SDS,
incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC and
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency
amplification, although annealing temperatures may vary between about 32°C and 48°C
depending on primer length. For high stringency PCR amplification, a temperature of about
62°C is typical, although high stringency annealing temperatures can range from about 50°C
to about 65°C, depending on the primer length and specificity. Typical cycle conditions for
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for
30 sec. - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification
reactions are provided, e.g., in Innis et al. (1990) PCR Protocols, A Guide to Methods and
Applications, Academic Press, Inc. N.Y.
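
Two of the numerical criteria recited above, the selection of a hybridization temperature about
5-10°C below the Tm and the requirement that a positive signal be at least two times background,
can be expressed as the following Python sketch. The function names and the example values are
hypothetical, and the Tm is assumed to have been determined separately under the defined ionic
strength and pH; the sketch does not calculate a Tm.

```python
def stringent_temperature_range(tm_celsius):
    """Return the (low, high) temperature window in degrees C that lies
    10 to 5 degrees below the supplied melting temperature."""
    return (tm_celsius - 10.0, tm_celsius - 5.0)


def is_positive_signal(signal, background, fold=2.0):
    """True if the signal is at least `fold` times background.
    fold=2.0 is the minimum recited above; fold=10.0 is the preferred level."""
    return signal >= fold * background


if __name__ == "__main__":
    print(stringent_temperature_range(70.0))
    print(is_positive_signal(250, 100), is_positive_signal(250, 100, fold=10.0))
```

In this example a signal of 250 over a background of 100 satisfies the two-fold criterion but
not the preferred ten-fold criterion.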

Nucleic acids that do not hybridize to each other under stringent conditions are
still substantially identical if the polypeptides which they encode are substantially identical.
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize
under moderately stringent hybridization conditions. Exemplary "moderately stringent
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl,
1% SDS at 37°C, and a wash in 1X SSC at 45°C. A positive hybridization is at least twice
background. Those of ordinary skill will readily recognize that alternative hybridization and
wash conditions can be utilized to provide conditions of similar stringency. Additional
guidelines for determining hybridization parameters are provided in numerous references, e.g.,
Current Protocols in Molecular Biology, ed. Ausubel, et al.

The phrase "functional effects" in the context of assays for testing compounds
that modulate activity of a breast cancer protein includes the determination of a parameter
that is indirectly or directly under the influence of the breast cancer protein or nucleic acid,
e.g., a functional, physical, or chemical effect, such as the ability to decrease breast cancer. It
includes ligand binding activity; cell growth on soft agar; anchorage dependence; contact
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of breast cancer cells. "Functional effects" include in
vitro, in vivo, and ex vivo activities.

By "determining the functional effect" is meant assaying for a compound that
increases or decreases a parameter that is indirectly or directly under the influence of a breast
cancer protein sequence, e.g., functional, enzymatic, physical and chemical effects. Such
functional effects can be measured by any means known to those skilled in the art, e.g.,
changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index),
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein,
measuring inducible markers or transcriptional activation of the breast cancer protein;
measuring binding activity or binding assays, e.g., binding to antibodies or other ligands, and
measuring cellular proliferation. Determination of the functional effect of a compound on
breast cancer can also be performed using breast cancer assays known to those of skill in the
art such as in vitro assays, e.g., cell growth on soft agar; anchorage dependence; contact
inhibition and density limitation of growth; cellular proliferation; cellular transformation;
growth factor or serum dependence; tumor specific marker levels; invasiveness into Matrigel;
tumor growth and metastasis in vivo; mRNA and protein expression in cells undergoing
metastasis, and other characteristics of breast cancer cells. The functional effects can be
evaluated by many means known to those skilled in the art, e.g., microscopy for quantitative
or qualitative measures of alterations in morphological features, measurement of changes in
RNA or protein levels for breast cancer-associated sequences, measurement of RNA stability,
identification of downstream or reporter gene expression (CAT, luciferase, β-gal, GFP and
the like), e.g., via chemiluminescence, fluorescence, colorimetric reactions, antibody binding,
inducible markers, and ligand binding assays.

"Inhibitors", "activators", and "modulators" of breast cancer polynucleotide
and polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules
or compounds identified using in vitro and in vivo assays of breast cancer polynucleotide and
polypeptide sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block
activity, decrease, prevent, delay activation, inactivate, desensitize, or down regulate the
activity or expression of breast cancer proteins, e.g., antagonists. Antisense nucleic acids
may seem to inhibit expression and subsequent function of the protein. "Activators" are
compounds that increase, open, activate, facilitate, enhance activation, sensitize, agonize, or
up regulate breast cancer protein activity. Inhibitors, activators, or modulators also include
genetically modified versions of breast cancer proteins, e.g., versions with altered activity, as
well as naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small
chemical molecules and the like. Such assays for inhibitors and activators include, e.g.,
expressing the breast cancer protein in vitro, in cells, or cell membranes, applying putative
modulator compounds, and then determining the functional effects on activity, as described
above. Activators and inhibitors of breast cancer can also be identified by incubating breast
cancer cells with the test compound and determining increases or decreases in the expression
of 1 or more breast cancer proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more breast
cancer proteins, such as breast cancer proteins encoded by the sequences set out in Tables
1-25.

Samples or assays comprising breast cancer proteins that are treated with a
potential activator, inhibitor, or modulator are compared to control samples without the
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition
of a polypeptide is achieved when the activity value relative to the control is about 80%,
preferably 50%, more preferably 25-0%. Activation of a breast cancer polypeptide is
achieved when the activity value relative to the control (untreated with activators) is 110%,
more preferably 150%, more preferably 200-500% (i.e., two to five fold higher relative to the
control), more preferably 1000-3000% higher.
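
The comparison to control described above can be sketched in Python as follows. The function
names are hypothetical, and the reading of "about 80%" as 80% or less and of "110%" as 110% or
more is an assumption made for this illustration only.

```python
def relative_activity(treated, control):
    """Activity of a treated sample as a percentage of the untreated
    control, which is assigned a value of 100%."""
    return 100.0 * treated / control


def classify_modulation(percent_of_control,
                        inhibition_max=80.0, activation_min=110.0):
    """Classify a relative activity value.

    The thresholds are the least stringent values recited above; passing
    e.g. inhibition_max=50.0 or activation_min=150.0 applies the
    preferred levels instead.
    """
    if percent_of_control <= inhibition_max:
        return "inhibition"
    if percent_of_control >= activation_min:
        return "activation"
    return "no call"


if __name__ == "__main__":
    for treated in (40, 95, 150):
        pct = relative_activity(treated, 100)
        print(pct, classify_modulation(pct))
```

Values between the two thresholds are reported as "no call" because the text does not assign
them to either category.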

The phrase "changes in cell growth" refers to any change in cell growth and
proliferation characteristics in vitro or in vivo, such as formation of foci, anchorage
independence, semi-solid or soft agar growth, changes in contact inhibition and density
limitation of growth, loss of growth factor or serum requirements, changes in cell
morphology, gaining or losing immortalization, gaining or losing tumor specific markers,
ability to form or suppress tumors when injected into suitable animal hosts, and/or
immortalization of the cell. See, e.g., Freshney, Culture of Animal Cells: A Manual of Basic
Technique pp. 231-241 (3rd ed. 1994).

"Tumor cell" refers to precancerous, cancerous, and normal cells in a tumor.
"Cancer cells," "transformed" cells or "transformation" in tissue culture, refers
to spontaneous or induced phenotypic changes that do not necessarily involve the uptake of
new genetic material. Although transformation can arise from infection with a transforming
virus and incorporation of new genomic DNA, or uptake of exogenous DNA, it can also arise
spontaneously or following exposure to a carcinogen, thereby mutating an endogenous gene.
Transformation is associated with phenotypic changes, such as immortalization of cells,
aberrant growth control, nonmorphological changes, and/or malignancy (see, Freshney,
Culture of Animal Cells: A Manual of Basic Technique (3rd ed. 1994)).

"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes, as well as the myriad immunoglobulin variable region
genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG,
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody or
its functional equivalent will be most critical in specificity and affinity of binding. See Paul,
Fundamental Immunology.

An exemplary immunoglobulin (antibody) structural unit comprises a
tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus
of each chain defines a variable region of about 100 to 110 or more amino acids primarily
responsible for antigen recognition. The terms variable light chain (VL) and variable heavy
chain (VH) refer to these light and heavy chains respectively.

Antibodies exist, e.g., as intact immunoglobulins or as a number of well-characterized
fragments produced by digestion with various peptidases. Thus, e.g., pepsin
digests an antibody below the disulfide linkages in the hinge region to produce F(ab)'2, a
dimer of Fab which itself is a light chain joined to VH-CH1 by a disulfide bond. The F(ab)'2
may be reduced under mild conditions to break the disulfide linkage in the hinge region,
thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is
essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed.
1993)). While various antibody fragments are defined in terms of the digestion of an intact
antibody, one of skill will appreciate that such fragments may be synthesized de novo either
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used
herein, also includes antibody fragments either produced by the modification of whole
antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et al., Nature
348:552-554 (1990)).

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal
antibodies, many techniques known in the art can be used (see, e.g., Kohler & Milstein,
Nature 256:495-497 (1975); Kozbor et al., Immunology Today 4:72 (1983); Cole et al., pp.
77-96 in Monoclonal Antibodies and Cancer Therapy (1985); Coligan, Current Protocols in
Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual (1988); and Goding,
Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). Techniques for the
production of single chain antibodies (U.S. Patent 4,946,778) can be adapted to produce
antibodies to polypeptides of this invention. Also, transgenic mice, or other organisms such
as other mammals, may be used to express humanized antibodies. Alternatively, phage
display technology can be used to identify antibodies and heteromeric Fab fragments that
specifically bind to selected antigens (see, e.g., McCafferty et al., Nature 348:552-554
(1990); Marks et al., Biotechnology 10:779-783 (1992)).

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site
(variable region) is linked to a constant region of a different or altered class, effector function
and/or species, or an entirely different molecule which confers new properties to the chimeric
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a
different or altered antigen specificity.

Identification of breast cancer-associated sequences

In one aspect, the expression levels of genes are determined in different
patient samples for which diagnosis information is desired, to provide expression profiles.
An expression profile of a particular sample is essentially a "fingerprint" of the state of the
sample; while two states may have any particular gene similarly expressed, the evaluation of
a number of genes simultaneously allows the generation of a gene expression profile that is
characteristic of the state of the cell. That is, normal tissue (e.g., normal breast or other
tissue) may be distinguished from cancerous or metastatic cancerous tissue of the breast, or
breast cancer tissue or metastatic breast cancerous tissue can be compared with tissue
samples of breast and other tissues from surviving cancer patients. By comparing expression
profiles of tissue in known different breast cancer states, information regarding which genes
are important (including both up- and down-regulation of genes) in each of these states is
obtained.

The identification of sequences that are differentially expressed in breast
cancer versus non-breast cancer tissue allows the use of this information in a number of ways.
For example, a particular treatment regime may be evaluated: does a chemotherapeutic drug
act to down-regulate breast cancer, and thus tumor growth or recurrence, in a particular
patient? Similarly, diagnosis and treatment outcomes may be determined or confirmed by
comparing patient samples with the known expression profiles. Metastatic tissue can also be
analyzed to determine the stage of breast cancer in the tissue. Furthermore, these gene
expression profiles (or individual genes) allow screening of drug candidates with an eye to
mimicking or altering a particular expression profile; e.g., screening can be done for drugs
that suppress the breast cancer expression profile. This may be done by making biochips
comprising sets of the important breast cancer genes, which can then be used in these screens.
These methods can also be done on the protein basis; that is, protein expression levels of the
breast cancer proteins can be evaluated for diagnostic purposes or to screen candidate agents.
In addition, the breast cancer nucleic acid sequences can be administered for gene therapy
purposes, including the administration of antisense nucleic acids, or the breast cancer proteins
(including antibodies and other modulators thereof) administered as therapeutic drugs.

Thus the present invention provides nucleic acid and protein sequences that
are differentially expressed in breast cancer, herein termed "breast cancer sequences." As
outlined below, breast cancer sequences include those that are up-regulated (i.e., expressed at
a higher level) in breast cancer, as well as those that are down-regulated (i.e., expressed at a
lower level). In a preferred embodiment, the breast cancer sequences are from humans;
however, as will be appreciated by those in the art, breast cancer sequences from other
organisms may be useful in animal models of disease and drug evaluation; thus, other breast
cancer sequences are provided, from vertebrates, including mammals, including rodents (rats,
mice, hamsters, guinea pigs, etc.), primates, farm animals (including sheep, goats, pigs, cows,
horses, etc.) and pets (e.g., dogs, cats, etc.). Breast cancer sequences from other organisms
may be obtained using the techniques outlined below.

Breast cancer sequences can include both nucleic acid and amino acid
sequences. As will be appreciated by those in the art and is more fully outlined below, breast
cancer nucleic acid sequences are useful in a variety of applications, including diagnostic
applications, which will detect naturally occurring nucleic acids, as well as screening
applications; e.g., biochips comprising nucleic acid probes or PCR microtiter plates with
selected probes to the breast cancer sequences can be generated.

A breast cancer sequence can be initially identified by substantial nucleic acid
and/or amino acid sequence homology to the breast cancer sequences outlined herein. Such
homology can be based upon the overall nucleic acid or amino acid sequence, and is
generally determined as outlined below, using either homology programs or hybridization
conditions.

For identifying breast cancer-associated sequences, the breast cancer screen
typically includes comparing genes identified in different tissues, e.g., normal and cancerous
tissues, or tumor tissue samples from patients who have metastatic disease vs. non-metastatic
tissue. Other suitable tissue comparisons include comparing breast cancer samples with
metastatic cancer samples from other cancers, such as lung, breast, gastrointestinal cancers,
ovarian, etc. Samples of different stages of breast cancer, e.g., survivor tissue, drug resistant
states, and tissue undergoing metastasis, are applied to biochips comprising nucleic acid
probes. The samples are first microdissected, if applicable, and treated as is known in the art
for the preparation of mRNA. Suitable biochips are commercially available, e.g., from
Affymetrix. Gene expression profiles as described herein are generated and the data
analyzed.

In one embodiment, the genes showing changes in expression as between
normal and disease states are compared to genes expressed in other normal tissues, preferably
normal breast, but also including, and not limited to lung, heart, brain, liver, breast, kidney,
muscle, colon, small intestine, large intestine, spleen, bone and placenta. In a preferred
embodiment, those genes identified during the breast cancer screen that are expressed in any
significant amount in other tissues are removed from the profile, although in some
embodiments, this is not necessary. That is, when screening for drugs, it is usually preferable
that the target be disease specific, to minimize possible side effects.
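
As a non-limiting sketch, the removal of genes that are expressed in any significant amount in other normal tissues might be carried out along the following lines. The numeric cutoff for "significant," the expression units, and the gene and tissue labels are hypothetical and are chosen only for illustration.

```python
def remove_broadly_expressed(candidates, normal_expression, max_level=100.0):
    """Keep candidate genes whose expression stays below max_level in every
    other normal tissue recorded for them.

    Genes with no normal-tissue data are kept, since nothing argues for
    removing them; a stricter screen might drop them instead.
    """
    kept = []
    for gene in candidates:
        levels = normal_expression.get(gene, {})
        if all(level < max_level for level in levels.values()):
            kept.append(gene)
    return kept


# Hypothetical expression values (arbitrary units), for illustration only.
normal_expression = {
    "geneA": {"lung": 12.0, "heart": 8.0, "liver": 15.0},
    "geneB": {"lung": 450.0, "heart": 20.0, "liver": 30.0},
}
specific = remove_broadly_expressed(["geneA", "geneB"], normal_expression)
```

Here `geneB` is dropped because of its high level in lung, while `geneA` remains in the profile.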

In a preferred embodiment, breast cancer sequences are those that are up-regulated
in breast cancer, that is, the expression of these genes is higher in the breast cancer
tissue as compared to non-cancerous tissue. "Up-regulation" as used herein often means at
least about a two-fold change, preferably at least about a three fold change, with at least about
five-fold or higher being preferred. All unigene cluster identification numbers and accession
numbers herein are for the GenBank sequence database and the sequences of the accession
numbers are hereby expressly incorporated by reference. GenBank is known in the art, see,
e.g., Benson, DA, et al., Nucleic Acids Research 26:1-7 (1998) and
http://www.ncbi.nlm.nih.gov/. Sequences are also available in other databases, e.g.,
European Molecular Biology Laboratory (EMBL) and DNA Database of Japan (DDBJ).
U.S. Patent Application No. 09/687,576, with the same assignee as the present application,
which further discloses related sequences, compositions, and methods of diagnosis and
treatment of breast cancer, is hereby expressly incorporated by reference.

In another preferred embodiment, breast cancer sequences are those that are
down-regulated in the breast cancer; that is, the expression of these genes is lower in breast
cancer tissue as compared to non-cancerous tissue (see, e.g., Tables 1, 2, 3, 15, 16, etc.).
"Down-regulation" as used herein often means at least about a two-fold change, preferably at
least about a three fold change, with at least about five-fold or higher being preferred.
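
For purposes of illustration, the fold-change criteria for up-regulation and down-regulation can be sketched as below. The function name, the use of a simple tumor-to-normal ratio, and the "unchanged" label are assumptions of this sketch; the two-fold default follows the text, and three-fold or five-fold may be substituted.

```python
def regulation_call(tumor_level, normal_level, min_fold=2.0):
    """Label a gene by the ratio of its expression in breast cancer tissue
    to its expression in non-cancerous tissue.

    A ratio of at least min_fold is called up-regulated; a ratio of at
    most 1/min_fold is called down-regulated.
    """
    if tumor_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    ratio = tumor_level / normal_level
    if ratio >= min_fold:
        return "up-regulated"
    if ratio <= 1.0 / min_fold:
        return "down-regulated"
    return "unchanged"
```

For example, `regulation_call(600.0, 200.0)` reflects a three-fold increase, whereas the same pair evaluated with `min_fold=5.0` would not meet the more stringent criterion.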

Informatics

The ability to identify genes that are over- or under-expressed in breast cancer
can additionally provide high-resolution, high-sensitivity datasets which can be used in the
areas of diagnostics, therapeutics, drug development, pharmacogenetics, protein structure,
biosensor development, and other related areas. For example, the expression profiles can be
used in diagnostic or prognostic evaluation of patients with breast cancer. Or as another
example, subcellular toxicological information can be generated to better direct drug structure
and activity correlation (see Anderson, Pharmaceutical Proteomics: Targets, Mechanism,
and Function, paper presented at the IBC Proteomics conference, Coronado, CA (June 11-12,
1998)). Subcellular toxicological information can also be utilized in a biological sensor
device to predict the likely toxicological effect of chemical exposures and likely tolerable
exposure thresholds (see U.S. Patent No. 5,811,231). Similar advantages accrue from
datasets relevant to other biomolecules and bioactive agents (e.g., nucleic acids, saccharides,
lipids, drugs, and the like).

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of assay data. The data contained in the database is acquired, e.g., 
using array analysis either singly or in a library format. The database can be in substantially 
any form in which data can be maintained and transmitted, but is preferably an electronic 
database. The electronic database of the invention can be maintained on any electronic
device allowing for the storage of and access to the database, such as a personal computer, 
but is preferably distributed on a wide area network, such as the World Wide Web. 

The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention.

The compositions and methods for identifying and/or quantitating the relative
and/or absolute abundance of a variety of molecular and macromolecular species from a
biological sample undergoing breast cancer, i.e., the identification of breast cancer-associated
sequences described herein, provide an abundance of information, which can be correlated
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring,
gene-disease causal linkages, identification of correlates of immunity and physiological
status, among others. Although the data generated from the assays of the invention is suited
for manual review and analysis, in a preferred embodiment, prior data processing using
high-speed computers is utilized.

An array of methods for indexing and retrieving biomolecular information is
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational
database system for storing biomolecular sequence information in a manner that allows
sequences to be catalogued and searched according to one or more protein function
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records
containing information in a format that allows a collection of partial-length DNA sequences
to be catalogued and searched according to association with one or more sequencing projects
for obtaining full-length sequences from the collection of partial length sequences. U.S.
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene
sequence similar to a sequence data item in a gene database based on the degree of similarity
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences
in computer databases by comparison of predicted mass spectra with experimentally-derived
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a
multi-dimensional database comprising a functionality for multi-dimensional data analysis
described as on-line analytical processing (OLAP), which entails the consolidation of
projected and actual data according to more than one consolidation path or dimension. U.S.
Patent 5,295,261 reports a hybrid database structure in which the fields of each database
record are divided into two classes, navigational and informational data, with navigational
fields stored in a hierarchical topological map which can be viewed as a tree structure or as
the merger of two or more such tree structures.

See also Mount et al., Bioinformatics (2001); Biological Sequence Analysis:
Probabilistic Models of Proteins and Nucleic Acids (Durbin et al., eds., 1999);
Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins (Baxevanis &
Ouellette eds., 1998); Rashidi & Buehler, Bioinformatics: Basic Applications in Biological
Science and Medicine (1999); Introduction to Computational Molecular Biology (Setubal et
al., eds., 1997); Bioinformatics: Methods and Protocols (Misener & Krawetz, eds., 2000);
Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach (Higgins &
Taylor, eds., 2000); Brown, Bioinformatics: A Biologist's Guide to Biocomputing and the
Internet (2001); Han & Kamber, Data Mining: Concepts and Techniques (2000); and
Waterman, Introduction to Computational Biology: Maps, Sequences, and Genomes (1995).

The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g., 
with data specifying the source of the target-containing sample from which each sequence
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing
sample is from a control tissue sample known to be free of pathological disorders. In a
variation, at least one of the sources is a known pathological tissue specimen, e.g., a
neoplastic lesion or another tissue specimen to be analyzed for breast cancer. In another
variation, the assay records cross-tabulate one or more of the following parameters for each
target species in a sample: (1) a unique identification code, which can include, e.g., a target
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species
present in the sample.
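
A minimal sketch of an assay record carrying the three parameters enumerated above, together with a cross-tabulation by sample source, is given below for illustration. The class name, field names, and example values are hypothetical and do not reflect any particular database schema of the invention.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class AssayRecord:
    target_id: str       # (1) unique identification code for the target species
    sample_source: str   # (2) source of the target-containing sample
    quantity: float      # (3) absolute or relative quantity in that sample


def cross_tabulate(records):
    """Arrange records as {target_id: {sample_source: quantity}}."""
    table = defaultdict(dict)
    for record in records:
        table[record.target_id][record.sample_source] = record.quantity
    return {target: dict(by_source) for target, by_source in table.items()}


# Hypothetical records, for illustration only.
records = [
    AssayRecord("T001", "normal breast", 1.2),
    AssayRecord("T001", "breast tumor", 8.4),
    AssayRecord("T002", "breast tumor", 0.3),
]
table = cross_tabulate(records)
```

With the records so arranged, the quantity of a given target in control and pathological specimens can be read side by side, e.g., `table["T001"]`.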

The invention also provides for the storage and retrieval of a collection of
target data in a computer data storage apparatus, which can include magnetic disks, optical
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM,
magnetic bubble memory devices, and other data storage devices, including CPU registers
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern
in an array of magnetic domains on a magnetizable medium or as an array of charge states or
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of
a transistor and a charge storage area, which may be on the transistor). In one embodiment,
the invention provides such storage devices, and computer systems built therewith,
comprising a bit pattern encoding a protein expression fingerprint record comprising unique
identifiers for at least 10 target data records cross-tabulated with target source.

When the target is a peptide or nucleic acid, the invention preferably provides
a method for identifying related peptide or nucleic acid sequences, comprising performing a
computerized comparison between a peptide or nucleic acid sequence assay record stored in
or retrieved from a computer storage device or database and at least one other sequence. The
comparison can include a sequence analysis or comparison algorithm or computer program
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences
determined from a polypeptide or nucleic acid sample of a specimen.
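
To illustrate the kind of computerized sequence comparison referred to above, the following sketch computes a global alignment score by dynamic programming in the manner of Needleman and Wunsch. It is not the implementation of any of the named programs; the match, mismatch, and linear gap scores are arbitrary assumptions chosen for the example.

```python
def global_alignment_score(seq_a, seq_b, match=1, mismatch=-1, gap=-2):
    """Return the best global alignment score of two sequences using a
    simple match/mismatch scheme and a linear gap penalty.

    Only the previous row of the scoring matrix is kept, so memory use
    grows with the length of seq_b rather than with both lengths.
    """
    previous = [j * gap for j in range(len(seq_b) + 1)]
    for i in range(1, len(seq_a) + 1):
        current = [i * gap]
        for j in range(1, len(seq_b) + 1):
            pair_score = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            current.append(max(
                previous[j - 1] + pair_score,  # align the two residues
                previous[j] + gap,             # gap in seq_b
                current[j - 1] + gap,          # gap in seq_a
            ))
        previous = current
    return previous[-1]
```

A stored sequence record can then be scored against each other sequence of interest, with higher scores indicating closer relatedness under the chosen scoring scheme.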

The invention also preferably provides a magnetic disk, such as an IBM-compatible
(DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format
(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.) floppy diskette or
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of
the invention in a file format suitable for retrieval and processing in a computerized sequence
analysis, comparison, or relative quantitation method.

The invention also provides a network, comprising a plurality of computing
devices linked via a data link, such as an Ethernet cable (coax or 10BaseT), telephone line,
ISDN line, wireless network, optical fiber, or other suitable signal transmission medium,
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM
cells) composing a bit pattern encoding data acquired from an assay of the invention.

The invention also provides a method for transmitting assay data that includes
generating an electronic signal on an electronic communications device, such as a modem,
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal
includes (in native or encrypted format) a bit pattern encoding data from an assay or a
database comprising a plurality of assay results obtained by the method of the invention.

In a preferred embodiment, the invention provides a computer system for
comparing a query target to a database containing an array of data structures, such as an assay
result obtained by the method of the invention, and ranking database targets based on the
degree of identity and gap weight to the target data. A central processor is preferably
initialized to load and execute the computer program for alignment and/or comparison of the
assay results. Data for a query target is entered into the central processor via an I/O device.
Execution of the computer program results in the central processor retrieving the assay data
from the data file, which comprises a binary description of an assay result.

The target data or record and the computer program can be transferred to
secondary memory, which is typically random access memory (e.g., DRAM, SRAM,
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the
same characteristic of the query target and results are output via an I/O device. For example,
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha,
PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or
public domain molecular biology software package (e.g., UWGCG Sequence Analysis
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory,
etc.); an I/O device can be a terminal comprising a video display and a keyboard, a modem,
an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or
other suitable I/O device.

The invention also preferably provides the use of a computer system, such as
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a
collection of peptide sequence specificity records obtained by the methods of the invention,
which may be stored in the computer; (3) a comparison target, such as a query target; and (4)
a program for alignment and comparison, typically with rank-ordering of comparison results
on the basis of computed similarity values.
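
The rank-ordering of comparison results on the basis of computed similarity values can be sketched, by way of example only, as follows. The similarity measure used here (the matching-block ratio of the Python standard library's difflib module) is a stand-in chosen for brevity and is not an alignment program of the kind named in the text; the record identifiers and sequences are hypothetical.

```python
from difflib import SequenceMatcher


def rank_targets(query, database):
    """Return (target_id, similarity) pairs sorted from most to least
    similar to the query, with similarity between 0.0 and 1.0."""
    scored = []
    for target_id, sequence in database.items():
        # autojunk=False disables difflib's heuristic for long inputs.
        matcher = SequenceMatcher(None, query, sequence, autojunk=False)
        scored.append((target_id, matcher.ratio()))
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored


# Hypothetical stored peptide records, for illustration only.
database = {
    "rec1": "GGGGGGGG",
    "rec2": "MKTAYLAK",
    "rec3": "MKTAYIAK",
}
ranked = rank_targets("MKTAYIAK", database)
```

In this example the record identical to the query is ranked first, followed by the record differing at a single position.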



Characteristics of breast cancer-associated proteins 

Breast cancer proteins of the present invention may be classified as secreted
proteins, transmembrane proteins or intracellular proteins. In one embodiment, the breast
cancer protein is an intracellular protein. Intracellular proteins may be found in the
cytoplasm and/or in the nucleus. Intracellular proteins are involved in all aspects of cellular
function and replication (including, e.g., signaling pathways); aberrant expression of such
proteins often results in unregulated or dysregulated cellular processes (see, e.g., Molecular
Biology of the Cell (Alberts, ed., 3rd ed., 1994)). For example, many intracellular proteins
have enzymatic activity such as protein kinase activity, protein phosphatase activity, protease
activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular proteins
also serve as docking proteins that are involved in organizing complexes of proteins, or
targeting proteins to various subcellular localizations, and are involved in maintaining the
structural integrity of organelles.

An increasingly appreciated concept in characterizing proteins is the presence
in the proteins of one or more motifs for which defined functions have been attributed. In
addition to the highly conserved sequences found in the enzymatic domain of proteins, highly
conserved sequences have been identified in proteins that are involved in protein-protein
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated
targets in a sequence dependent manner. PTB domains, which are distinct from SH2
domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich
targets. In addition, PH domains, tetratricopeptide repeats and WD domains, to name only a
few, have been shown to mediate protein-protein interactions. Some of these may also be
involved in binding to phospholipids or other second messengers. As will be appreciated by
one of ordinary skill in the art, these motifs can be identified on the basis of primary
sequence; thus, an analysis of the sequence of proteins may provide insight into both the
enzymatic potential of the molecule and/or molecules with which the protein may associate.
One useful database is Pfam (protein families), which is a large collection of multiple
sequence alignments and hidden Markov models covering many common protein domains.
Versions are available via the internet from Washington University in St. Louis, the Sanger
Center in England, and the Karolinska Institute in Sweden (see, e.g., Bateman et al., Nuc.
Acids Res. 28:263-266 (2000); Sonnhammer et al., Proteins 28:405-420 (1997); Bateman et
al., Nuc. Acids Res. 27:260-262 (1999); and Sonnhammer et al., Nuc. Acids Res. 26:320-322
(1998)).

In another embodiment, the breast cancer sequences are transmembrane
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell.
They may have an intracellular domain, an extracellular domain, or both. The intracellular
domains of such proteins may have a number of functions including those already described
for intracellular proteins. For example, the intracellular domain may have enzymatic activity
and/or may serve as a binding site for additional proteins. Frequently the intracellular
domain of transmembrane proteins serves both roles. For example, certain receptor tyrosine
kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation
of tyrosines on the receptor molecule itself creates binding sites for additional SH2 domain
containing proteins.

Transmembrane proteins may contain from one to many transmembrane
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor
guanylyl cyclases and receptor serine/threonine protein kinases contain a single
transmembrane domain. However, various other proteins including channels and adenylyl
cyclases contain numerous transmembrane domains. Many important cell surface receptors
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of
transmembrane domains include approximately 20 consecutive hydrophobic amino acids that
may be followed by charged amino acids. Therefore, upon analysis of the amino acid
sequence of a particular protein, the localization and number of transmembrane domains
within the protein may be predicted (see, e.g., PSORT web site http://psort.nibb.ac.jp/).
Important transmembrane protein receptors include, but are not limited to the insulin
receptor, insulin-like growth factor receptor, human growth hormone receptor, glucose
transporters, transferrin receptor, epidermal growth factor receptor, low density lipoprotein
receptor, epidermal growth factor receptor, leptin receptor, interleukin receptors, e.g., IL-1
receptor, IL-2 receptor, etc.
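
The hydrophobic-stretch characteristic noted above can be illustrated with a simple sliding-window hydropathy scan. This sketch is not the PSORT method; it assumes the Kyte-Doolittle hydropathy scale with a 19-residue window and an average-hydropathy cutoff of 1.6, which are conventional choices made here for illustration, and it expects only the twenty standard one-letter amino acid codes.

```python
# Kyte-Doolittle hydropathy values for the twenty standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def candidate_tm_segments(sequence, window=19, threshold=1.6):
    """Return (start, end) pairs (0-based, end exclusive) covering windows
    whose mean hydropathy meets the threshold; overlapping or adjacent
    windows are merged into a single candidate segment."""
    segments = []
    for start in range(len(sequence) - window + 1):
        residues = sequence[start:start + window]
        mean = sum(KYTE_DOOLITTLE[aa] for aa in residues) / window
        if mean >= threshold:
            end = start + window
            if segments and start <= segments[-1][1]:
                segments[-1] = (segments[-1][0], end)
            else:
                segments.append((start, end))
    return segments
```

The number of segments returned gives a rough estimate of the number of membrane-spanning regions, and their coordinates give an approximate localization; because windows that only partly overlap a hydrophobic stretch can still pass the cutoff, segment boundaries may extend a few residues beyond the stretch itself.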

The extracellular domains of transmembrane proteins are diverse; however,
conserved motifs are found repeatedly among various extracellular domains. Conserved
structure and/or functions have been ascribed to different extracellular motifs. Many
extracellular domains are involved in binding to other molecules. In one aspect, extracellular
domains are found on receptors. Factors that bind the receptor domain include circulating
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like.
For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions.
Cell-associated ligands can be tethered to the cell, e.g., via a glycosylphosphatidylinositol (GPI)
anchor, or may themselves be transmembrane proteins. Extracellular domains also associate
with the extracellular matrix and contribute to the maintenance of the cell structure.

Breast cancer proteins that are transmembrane are particularly preferred in the
present invention as they are readily accessible targets for immunotherapeutics, as are
described herein. In addition, as outlined below, transmembrane proteins can be also useful
in imaging modalities. Antibodies may be used to label such readily accessible proteins in
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are
typically permeabilized to provide access to intracellular proteins.

It will also be appreciated by those in the art that a transmembrane protein can
be made soluble by removing transmembrane sequences, e.g., through recombinant methods.
Furthermore, transmembrane proteins that have been made soluble can be made to be
secreted through recombinant means by adding an appropriate signal sequence.

In another embodiment, the breast cancer proteins are secreted proteins, the
secretion of which can be either constitutive or regulated. These proteins have a signal
peptide or signal sequence that targets the molecule to the secretory pathway. Secreted
proteins are involved in numerous physiological events; by virtue of their circulating nature,
they serve to transmit signals to various other cell types. The secreted protein may function in
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting
on cells at a distance). Thus secreted molecules find use in modulating or altering numerous
aspects of physiology. Breast cancer proteins that are secreted proteins are particularly
preferred in the present invention as they serve as good targets for diagnostic markers, e.g.,
for blood, plasma, serum, or stool tests.






Use of breast cancer nucleic acids 


As described above, a breast cancer sequence is initially identified by
substantial nucleic acid and/or amino acid sequence homology or linkage to the breast cancer
sequences outlined herein. Such homology can be based upon the overall nucleic acid or
amino acid sequence, and is generally determined as outlined below, using either homology
programs or hybridization conditions. Typically, linked sequences on an mRNA are found on
the same molecule.

The breast cancer nucleic acid sequences of the invention, e.g., the sequences
in Tables 1-25, can be fragments of larger genes, i.e., they are nucleic acid segments.
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding
and non-coding regions. Accordingly, as will be appreciated by those in the art, using the
sequences provided herein, extended sequences, in either direction, of the breast cancer genes
can be obtained, using techniques well known in the art for cloning either longer sequences or
the full length sequences; see Ausubel, et al., supra. Much can be done by informatics and
many sequences can be clustered to include multiple sequences corresponding to a single
gene, e.g., systems such as UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/).

Once the breast cancer nucleic acid is identified, it can be cloned and, if
necessary, its constituent parts recombined to form the entire breast cancer nucleic acid
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g.,
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid
segment, the recombinant breast cancer nucleic acid can be further used as a probe to identify
and isolate other breast cancer nucleic acids, e.g., extended coding regions. It can also be
used as a "precursor" nucleic acid to make modified or variant breast cancer nucleic acids
and proteins.

The breast cancer nucleic acids of the present invention are used in several
ways. In a first embodiment, nucleic acid probes to the breast cancer nucleic acids are made
and attached to biochips to be used in screening and diagnostic methods, as outlined below,
or for administration, e.g., for gene therapy, vaccine, and/or antisense applications.
Alternatively, the breast cancer nucleic acids that include coding regions of breast cancer
proteins can be put into expression vectors for the expression of breast cancer proteins, again
for screening purposes or for administration to a patient.

In a preferred embodiment, nucleic acid probes to breast cancer nucleic acids
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are
made. The nucleic acid probes attached to the biochip are designed to be substantially
complementary to the breast cancer nucleic acids, i.e., the target sequence (either the target
sequence of the sample or to other probe sequences, e.g., in sandwich assays), such that
hybridization of the target sequence and the probes of the present invention occurs. As
outlined below, this complementarity need not be perfect; there may be any number of base
pair mismatches which will interfere with hybridization between the target sequence and the
single stranded nucleic acids of the present invention. However, if the number of mutations
is so great that no hybridization can occur under even the least stringent of hybridization
conditions, the sequence is not a complementary target sequence. Thus, by "substantially
complementary" herein is meant that the probes are sufficiently complementary to the target
sequences to hybridize under normal reaction conditions, particularly high stringency
conditions, as outlined herein.
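
The idea of imperfect but sufficient complementarity can be illustrated with a short sketch. The Python below is not part of the described methods; it is a minimal illustration that assumes plain A/C/G/T strings written 5' to 3'. The function names and the `max_mismatches` cutoff are hypothetical, and a fixed mismatch count is only a crude stand-in for whether hybridization actually occurs under given stringency conditions.

```python
# Illustrative only: mismatch counting is a rough proxy for hybridization behaviour.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def count_mismatches(probe: str, target: str) -> int:
    """Count positions where the probe fails to pair with an equal-length target."""
    if len(probe) != len(target):
        raise ValueError("probe and target must have the same length")
    expected = reverse_complement(target)  # the perfectly complementary probe
    return sum(p != e for p, e in zip(probe.upper(), expected))


def is_substantially_complementary(probe: str, target: str, max_mismatches: int = 2) -> bool:
    """Hypothetical cutoff: tolerate a small number of mismatched base pairs."""
    return count_mismatches(probe, target) <= max_mismatches
```

A probe that differs from the perfect complement at one position would pass the default cutoff, while one that differs at most positions would not.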

A nucleic acid probe is generally single stranded but can be partially single
and partially double stranded. The strandedness of the probe is dictated by the structure,
composition, and properties of the target sequence. In general, the nucleic acid probes range
from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred,
and from about 30 to about 50 bases being particularly preferred. That is, generally whole
genes are not used. In some embodiments, much longer nucleic acids can be used, up to
hundreds of bases.

In a preferred embodiment, more than one probe per sequence is used, with
either overlapping probes or probes to different sections of the target being used. That is,
two, three, four or more probes, with three being preferred, are used to build in a redundancy
for a particular target. The probes can be overlapping (i.e., have some sequence in common),
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity.
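
As a rough illustration of building redundancy into a probe set, the following Python sketch picks several evenly spaced windows along a target sequence. It is an assumption-laden example, not the procedure of the invention: the function name `tile_probes`, the default probe length of 40 bases (chosen from within the 30 to 50 base range mentioned above), and the even-spacing rule are all hypothetical. The windows returned are target-sense subsequences; an actual probe would be designed to be complementary to them.

```python
def tile_probes(target: str, probe_len: int = 40, n_probes: int = 3) -> list:
    """Return (start, window) pairs spread evenly across the target sequence."""
    if probe_len <= 0 or n_probes <= 0:
        raise ValueError("probe_len and n_probes must be positive")
    if len(target) < probe_len:
        raise ValueError("target is shorter than the requested probe length")
    span = len(target) - probe_len  # largest valid start position
    if n_probes == 1:
        starts = [0]
    else:
        # Evenly spaced starts; duplicates collapse when the target is short.
        starts = sorted({(i * span) // (n_probes - 1) for i in range(n_probes)})
    return [(start, target[start:start + probe_len]) for start in starts]
```

With a 100-base target and the defaults, the windows start at positions 0, 30 and 60 and therefore overlap; with shorter probes or fewer probes the windows become separate.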

As will be appreciated by those in the art, nucleic acids can be attached or
immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical
equivalents herein is meant the association or binding between the nucleic acid probe and the
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and
removal as outlined below. The binding can typically be covalent or non-covalent. By
"non-covalent binding" and grammatical equivalents herein is meant one or more of electrostatic,
hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent
attachment of a molecule, such as streptavidin, to the support and the non-covalent binding of
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical
equivalents herein is meant that the two moieties, the solid support and the probe, are
attached by at least one bond, including sigma bonds, pi bonds and coordination bonds.
Covalent bonds can be formed directly between the probe and the solid support or can be
formed by a cross linker or by inclusion of a specific reactive group on either the solid
support or the probe or both molecules. Immobilization may also involve a combination of
covalent and non-covalent interactions.

In general, the probes are attached to the biochip in a wide variety of ways, as
will be appreciated by those in the art. As described herein, the nucleic acids can either be
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on
the biochip.

The biochip comprises a suitable solid substrate. By "substrate" or "solid
support" or other grammatical equivalents herein is meant a material that can be modified to
contain discrete individual sites appropriate for the attachment or association of the nucleic
acid probes and is amenable to at least one detection method. As will be appreciated by those
in the art, the possible substrates are very numerous, and include, but are not limited to,
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and
copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene,
polyurethanes, Teflon™, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or
silica-based materials including silicon and modified silicon, carbon, metals, inorganic glasses,
plastics, etc. In general, the substrates allow optical detection and do not appreciably
fluoresce. A preferred substrate is described in copending application entitled Reusable Low
Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 1999,
herein incorporated by reference in its entirety.

Generally the substrate is planar, although as will be appreciated by those in
the art, other configurations of substrates may be used as well. For example, the probes may
be placed on the inside surface of a tube, for flow-through sample analysis to minimize
sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including
closed cell foams made of particular plastics.

WO 02/059377 PCT/US02/02242

In a preferred embodiment, the surface of the biochip and the probe may be
derivatized with chemical functional groups for subsequent attachment of the two. Thus, e.g.,
the biochip is derivatized with a chemical functional group including, but not limited to,
amino groups, carboxy groups, oxo groups and thiol groups, with amino groups being
particularly preferred. Using these functional groups, the probes can be attached using
functional groups on the probes. For example, nucleic acids containing amino groups can be
attached to surfaces comprising amino groups, e.g., using linkers as are known in the art; e.g.,
homo- or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical Company
catalog, technical section on cross-linkers, pages 155-200). In addition, in some cases,
additional linkers, such as alkyl groups (including substituted and heteroalkyl groups) may be
used.

In this embodiment, oligonucleotides are synthesized as is known in the art,
and then attached to the surface of the solid support. As will be appreciated by those skilled
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may
be via an internal nucleoside.

In another embodiment, the immobilization to the solid support may be very
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which
bind to surfaces covalently coated with streptavidin, resulting in attachment.

Alternatively, the oligonucleotides may be synthesized on the surface, as is
known in the art. For example, photoactivation techniques utilizing photopolymerization
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be
synthesized in situ, using well known photolithographic techniques, such as those described
in WO 95/25116; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references
cited within, all of which are expressly incorporated by reference; these methods of
attachment form the basis of the Affymetrix GeneChip™ technology.

Often, amplification-based assays are performed to measure the expression
level of breast cancer-associated sequences. These assays are typically performed in
conjunction with reverse transcription. In such assays, a breast cancer-associated nucleic acid
sequence acts as a template in an amplification reaction (e.g., Polymerase Chain Reaction, or
PCR). In a quantitative amplification, the amount of amplification product will be
proportional to the amount of template in the original sample. Comparison to appropriate
controls provides a measure of the amount of breast cancer-associated RNA. Methods of
quantitative amplification are well known to those of skill in the art. Detailed protocols for
quantitative PCR are provided, e.g., in Innis et al., PCR Protocols, A Guide to Methods and
Applications (1990).
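
The proportionality between product and starting template can be made concrete with an idealized model. The sketch below is an assumption, not a protocol from the cited reference: it supposes that each cycle multiplies the product by (1 + efficiency), and it uses the notion of a threshold cycle (the cycle at which product reaches a fixed detection level), which the text does not name. Function and parameter names are hypothetical.

```python
def amplified_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized product amount after a number of cycles: N0 * (1 + E) ** cycles."""
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    return initial_copies * (1.0 + efficiency) ** cycles


def relative_template(ct_sample: float, ct_control: float, efficiency: float = 1.0) -> float:
    """Starting template in the sample relative to a control.

    Under the idealized model, a sample that reaches the detection level
    earlier than the control started with proportionally more template.
    """
    if not 0.0 < efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    return (1.0 + efficiency) ** (ct_control - ct_sample)
```

Under perfect doubling, a sample that crosses the detection level three cycles before the control is estimated to contain eight times as much starting template. Real reactions deviate from this model, which is one reason comparison to appropriate controls is needed.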

In some embodiments, a TaqMan based assay is used to measure expression.
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent
dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be
extended due to a blocking agent at the 3' end. When the PCR product is amplified in
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3'
quenching agent, thereby resulting in an increase in fluorescence as a function of
amplification (see, e.g., literature provided by Perkin-Elmer, e.g., www2.perkin-elmer.com).
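
Because fluorescence rises as a function of amplification, a per-cycle fluorescence trace can be reduced to a single number: the first cycle at which the signal clearly exceeds background. The following Python sketch shows one simple way this might be done. It is illustrative only; the baseline window, the threshold value, and the function name are assumptions, and instrument software typically uses more elaborate methods.

```python
def threshold_cycle(fluorescence, baseline_cycles: int = 5, threshold: float = 0.1):
    """Return the first 1-based cycle whose baseline-corrected signal reaches
    the threshold, or None if the trace never reaches it."""
    if baseline_cycles <= 0 or len(fluorescence) <= baseline_cycles:
        raise ValueError("trace must be longer than the baseline window")
    # Background is estimated as the mean of the earliest cycles.
    baseline = sum(fluorescence[:baseline_cycles]) / baseline_cycles
    for cycle, value in enumerate(fluorescence, start=1):
        if cycle > baseline_cycles and value - baseline >= threshold:
            return cycle
    return None
```

For a trace that stays flat for five cycles and then climbs, the function returns the first cycle after the baseline window at which the rise reaches the chosen threshold; a trace that never rises yields `None`.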

Other suitable amplification methods include, but are not limited to, ligase
chain reaction (LCR) (see Wu & Wallace, Genomics 4:560 (1989), Landegren et al., Science
241:1077 (1988), and Barringer et al., Gene 89:117 (1990)), transcription amplification
(Kwoh et al., Proc. Natl. Acad. Sci. USA 86:1173 (1989)), self-sustained sequence replication
(Guatelli et al., Proc. Natl. Acad. Sci. USA 87:1874 (1990)), dot PCR, and linker adapter PCR,
etc.

Expression of breast cancer proteins from nucleic acids 

In a preferred embodiment, breast cancer nucleic acids, e.g., encoding breast
cancer proteins, are used to make a variety of expression vectors to express breast cancer
proteins which can then be used in screening assays, as described below. Expression vectors
and recombinant DNA technology are well known to those of skill in the art (see, e.g.,
Ausubel, supra, and Gene Expression Systems (Fernandez & Hoeffler, eds, 1999)) and are
used to express proteins. The expression vectors may be either self-replicating
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these
expression vectors include transcriptional and translational regulatory nucleic acid operably
linked to the nucleic acid encoding the breast cancer protein. The term "control sequences"
refers to DNA sequences used for the expression of an operably linked coding sequence in a
particular host organism. Control sequences that are suitable for prokaryotes, e.g., include a
promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are
known to utilize promoters, polyadenylation signals, and enhancers.

Nucleic acid is "operably linked" when it is placed into a functional
relationship with another nucleic acid sequence. For example, DNA for a presequence or
secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site
is operably linked to a coding sequence if it is positioned so as to facilitate translation.
Generally, "operably linked" means that the DNA sequences being linked are contiguous,
and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers
do not have to be contiguous. Linking is typically accomplished by ligation at convenient
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are
used in accordance with conventional practice. Transcriptional and translational regulatory
nucleic acid will generally be appropriate to the host cell used to express the breast cancer
protein. Numerous types of appropriate expression vectors, and suitable regulatory
sequences are known in the art for a variety of host cells.

In general, transcriptional and translational regulatory sequences may include,
but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and
stop sequences, translational start and stop sequences, and enhancer or activator sequences.
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional
start and stop sequences.

Promoter sequences encode either constitutive or inducible promoters. The
promoters may be either naturally occurring promoters or hybrid promoters. Hybrid
promoters, which combine elements of more than one promoter, are also known in the art,
and are useful in the present invention.

In addition, an expression vector may comprise additional elements. For
example, the expression vector may have two replication systems, thus allowing it to be
maintained in two organisms, e.g., in mammalian or insect cells for expression and in a
prokaryotic host for cloning and amplification. Furthermore, for integrating expression
vectors, the expression vector contains at least one sequence homologous to the host cell
genome, and preferably two homologous sequences which flank the expression construct.
The integrating vector may be directed to a specific locus in the host cell by selecting the
appropriate homologous sequence for inclusion in the vector. Constructs for integrating
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra).

In addition, in a preferred embodiment, the expression vector contains a
selectable marker gene to allow the selection of transformed host cells. Selection genes are
well known in the art and will vary with the host cell used.

The breast cancer proteins of the present invention are produced by culturing a
host cell transformed with an expression vector containing nucleic acid encoding a breast
cancer protein, under the appropriate conditions to induce or cause expression of the breast
cancer protein. Conditions appropriate for breast cancer protein expression will vary with the
choice of the expression vector and the host cell, and will be easily ascertained by one skilled
in the art through routine experimentation or optimization. For example, the use of
constitutive promoters in the expression vector will require optimizing the growth and
proliferation of the host cell, while the use of an inducible promoter requires the appropriate
growth conditions for induction. In addition, in some embodiments, the timing of the harvest
is important. For example, the baculoviral systems used in insect cell expression are lytic
viruses, and thus harvest time selection can be crucial for product yield.

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect
and animal cells, including mammalian cells. Of particular interest are Saccharomyces
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells,
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines.

In a preferred embodiment, the breast cancer proteins are expressed in
mammalian cells. Mammalian expression systems are also known in the art, and include
retroviral and adenoviral systems. One expression vector system is a retroviral vector system
such as is generally described in PCT/US97/01019 and PCT/US97/01048, both of which are
hereby expressly incorporated by reference. Of particular use as mammalian promoters are
the promoters from mammalian viral genes, since the viral genes are often highly expressed
and have a broad host range. Examples include the SV40 early promoter, mouse mammary
tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter,
and the CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription
termination and polyadenylation sequences recognized by mammalian cells are regulatory
regions located 3' to the translation stop codon and thus, together with the promoter elements,
flank the coding sequence. Examples of transcription terminator and polyadenylation signals
include those derived from SV40.

The methods of introducing exogenous nucleic acid into mammalian hosts, as
well as other hosts, are well known in the art, and will vary with the host cell used.
Techniques include dextran-mediated transfection, calcium phosphate precipitation,
polybrene mediated transfection, protoplast fusion, electroporation, viral infection,
encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA
into nuclei.

In a preferred embodiment, breast cancer proteins are expressed in bacterial
systems. Bacterial expression systems are well known in the art. Promoters from
bacteriophage may also be used and are known in the art. In addition, synthetic promoters
and hybrid promoters are also useful; e.g., the tac promoter is a hybrid of the trp and lac
promoter sequences. Furthermore, a bacterial promoter can include naturally occurring
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome
binding site is desirable. The expression vector may also include a signal peptide sequence
that provides for secretion of the breast cancer protein in bacteria. The protein is either
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial
expression vector may also include a selectable marker gene to allow for the selection of
bacterial strains that have been transformed. Suitable selection genes include genes which
render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin,
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes,
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These
components are assembled into expression vectors. Expression vectors for bacteria are well
known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris,
and Streptomyces lividans, among others (e.g., Fernandez & Hoeffler, supra). The bacterial
expression vectors are transformed into bacterial host cells using techniques well known in
the art, such as calcium chloride treatment, electroporation, and others.

In one embodiment, breast cancer proteins are produced in insect cells.
Expression vectors for the transformation of insect cells, and in particular, baculovirus-based
expression vectors, are well known in the art.

In a preferred embodiment, breast cancer protein is produced in yeast cells.
Yeast expression systems are well known in the art, and include expression vectors for
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha,
Kluyveromyces fragilis and K. lactis, Pichia guilliermondii and P. pastoris,
Schizosaccharomyces pombe, and Yarrowia lipolytica.

The breast cancer protein may also be made as a fusion protein, using
techniques well known in the art. Thus, e.g., for the creation of monoclonal antibodies, if the
desired epitope is small, the breast cancer protein may be fused to a carrier protein to form an
immunogen. Alternatively, the breast cancer protein may be made as a fusion protein to
increase expression, or for other reasons. For example, when the breast cancer protein is a
breast cancer peptide, the nucleic acid encoding the peptide may be linked to other nucleic
acid for expression purposes.

In a preferred embodiment, the breast cancer protein is purified or isolated
after expression. Breast cancer proteins may be isolated or purified in a variety of ways
known to those skilled in the art depending on what other components are present in the
sample. Standard purification methods include electrophoretic, molecular, immunological
and chromatographic techniques, including ion exchange, hydrophobic, affinity, and
reverse-phase HPLC chromatography, and chromatofocusing. For example, the breast cancer protein
may be purified using a standard anti-breast cancer protein antibody column. Ultrafiltration
and diafiltration techniques, in conjunction with protein concentration, are also useful. For
general guidance in suitable purification techniques, see Scopes, Protein Purification (1982).
The degree of purification necessary will vary depending on the use of the breast cancer
protein. In some instances no purification will be necessary.

Once expressed and purified if necessary, the breast cancer proteins and
nucleic acids are useful in a number of applications. They may be used as immunoselection
reagents, as vaccine reagents, as screening agents, etc.

Variants of breast cancer proteins 
In one embodiment, the breast cancer proteins are derivative or variant breast
cancer proteins as compared to the wild-type sequence. That is, as outlined more fully below,
the derivative breast cancer peptide will often contain at least one amino acid substitution,
deletion or insertion, with amino acid substitutions being particularly preferred. The amino
acid substitution, insertion or deletion may occur at any residue within the breast cancer
peptide.

Also included within one embodiment of breast cancer proteins of the present
invention are amino acid sequence variants. These variants typically fall into one or more of
three classes: substitutional, insertional or deletional variants. These variants ordinarily are
prepared by site specific mutagenesis of nucleotides in the DNA encoding the breast cancer
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell
culture as outlined above. However, variant breast cancer protein fragments having up to
about 100-150 residues may be prepared by in vitro synthesis using established techniques.
Amino acid sequence variants are characterized by the predetermined nature of the variation,
a feature that sets them apart from naturally occurring allelic or interspecies variation of the
breast cancer protein amino acid sequence. The variants typically exhibit the same
qualitative biological activity as the naturally occurring analogue, although variants can also
be selected which have modified characteristics as will be more fully outlined below.

While the site or region for introducing an amino acid sequence variation is
predetermined, the mutation per se need not be predetermined. For example, in order to
optimize the performance of a mutation at a given site, random mutagenesis may be
conducted at the target codon or region and the expressed breast cancer variants screened for
the optimal combination of desired activity. Techniques for making substitution mutations at
predetermined sites in DNA having a known sequence are well known, e.g., M13 primer
mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of breast
cancer protein activities.
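
To illustrate what varying a single target codon produces, the following Python sketch enumerates every possible codon at one position of a coding sequence and groups the resulting DNA variants by the protein they encode. Note that this is exhaustive enumeration rather than random mutagenesis, and it is only a computational illustration, not a laboratory method. The function names are hypothetical, the standard genetic code is assumed, and stop codons are shown as `*`.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon (no stop-codon truncation)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def saturate_codon(dna: str, codon_index: int) -> dict:
    """Map each encoded protein to the DNA variants that differ only at the
    given 0-based codon position."""
    dna = dna.upper()
    start = 3 * codon_index
    if codon_index < 0 or start + 3 > len(dna):
        raise ValueError("codon_index is outside the sequence")
    variants = {}
    for codon in CODON_TABLE:
        variant = dna[:start] + codon + dna[start + 3:]
        variants.setdefault(translate(variant), []).append(variant)
    return variants
```

For a three-codon sequence, varying the middle codon gives 64 DNA variants that collapse to 21 distinct translations (20 amino acids plus a stop), which is the pool that a screen for desired activity would then sample.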

Amino acid substitutions are typically of single residues; insertions usually
will be on the order of from about 1 to 20 amino acids, although considerably larger
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in
some cases deletions may be much larger.

Substitutions, deletions, insertions or any combination thereof may be used to
arrive at a final derivative. Generally these changes are done on a few amino acids to
minimize the alteration of the molecule. However, larger changes may be tolerated in certain
circumstances. When small alterations in the characteristics of the breast cancer protein are
desired, substitutions are generally made in accordance with the amino acid substitution
relationships provided in the definition section.

The variants typically exhibit the same qualitative biological activity and will
elicit the same immune response as the naturally-occurring analog, although variants also are
selected to modify the characteristics of the breast cancer proteins as needed. Alternatively,
the variant may be designed such that the biological activity of the breast cancer protein is
altered. For example, glycosylation sites may be altered or removed.

Substantial changes in function or immunological identity are made by
selecting substitutions that are less conservative than those described above. For example,
substitutions may be made which more significantly affect: the structure of the polypeptide
backbone in the area of the alteration, for example the alpha-helical or beta-sheet structure;
the charge or hydrophobicity of the molecule at the target site; or the bulk of the side chain.
The substitutions which in general are expected to produce the greatest changes in the
polypeptide's properties are those in which (a) a hydrophilic residue, e.g., seryl or threonyl, is
substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl, valyl or
alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue
having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by)
an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side
chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine.
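
The four classes (a) to (d) above can be expressed as a simple lookup. The Python sketch below is illustrative only: the residue sets contain just the example residues named in the text (given as one-letter codes), so they are an assumption rather than a complete classification, and the function name is hypothetical.

```python
# Residue sets limited to the examples named in the text (one-letter codes).
HYDROPHILIC = set("ST")      # seryl, threonyl
HYDROPHOBIC = set("LIFVA")   # leucyl, isoleucyl, phenylalanyl, valyl, alanyl
POSITIVE = set("KRH")        # lysyl, arginyl, histidyl
NEGATIVE = set("ED")         # glutamyl, aspartyl
BULKY = set("F")             # phenylalanine
NO_SIDE_CHAIN = set("G")     # glycine
CYS_PRO = set("CP")          # cysteine, proline


def _between(a: str, b: str, first: set, second: set) -> bool:
    """True if one residue is in `first` and the other in `second`, either way round."""
    return (a in first and b in second) or (a in second and b in first)


def disruptive_classes(original: str, replacement: str) -> list:
    """Return which of the classes (a)-(d) a single-residue substitution falls into."""
    a, b = original.upper(), replacement.upper()
    if len(a) != 1 or len(b) != 1:
        raise ValueError("residues must be single one-letter codes")
    if a == b:
        return []
    hits = []
    if _between(a, b, HYDROPHILIC, HYDROPHOBIC):
        hits.append("a")
    if a in CYS_PRO or b in CYS_PRO:
        hits.append("b")
    if _between(a, b, POSITIVE, NEGATIVE):
        hits.append("c")
    if _between(a, b, BULKY, NO_SIDE_CHAIN):
        hits.append("d")
    return hits
```

An empty result means only that the substitution does not match any of the listed examples; it does not establish that the change is conservative.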

Covalent modifications of breast cancer polypeptides are included within the
scope of this invention. One type of covalent modification includes reacting targeted amino
acid residues of a breast cancer polypeptide with an organic derivatizing agent that is capable
of reacting with selected side chains or the N- or C-terminal residues of a breast cancer
polypeptide. Derivatization with bifunctional agents is useful, for instance, for crosslinking
breast cancer polypeptides to a water-insoluble support matrix or surface for use in the
method for purifying anti-breast cancer polypeptide antibodies or screening assays, as is more
fully described below. Commonly used crosslinking agents include, e.g.,
1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, e.g., esters
with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl esters
such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as
bis-N-maleimido-1,8-octane and agents such as methyl-3-((p-azidophenyl)dithio)propioimidate.

Other modifications include deamidation of glutaminyl and asparaginyl
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues,
methylation of the amino groups of the lysine, arginine, and histidine side chains (Creighton,
Proteins: Structure and Molecular Properties, pp. 79-86 (1983)), acetylation of the
N-terminal amine, and amidation of any C-terminal carboxyl group.

Another type of covalent modification of the breast cancer polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 
of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes
herein to mean deleting one or more carbohydrate moieties found in native sequence breast
cancer polypeptide, and/or adding one or more glycosylation sites that are not present in the
native sequence breast cancer polypeptide. Glycosylation patterns can be altered in many
ways. For example, the use of different cell types to express breast cancer-associated
sequences can result in different glycosylation patterns.

Addition of glycosylation sites to breast cancer polypeptides may also be
accomplished by altering the amino acid sequence thereof. The alteration may be made, e.g.,
by the addition of, or substitution by, one or more serine or threonine residues to the native
sequence breast cancer polypeptide (for O-linked glycosylation sites). The breast cancer
amino acid sequence may optionally be altered through changes at the DNA level,
particularly by mutating the DNA encoding the breast cancer polypeptide at preselected bases
such that codons are generated that will translate into the desired amino acids.

Another means of increasing the number of carbohydrate moieties on the
breast cancer polypeptide is by chemical or enzymatic coupling of glycosides to the
polypeptide. Such methods are described in the art, e.g., in WO 87/05330, and in Aplin &
Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981).

Removal of carbohydrate moieties present on the breast cancer polypeptide
may be accomplished chemically or enzymatically or by mutational substitution of codons
encoding amino acid residues that serve as targets for glycosylation. Chemical
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin
et al., Arch. Biochem. Biophys. 259:52 (1987) and by Edge et al., Anal. Biochem. 118:131
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the
use of a variety of endo- and exo-glycosidases as described by Thotakura et al., Meth.
Enzymol. 138:350 (1987).

Another type of covalent modification of breast cancer polypeptides comprises linking the
breast cancer polypeptide to one of a variety of nonproteinaceous polymers, e.g.,
polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337.

Breast cancer polypeptides of the present invention may also be modified in a
way to form chimeric molecules comprising a breast cancer polypeptide fused to another,
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric
molecule comprises a fusion of a breast cancer polypeptide with a tag polypeptide which
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is
generally placed at the amino- or carboxyl-terminus of the breast cancer polypeptide. The
presence of such epitope-tagged forms of a breast cancer polypeptide can be detected using
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the breast
cancer polypeptide to be readily purified by affinity purification using an anti-tag antibody or
another type of affinity matrix that binds to the epitope tag. In an alternative embodiment,
the chimeric molecule may comprise a fusion of a breast cancer polypeptide with an
immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the
chimeric molecule, such a fusion could be to the Fc region of an IgG molecule.

Various tag polypeptides and their respective antibodies are well known in the
art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags;
HIS6 and metal chelation tags; the flu HA tag polypeptide and its antibody 12CA5 (Field et
al., Mol. Cell. Biol. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and
9E10 antibodies thereto (Evan et al., Molecular and Cellular Biology 5:3610-3616 (1985));
and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody (Paborsky et al.,
Protein Engineering 3(6):547-553 (1990)). Other tag polypeptides include the Flag-peptide
(Hopp et al., BioTechnology 6:1204-1210 (1988)); the KT3 epitope peptide (Martin et al.,
Science 255:192-194 (1992)); tubulin epitope peptide (Skinner et al., J. Biol. Chem.
266:15163-15166 (1991)); and the T7 gene 10 protein peptide tag (Lutz-Freyermuth et al.,
Proc. Natl. Acad. Sci. USA 87:6393-6397 (1990)).

Also included are other breast cancer proteins of the breast cancer family, and
breast cancer proteins from other organisms, which are cloned and expressed as outlined
below. Thus, probe or degenerate polymerase chain reaction (PCR) primer sequences may be
used to find other related breast cancer proteins from humans or other organisms. As will be
appreciated by those in the art, particularly useful probe and/or PCR primer sequences
include the unique areas of the breast cancer nucleic acid sequence. As is generally known in
the art, preferred PCR primers are from about 15 to about 35 nucleotides in length, with from
about 20 to about 30 being preferred, and may contain inosine as needed. The conditions for
the PCR reaction are well known in the art (e.g., Innis, PCR Protocols, supra).
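
The primer length guidance above can be illustrated with a short check. The following Python sketch is a minimal illustration only and is not part of the described methods: the function name `primer_length_class` is hypothetical, the "about" bounds in the text are treated as hard cutoffs for simplicity, and inosine is assumed to be written as `I`.

```python
# Hypothetical helper; the length ranges are taken from the text above and
# treated as hard cutoffs, which is an assumption made for illustration only.
ALLOWED_BASES = set("ACGTI")  # I = inosine, which the text permits "as needed"


def primer_length_class(primer: str) -> str:
    """Return 'preferred' (20-30 nt), 'acceptable' (15-35 nt), or 'out of range'."""
    seq = primer.upper()
    unexpected = set(seq) - ALLOWED_BASES
    if unexpected:
        raise ValueError(f"unexpected characters in primer: {sorted(unexpected)}")
    length = len(seq)
    if 20 <= length <= 30:
        return "preferred"
    if 15 <= length <= 35:
        return "acceptable"
    return "out of range"
```

A call such as `primer_length_class("ACGT" * 5)` classifies a 20-nucleotide primer; degenerate IUPAC codes other than inosine are deliberately rejected in this sketch.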

Antibodies to breast cancer proteins 

In a preferred embodiment, when the breast cancer protein is to be used to
generate antibodies, e.g., for immunotherapy or immunodiagnosis, the breast cancer protein
should share at least one epitope or determinant with the full length protein. By "epitope" or
"determinant" herein is typically meant a portion of a protein which will generate and/or bind
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies
made to a smaller breast cancer protein will be able to bind to the full-length protein,
particularly to linear epitopes. In a preferred embodiment, the epitope is unique; that is,
antibodies generated to a unique epitope show little or no cross-reactivity.

Methods of preparing polyclonal antibodies are known to the skilled artisan
(e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant.
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple
subcutaneous or intraperitoneal injections. The immunizing agent may include a protein
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in
the mammal being immunized. Examples of such immunogenic proteins include but are not
limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean
trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose
dicorynomycolate). The immunization protocol may be selected by one skilled in the art
without undue experimentation.

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal
antibodies may be prepared using hybridoma methods, such as those described by Kohler &
Milstein, Nature 256:495 (1975). In a hybridoma method, a mouse, hamster, or other
appropriate host animal is typically immunized with an immunizing agent to elicit
lymphocytes that produce or are capable of producing antibodies that will specifically bind to
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The
immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables
1-25 or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph
node cells are used if non-human mammalian sources are desired. The lymphocytes are then
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene
glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice,
pp. 59-103 (1986)). Immortalized cell lines are usually transformed mammalian cells,
particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse
myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture
medium that preferably contains one or more substances that inhibit the growth or survival of
the unfused, immortalized cells. For example, if the parental cells lack the enzyme
hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium
for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT
medium"), which substances prevent the growth of HGPRT-deficient cells.

In one embodiment, the antibodies are bispecific antibodies. Bispecific
antibodies are monoclonal, preferably human or humanized, antibodies that have binding
specificities for at least two different antigens or that have binding specificities for two
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a
protein encoded by a nucleic acid of Tables 1-25 or a fragment thereof, the other one is for any
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit,
preferably one that is tumor specific. Alternatively, tetramer-type technology may create
multivalent reagents.

In a preferred embodiment, the antibodies to breast cancer protein are capable
of reducing or eliminating a biological function of a breast cancer protein, as is described
below. That is, the addition of anti-breast cancer protein antibodies (either polyclonal or
preferably monoclonal) to breast cancer tissue (or cells containing breast cancer) may reduce
or eliminate the breast cancer. Generally, at least a 25% decrease in activity, growth, size or
the like is preferred, with at least about 50% being particularly preferred and about a
95-100% decrease being especially preferred.

In a preferred embodiment the antibodies to the breast cancer proteins are
humanized antibodies (e.g., Xenerex Biosciences, Medarex, Inc., Abgenix, Inc., Protein
Design Labs, Inc.). Humanized forms of non-human (e.g., murine) antibodies are chimeric
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv,
Fab, Fab', F(ab')2 or other antigen-binding subsequences of antibodies) which contain
minimal sequence derived from non-human immunoglobulin. Humanized antibodies include
human immunoglobulins (recipient antibody) in which residues from a complementarity
determining region (CDR) of the recipient are replaced by residues from a CDR of a
non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity,
affinity and capacity. In some instances, Fv framework residues of the human
immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies
may also comprise residues which are found neither in the recipient antibody nor in the
imported CDR or framework sequences. In general, a humanized antibody will comprise
substantially all of at least one, and typically two, variable domains, in which all or
substantially all of the CDR regions correspond to those of a non-human immunoglobulin
and all or substantially all of the framework (FR) regions are those of a human
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise
at least a portion of an immunoglobulin constant region (Fc), typically that of a human
immunoglobulin (Jones et al., Nature 321:522-525 (1986); Riechmann et al., Nature
332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992)). Humanization
can be essentially performed following the method of Winter and co-workers (Jones et al.,
Nature 321:522-525 (1986); Riechmann et al., Nature 332:323-327 (1988); Verhoeyen et al.,
Science 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the
corresponding sequences of a human antibody. Accordingly, such humanized antibodies are
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact
human variable domain has been substituted by the corresponding sequence from a
non-human species.

Human antibodies can also be produced using various techniques known in the
art, including phage display libraries (Hoogenboom & Winter, J. Mol. Biol. 227:381 (1991);
Marks et al., J. Mol. Biol. 222:581 (1991)). The techniques of Cole et al. and Boerner et al.
are also available for the preparation of human monoclonal antibodies (Cole et al.,
Monoclonal Antibodies and Cancer Therapy, p. 77 (1985) and Boerner et al., J. Immunol.
147(1):86-95 (1991)). Similarly, human antibodies can be made by introducing human
immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous
immunoglobulin genes have been partially or completely inactivated. Upon challenge,
human antibody production is observed, which closely resembles that seen in humans in all
respects, including gene rearrangement, assembly, and antibody repertoire. This approach is
described, e.g., in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425;
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology
10:779-783 (1992); Lonberg et al., Nature 368:856-859 (1994); Morrison, Nature 368:812-13
(1994); Fishwild et al., Nature Biotechnology 14:845-51 (1996); Neuberger, Nature
Biotechnology 14:826 (1996); Lonberg & Huszar, Intern. Rev. Immunol. 13:65-93 (1995).

By immunotherapy is meant treatment of breast cancer with an antibody raised
against breast cancer proteins. As used herein, immunotherapy can be passive or active.
Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient
(patient). Active immunization is the induction of antibody and/or T-cell responses in a
recipient (patient). Induction of an immune response is the result of providing the recipient
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the
art, the antigen may be provided by injecting a polypeptide against which antibodies are
desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of
expressing the antigen and under conditions for expression of the antigen, leading to an
immune response.

In a preferred embodiment the breast cancer proteins against which antibodies
are raised are secreted proteins as described above. Without being bound by theory,
antibodies used for treatment bind and prevent the secreted protein from binding to its
receptor, thereby inactivating the secreted breast cancer protein.

In another preferred embodiment, the breast cancer protein to which
antibodies are raised is a transmembrane protein. Without being bound by theory, antibodies
used for treatment bind the extracellular domain of the breast cancer protein and prevent it
from binding to other proteins, such as circulating ligands or cell-associated molecules. The
antibody may cause down-regulation of the transmembrane breast cancer protein. As will be
appreciated by one of ordinary skill in the art, the antibody may be a competitive,
non-competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the
breast cancer protein. The antibody is also an antagonist of the breast cancer protein.
Further, the antibody prevents activation of the transmembrane breast cancer protein. In one
aspect, when the antibody prevents the binding of other molecules to the breast cancer
protein, the antibody prevents growth of the cell. The antibody may also be used to target or
sensitize the cell to cytotoxic agents, including, but not limited to TNF-α, TNF-β, IL-1, IFN-γ
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin,
methotrexate, and the like. In some instances the antibody belongs to a sub-type that
activates serum complement when complexed with the transmembrane protein thereby
mediating cytotoxicity or antibody-dependent cellular cytotoxicity (ADCC). Thus, breast
cancer is treated by administering to a patient antibodies directed against the transmembrane
breast cancer protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or
otherwise provide means to locally ablate cells.

In another preferred embodiment, the antibody is conjugated to an effector
moiety. The effector moiety can be any number of molecules, including labelling moieties
such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In one aspect
the therapeutic moiety is a small molecule that modulates the activity of the breast cancer
protein. In another aspect the therapeutic moiety modulates the activity of molecules
associated with or in close proximity to the breast cancer protein. The therapeutic moiety
may inhibit enzymatic activity such as protease or collagenase or protein kinase activity
associated with breast cancer.

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic
agent. In this method, targeting the cytotoxic agent to breast cancer tissue or cells results in
a reduction in the number of afflicted cells, thereby reducing symptoms associated with
breast cancer. Cytotoxic agents are numerous and varied and include, but are not limited to,
cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their
corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include
radiochemicals made by conjugating radioisotopes to antibodies raised against breast cancer
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to
the antibody. Targeting the therapeutic moiety to transmembrane breast cancer proteins not
only serves to increase the local concentration of therapeutic moiety in the breast cancer
afflicted area, but also serves to reduce deleterious side effects that may be associated with
the therapeutic moiety.

In another preferred embodiment, the breast cancer protein against which the
antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated
to a protein which facilitates entry into the cell. In one case, the antibody enters the cell by
endocytosis. In another embodiment, a nucleic acid encoding the antibody is administered to
the individual or cell. Moreover, where the breast cancer protein can be targeted within a
cell, e.g., the nucleus, an antibody thereto contains a signal for that target localization, e.g., a
nuclear localization signal.

The breast cancer antibodies of the invention specifically bind to breast cancer
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a
Kd of at least about 0.1 mM, more usually at least about 1 μM, preferably at least about 0.1
μM or better, and most preferably, 0.01 μM or better. Selectivity of binding is also
important.
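
The binding thresholds above can be expressed as a simple lookup. The following Python sketch is illustrative only: the function name `binding_tier` and the tier labels are hypothetical, "at least" is read as "this Kd or tighter" (a numerically smaller dissociation constant), and the "about" bounds are treated as hard cutoffs.

```python
# Hypothetical helper; thresholds come from the text above, in mol/L.
# Assumption: a smaller Kd means tighter binding, so each tier is "Kd <= bound".
def binding_tier(kd_molar: float) -> str:
    """Map a dissociation constant (mol/L) onto the tiers named in the text."""
    if kd_molar <= 0:
        raise ValueError("Kd must be a positive concentration")
    if kd_molar <= 1e-8:   # 0.01 uM or better
        return "most preferred"
    if kd_molar <= 1e-7:   # 0.1 uM or better
        return "preferred"
    if kd_molar <= 1e-6:   # 1 uM or better
        return "usual"
    if kd_molar <= 1e-4:   # 0.1 mM or better
        return "specific"
    return "not specific"
```

For instance, an antibody with a Kd of 5 nM would be passed as `binding_tier(5e-9)`. The sketch says nothing about selectivity, which the text treats as a separate requirement.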


Detection of breast cancer sequence for diagnostic and therapeutic applications 

In one aspect, the RNA expression levels of genes are determined for different
cellular states in the breast cancer phenotype. Expression levels of genes in normal tissue
(i.e., not undergoing breast cancer) and in breast cancer tissue (and in some cases, for varying
severities of breast cancer that relate to prognosis, as outlined below) are evaluated to provide
expression profiles. An expression profile of a particular cell state or point of development is
essentially a "fingerprint" of the state. While two states may have any particular gene
similarly expressed, the evaluation of a number of genes simultaneously allows the
generation of a gene expression profile that is reflective of the state of the cell. By comparing
expression profiles of cells in different states, information regarding which genes are
important (including both up- and down-regulation of genes) in each of these states is
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue
sample has the gene expression profile of normal or cancerous tissue. This will provide for
molecular diagnosis of related conditions.

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression
patterns within and among cells and tissue. Thus, a differentially expressed gene can
qualitatively have its expression altered, including an activation or inactivation, in, e.g.,
normal versus breast cancer tissue. Genes may be turned on or turned off in a particular state,
relative to another state thus permitting comparison of two or more states. A qualitatively
regulated gene will exhibit an expression pattern within a state or cell type which is
detectable by standard techniques. Some genes will be expressed in one state or cell type, but
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that
expression is increased or decreased; i.e., gene expression is either upregulated, resulting in
an increased amount of transcript, or downregulated, resulting in a decreased amount of
transcript. The degree to which expression differs need only be large enough to quantify via
standard characterization techniques as outlined below, such as by use of Affymetrix
GeneChip™ expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996),
hereby expressly incorporated by reference. Other techniques include, but are not limited to,
quantitative reverse transcriptase PCR, northern analysis and RNase protection. As outlined
above, preferably the change in expression (i.e., upregulation or downregulation) is at least
about 50%, more preferably at least about 100%, more preferably at least about 150%, more
preferably at least about 200%, with from 300 to at least 1000% being especially preferred.
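
The percentage thresholds above can be illustrated with a small calculation. The following Python sketch is one possible reading and not a method stated in the text: the function name `expression_change` is hypothetical, and the percent change is computed symmetrically from the fold difference (larger level divided by smaller level, minus one) so that downregulation can exceed 100%. That symmetric definition is an assumption made here for illustration.

```python
# Hypothetical helper. Assumption: percent change = (fold - 1) * 100, where
# fold is the larger expression level divided by the smaller one.
def expression_change(reference: float, sample: float):
    """Return (direction, percent_change, tier) for two expression levels."""
    if reference <= 0 or sample <= 0:
        raise ValueError("expression levels must be positive")
    if sample == reference:
        return ("unchanged", 0.0, "below 50%")
    direction = "up" if sample > reference else "down"
    fold = max(sample, reference) / min(sample, reference)
    percent = (fold - 1.0) * 100.0
    # Thresholds named in the text, checked from the most to least stringent.
    for threshold in (300, 200, 150, 100, 50):
        if percent >= threshold:
            return (direction, percent, f"at least {threshold}%")
    return (direction, percent, "below 50%")
```

Under this reading, a transcript measured at 100 units in normal tissue and 25 units in breast cancer tissue is a four-fold reduction and falls in the "at least 300%" tier.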

Evaluation may be at the gene transcript or the protein level. The amount of
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the
final gene product itself (protein) can be monitored, e.g., with antibodies to the breast cancer
protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to breast
cancer genes, i.e., those identified as being important in a breast cancer phenotype, can be
evaluated in a breast cancer diagnostic test.

In a preferred embodiment, gene expression monitoring is performed
simultaneously on a number of genes. Multiple protein expression monitoring can be
performed as well. Similarly, these assays may be performed on an individual basis as well.

In this embodiment, the breast cancer nucleic acid probes are attached to
biochips as outlined herein for the detection and quantification of breast cancer sequences in
a particular cell. The assays are further described below in the example. PCR techniques can
be used to provide greater sensitivity.

In a preferred embodiment nucleic acids encoding the breast cancer protein are
detected. Although DNA or RNA encoding the breast cancer protein may be detected, of
particular interest are methods wherein an mRNA encoding a breast cancer protein is
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is
complementary to and hybridizes with the mRNA and includes, but is not limited to,
oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be
examined on a solid support such as nylon membranes and hybridizing the probe with the
sample. Following washing to remove the non-specifically bound probe, the label is
detected. In another method detection of the mRNA is performed in situ. In this method
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following
washing to remove the non-specifically bound probe, the label is detected. For example, a
digoxigenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding a
breast cancer protein is detected by binding the digoxigenin with an anti-digoxigenin
secondary antibody and developed with nitro blue tetrazolium and
5-bromo-4-chloro-3-indolyl phosphate.

In a preferred embodiment, various proteins from the three classes of proteins
as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic
assays. The breast cancer proteins, antibodies, nucleic acids, modified proteins and cells
containing breast cancer sequences are used in diagnostic assays. This can be performed on
an individual gene or corresponding polypeptide level. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes and/or corresponding
polypeptides.

As described and defined herein, breast cancer proteins, including
intracellular, transmembrane or secreted proteins, find use as markers of breast cancer.
Detection of these proteins in putative breast cancer tissue allows for detection or diagnosis
of breast cancer. In one embodiment, antibodies are used to detect breast cancer protein. A
preferred method separates proteins from a sample by electrophoresis on a gel (typically a
denaturing and reducing protein gel, but may be another type of gel, including isoelectric
focusing gels and the like). Following separation of proteins, the breast cancer protein is
detected, e.g., by immunoblotting with antibodies raised against the breast cancer protein.
Methods of immunoblotting are well known to those of ordinary skill in the art.

In another preferred method, antibodies to the breast cancer protein find use in
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one
to many antibodies to the breast cancer protein(s). Following washing to remove
non-specific antibody binding, the presence of the antibody or antibodies is detected. In one
embodiment the antibody is detected by incubating with a secondary antibody that contains a
detectable label. In another method the primary antibody to the breast cancer protein(s)
contains a detectable label, e.g., an enzyme marker that can act on a substrate. In another
preferred embodiment each one of multiple primary antibodies contains a distinct and
detectable label. This method finds particular use in simultaneous screening for a plurality of
breast cancer proteins. As will be appreciated by one of ordinary skill in the art, many other
histological imaging techniques are also provided by the invention.

In a preferred embodiment the label is detected in a fluorometer which has the
ability to detect and distinguish emissions of different wavelengths. In addition, a
fluorescence activated cell sorter (FACS) can be used in the method.

In another preferred embodiment, antibodies find use in diagnosing breast
cancer from blood, serum, plasma, stool, and other samples. Such samples, therefore, are
useful as samples to be probed or tested for the presence of breast cancer proteins.
Antibodies can be used to detect a breast cancer protein by previously described
immunoassay techniques including ELISA, immunoblotting (western blotting),
immunoprecipitation, BIACORE technology and the like. Conversely, the presence of
antibodies may indicate an immune response against an endogenous breast cancer protein.

In a preferred embodiment, in situ hybridization of labeled breast cancer
nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including
breast cancer tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel,
supra) is then performed. When comparing the fingerprints between an individual and a
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the
findings. It is further understood that the genes which indicate the diagnosis may differ from
those which indicate the prognosis and molecular profiling of the condition of the cells may
lead to distinctions between responsive or refractory conditions or may be predictive of
outcomes.

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic
acids, modified proteins and cells containing breast cancer sequences are used in prognosis
assays. As above, gene expression profiles can be generated that correlate to breast cancer,
in terms of long term prognosis. Again, this may be done on either a protein or gene level,
with the use of genes being preferred. As above, breast cancer probes may be attached to
biochips for the detection and quantification of breast cancer sequences in a tissue or patient.
The assays proceed as outlined above for diagnosis. PCR methods may provide more
sensitive and accurate quantification.

Assays for therapeutic compounds 

In a preferred embodiment members of the proteins, nucleic acids, and
antibodies as described herein are used in drug screening assays. The breast cancer proteins,
antibodies, nucleic acids, modified proteins and cells containing breast cancer sequences are
used in drug screening assays or by evaluating the effect of drug candidates on a "gene
expression profile" or expression profile of polypeptides. In a preferred embodiment, the
expression profiles are used, preferably in conjunction with high throughput screening
techniques to allow monitoring for expression profile genes after treatment with a candidate
agent (e.g., Zlokarnik et al., Science 279:84-8 (1998); Heid, Genome Res. 6:986-94 (1996)).

In a preferred embodiment, the breast cancer proteins, antibodies, nucleic
acids, modified proteins and cells containing the native or modified breast cancer proteins are
used in screening assays. That is, the present invention provides novel methods for screening
for compositions which modulate the breast cancer phenotype or an identified physiological
function of a breast cancer protein. As above, this can be done on an individual gene level or
by evaluating the effect of drug candidates on a "gene expression profile". In a preferred
embodiment, the expression profiles are used, preferably in conjunction with high throughput
screening techniques to allow monitoring for expression profile genes after treatment with a
candidate agent, see Zlokarnik, supra.
Having identified the differentially expressed genes herein, a variety of assays
may be executed. In a preferred embodiment, assays may be run on an individual gene or
protein level. That is, having identified a particular gene as upregulated in breast cancer, test
compounds can be screened for the ability to modulate gene expression or for binding to the
breast cancer protein. "Modulation" thus includes both an increase and a decrease in gene
expression. The preferred amount of modulation will depend on the original change of the
gene expression in normal tissue versus tissue undergoing breast cancer, with changes of at least
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or
greater. Thus, if a gene exhibits a 4-fold increase in breast cancer tissue compared to normal
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in breast
cancer tissue compared to normal tissue often provides a target value of a 10-fold increase in
expression to be induced by the test compound.
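
The arithmetic in the preceding paragraph can be sketched as follows. This is a minimal illustration only: it assumes that the desired modulation is simply the reciprocal of the fold change observed in breast cancer tissue relative to normal tissue, and the function names are hypothetical rather than part of any method described herein.

```python
def target_fold_change(cancer_level: float, normal_level: float) -> float:
    """Fold change a test compound would need to induce to bring the
    cancer-tissue expression level back to the normal-tissue level.

    Hypothetical helper: values below 1 mean a decrease is desired,
    values above 1 mean an increase is desired.
    """
    if cancer_level <= 0 or normal_level <= 0:
        raise ValueError("expression levels must be positive")
    return normal_level / cancer_level


def describe_target(cancer_level: float, normal_level: float) -> str:
    """Human-readable summary of the desired modulation."""
    target = target_fold_change(cancer_level, normal_level)
    if target < 1:
        return f"decrease of about {1 / target:g}-fold desired"
    if target > 1:
        return f"increase of about {target:g}-fold desired"
    return "no change desired"
```

For a gene expressed at 4 units in breast cancer tissue and 1 unit in normal tissue, `describe_target(4.0, 1.0)` reports that a decrease of about 4-fold is desired, matching the first case in the text.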

The amount of gene expression may be monitored using nucleic acid probes
and the quantification of gene expression levels, or, alternatively, the gene product itself can
be monitored, e.g., through the use of antibodies to the breast cancer protein and standard
immunoassays. Proteomics and separation techniques may also allow quantification of
expression.

In a preferred embodiment, the expression of a number of genes or proteins,
i.e., an expression profile, is monitored simultaneously. Such profiles will
typically involve a plurality of those entities described herein.

In this embodiment, the breast cancer nucleic acid probes are attached to
biochips as outlined herein for the detection and quantification of breast cancer sequences in
a particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plates,
may be used with dispensed primers in desired wells. A PCR reaction can then be performed
and analyzed for each well.

Expression monitoring can be performed to identify compounds that modify
the expression of one or more breast cancer-associated sequences, e.g., a polynucleotide
sequence set out in Table 17. Generally, in a preferred embodiment, a test modulator is added
to the cells prior to analysis. Moreover, screens are also provided to identify agents that
modulate breast cancer, modulate breast cancer proteins, bind to a breast cancer protein, or
interfere with the binding of a breast cancer protein and an antibody or other binding partner.

The term "test compound" or "drug candidate" or "modulator" or grammatical
equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or
indirectly alter the breast cancer phenotype or the expression of a breast cancer sequence,
e.g., a nucleic acid or protein sequence. In preferred embodiments, modulators alter
expression profiles, or expression profile nucleic acids or proteins provided herein. In one
embodiment, the modulator suppresses a breast cancer phenotype, e.g., to a normal tissue
fingerprint. In another embodiment, a modulator induces a breast cancer phenotype.
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations
to obtain a differential response to the various concentrations. Typically, one of these
concentrations serves as a negative control, i.e., at zero concentration or below the level of
detection.
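
By way of illustration, a set of parallel assay concentrations including a zero-concentration negative control might be laid out as below. The serial-dilution scheme, the function name and the parameter names are assumptions made for this sketch; the text above does not prescribe any particular spacing of concentrations.

```python
def concentration_series(top: float, dilution: float, points: int) -> list[float]:
    """Return agent concentrations in ascending order, starting with a
    zero-concentration negative control followed by a serial dilution
    ending at `top` (hypothetical layout, arbitrary concentration units).
    """
    if top <= 0 or dilution <= 1 or points < 1:
        raise ValueError("need top > 0, dilution > 1 and points >= 1")
    dilutions = [top / dilution**i for i in range(points)]
    # The negative control (0.0) goes first, then lowest to highest dose.
    return [0.0] + dilutions[::-1]
```

Each returned value would correspond to one assay mixture run in parallel, so that the response at every non-zero concentration can be compared with the response of the control.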

Drug candidates encompass numerous chemical classes, though typically they
are organic molecules, preferably small organic compounds having a molecular weight of
more than 100 and less than about 2,500 daltons. Preferred small molecules are less than
2000, or less than 1500 or less than 1000 or less than 500 D. Candidate agents comprise
functional groups necessary for structural interaction with proteins, particularly hydrogen
bonding, and typically include at least an amine, carbonyl, hydroxyl or carboxyl group,
preferably at least two of the functional chemical groups. The candidate agents often
comprise cyclical carbon or heterocyclic structures and/or aromatic or polyaromatic
structures substituted with one or more of the above functional groups. Candidate agents are
also found among biomolecules including peptides, saccharides, fatty acids, steroids, purines,
pyrimidines, derivatives, structural analogs or combinations thereof. Particularly preferred
are peptides.

In one aspect, a modulator will neutralize the effect of a breast cancer protein.
By "neutralize" is meant that activity of a protein is inhibited or blocked, along with the
consequent effect on the cell.
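
The molecular weight and functional group preferences stated above for small organic drug candidates could be applied as a simple pre-filter, sketched below. The `Candidate` record, its field names and the group labels are hypothetical and chosen only for illustration; a real screen would derive these properties from chemical structure data.

```python
from dataclasses import dataclass

# Functional groups named in the text as typical for candidate agents.
FUNCTIONAL_GROUPS = {"amine", "carbonyl", "hydroxyl", "carboxyl"}


@dataclass(frozen=True)
class Candidate:
    name: str
    molecular_weight: float  # daltons
    groups: frozenset  # labels of functional groups present


def is_preferred_small_molecule(
    candidate: Candidate,
    min_mw: float = 100.0,
    max_mw: float = 2500.0,
    min_groups: int = 2,
) -> bool:
    """True if the candidate falls within the stated weight window and
    carries at least `min_groups` of the listed functional groups."""
    matching = FUNCTIONAL_GROUPS & candidate.groups
    in_window = min_mw < candidate.molecular_weight < max_mw
    return in_window and len(matching) >= min_groups
```

Lowering `max_mw` to 2000, 1500, 1000 or 500 reproduces the progressively narrower preferences listed in the text.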

In certain embodiments, combinatorial libraries of potential modulators will be
screened for an ability to bind to a breast cancer polypeptide or to modulate activity.
Conventionally, new chemical entities with useful properties are generated by identifying a
chemical compound (called a "lead compound") with some desirable property or activity,
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property
and activity of those variant compounds. Often, high throughput screening (HTS) methods
are employed for such an analysis.

In one preferred embodiment, high throughput screening methods involve
providing a library containing a large number of potential therapeutic compounds (candidate
compounds). Such "combinatorial chemical libraries" are then screened in one or more
assays to identify those library members (particular chemical species or subclasses) that
display a desired characteristic activity. The compounds thus identified can serve as
conventional "lead compounds" or can themselves be used as potential or actual therapeutics.

A combinatorial chemical library is a collection of diverse chemical
compounds generated by either chemical synthesis or biological synthesis by combining a
number of chemical "building blocks" such as reagents. For example, a linear combinatorial
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of
chemical building blocks called amino acids in every possible way for a given compound
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical
compounds can be synthesized through such combinatorial mixing of chemical building
blocks (Gallop et al., J. Med. Chem. 37(9):1233-1251 (1994)).
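
The scale of such a linear library follows from simple counting: with 20 standard amino acids and a compound length of n residues, every possible combination gives 20^n distinct sequences. The sketch below illustrates this for short peptides; restricting the alphabet to the 20 standard amino acids, and the function names, are assumptions made for the example.

```python
from itertools import product

# One-letter codes of the 20 standard amino acids (assumed building blocks).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def library_size(length: int, alphabet: str = AMINO_ACIDS) -> int:
    """Number of distinct linear sequences of the given length."""
    return len(alphabet) ** length


def enumerate_library(length: int, alphabet: str = AMINO_ACIDS):
    """Lazily yield every sequence of the given length, in lexicographic
    order of the alphabet."""
    for residues in product(alphabet, repeat=length):
        yield "".join(residues)
```

Under these assumptions a library of pentapeptides already contains 3,200,000 members, which is consistent with the statement that millions of compounds can be produced by combinatorial mixing.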

Preparation and screening of combinatorial chemical libraries is well known to
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to,
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka, Pept. Prot. Res. 37:487-493
(1991), Houghton et al., Nature, 354:84-88 (1991)), peptoids (PCT Publication No WO
91/19735), encoded peptides (PCT Publication WO 93/20242), random bio-oligomers (PCT
Publication WO 92/00091), benzodiazepines (U.S. Pat. No. 5,288,514), diversomers such as
hydantoins, benzodiazepines and dipeptides (Hobbs et al., Proc. Nat. Acad. Sci. USA
90:6909-6913 (1993)), vinylogous polypeptides (Hagihara et al., J. Amer. Chem. Soc.
114:6568 (1992)), nonpeptidal peptidomimetics with a Beta-D-Glucose scaffolding
(Hirschmann et al., J. Amer. Chem. Soc. 114:9217-9218 (1992)), analogous organic syntheses
of small compound libraries (Chen et al., J. Amer. Chem. Soc. 116:2661 (1994)),
oligocarbamates (Cho, et al., Science 261:1303 (1993)), and/or peptidyl phosphonates
(Campbell et al., J. Org. Chem. 59:658 (1994)). See, generally, Gordon et al., J. Med. Chem.
37:1385 (1994), nucleic acid libraries (see, e.g., Strategene, Corp.), peptide nucleic acid
libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn et al., Nature
Biotechnology 14(3):309-314 (1996), and PCT/US96/10287), carbohydrate libraries (see,
e.g., Liang et al., Science 274:1520-1522 (1996), and U.S. Patent No. 5,593,853), and small
organic molecule libraries (see, e.g., benzodiazepines, Baum, C&EN, Jan 18, page 33 (1993);
isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and metathiazanones, U.S. Patent No.
5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 5,519,134; morpholino compounds,
U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent No. 5,288,514; and the like).

Devices for the preparation of combinatorial libraries are commercially
available (see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY; Symphony,
Rainin, Woburn, MA; 433A Applied Biosystems, Foster City, CA; 9050 Plus, Millipore,
Bedford, MA).

A number of well known robotic systems have also been developed for
solution phase chemistries. These systems include automated workstations like the
automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka,
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation,
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual
synthetic operations performed by a chemist. Any of the above devices are suitable for use
with the present invention. The nature and implementation of modifications to these devices
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the
relevant art. In addition, numerous combinatorial libraries are themselves commercially
available (see, e.g., ComGenex, Princeton, N.J., Asinex, Moscow, Ru, Tripos, Inc., St Louis,
MO, ChemStar, Ltd., Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences,
Columbia, MD, etc.).

The assays to identify modulators are amenable to high throughput screening.
Preferred assays thus detect enhancement or inhibition of breast cancer gene transcription,
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of
polypeptide activity.

High throughput assays for the presence, absence, quantification, or other
properties of particular nucleic acids or protein products are well known to those of skill in
the art. Binding assays and reporter gene assays are similarly well known. Thus,
e.g., U.S. Patent No. 5,559,410 discloses high throughput screening methods for proteins,
U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic acid
binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high
throughput methods of screening for ligand/antibody binding.

In addition, high throughput screening systems are commercially available
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman
Instruments, Inc., Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems
typically automate entire procedures, including all sample and reagent pipetting, liquid
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate
for the assay. These configurable systems provide high throughput and rapid start up as well
as a high degree of flexibility and customization. The manufacturers of such systems provide
detailed protocols for various high throughput systems. Thus, e.g., Zymark Corp. provides
technical bulletins describing screening systems for detecting the modulation of gene
transcription, ligand binding, and the like.

In one embodiment, modulators are proteins, often naturally occurring
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In
this way libraries of proteins may be made for screening in the methods of the invention.
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and
mammalian proteins, with the latter being preferred, and human proteins being especially
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the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about
30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7
to about 15 being particularly preferred. The peptides may be digests of naturally occurring
proteins as is outlined above, random peptides, or "biased" random peptides. By
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide
consists of essentially random nucleotides and amino acids, respectively. Since generally
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they
may incorporate any nucleotide or amino acid at any position. The synthetic process can be
designed to generate randomized proteins or nucleic acids, to allow the formation of all or
most of the possible combinations over the length of the sequence, thus forming a library of
randomized candidate bioactive proteinaceous agents.

In one embodiment, the library is fully randomized, with no sequence
preferences or constants at any position. In a preferred embodiment, the library is biased.
That is, some positions within the sequence are either held constant, or are selected from a
limited number of possibilities. For example, in a preferred embodiment, the nucleotides or
amino acid residues are randomized within a defined class, e.g., of hydrophobic amino acids,
hydrophilic residues, sterically biased (either small or large) residues, towards the creation of
nucleic acid binding domains, the creation of cysteines for cross-linking, prolines for SH-3
domains, serines, threonines, tyrosines or histidines for phosphorylation sites, etc., or to
purines, etc.
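
The difference between a fully randomized and a biased library can be sketched as a per-position template, where each position is held constant, drawn from a defined class, or drawn from all residues. The template representation, the particular hydrophobic class and the function names below are illustrative assumptions, not a description of any actual synthesis procedure.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (assumed)
HYDROPHOBIC = "AVLIMFW"               # one possible "defined class"


def random_peptide(template, rng):
    """Build one peptide. `template` has one string per position listing
    the residues allowed there; a single letter holds the position constant."""
    return "".join(rng.choice(allowed) for allowed in template)


def biased_library(template, size, seed=0):
    """Generate `size` peptides from the template with a seeded generator
    so that the example is reproducible."""
    rng = random.Random(seed)
    return [random_peptide(template, rng) for _ in range(size)]


# Example: constant N-terminal Cys, one hydrophobic position,
# two fully random positions, constant C-terminal Pro.
EXAMPLE_TEMPLATE = ["C", HYDROPHOBIC, AMINO_ACIDS, AMINO_ACIDS, "P"]
```

A fully randomized library is the special case in which every position of the template is `AMINO_ACIDS`.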

Modulators of breast cancer can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may
be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids.
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above
for proteins.

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 


WO 02/059377 PCT/US02/02242 


After the candidate agent has been added and the cells allowed to incubate for
some period of time, the sample containing a target sequence to be analyzed is added to the
biochip. If required, the target sequence is prepared using known techniques. For example,
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc.,
with purification and/or amplification such as PCR performed as appropriate. For example,
an in vitro transcription with labels covalently attached to the nucleotides is performed.
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with Cy3 or Cy5.

In a preferred embodiment, the target sequence is labeled with, e.g., a
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of
detecting the target sequence's specific binding to a probe. The label also can be an enzyme,
such as alkaline phosphatase or horseradish peroxidase, which when provided with an
appropriate substrate produces a product that can be detected. Alternatively, the label can be
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as an
epitope tag or biotin which specifically binds to streptavidin. For the example of biotin, the
streptavidin is labeled as described above, thereby providing a detectable signal for the
bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis.

As will be appreciated by those in the art, these assays can be direct
hybridization assays or can comprise "sandwich assays", which include the use of multiple
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730,
5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352,
5,594,118, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined
above, and then added to the biochip comprising a plurality of nucleic acid probes, under
conditions that allow the formation of a hybridization complex.

A variety of hybridization conditions may be used in the present invention,
including high, moderate and low stringency conditions as outlined above. The assays are
generally run under stringency conditions which allow formation of the label probe
hybridization complex only in the presence of target. Stringency can be controlled by
altering a step parameter that is a thermodynamic variable, including, but not limited to,
temperature, formamide concentration, salt concentration, chaotropic salt concentration, pH,
organic solvent concentration, etc.

These parameters may also be used to control non-specific binding, as is
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain
steps at higher stringency conditions to reduce non-specific binding.

The reactions outlined herein may be accomplished in a variety of ways.
Components of the reaction may be added simultaneously, or sequentially, in different orders,
with preferred embodiments outlined below. In addition, the reaction may include a variety
of other reagents. These include salts, buffers, neutral proteins, e.g., albumin, detergents, etc.,
which may be used to facilitate optimal hybridization and detection, and/or reduce
non-specific or background interactions. Reagents that otherwise improve the efficiency of the
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be
used as appropriate, depending on the sample preparation methods and purity of the target.

The assay data are analyzed to determine the expression levels, and changes in
expression levels as between states, of individual genes, forming a gene expression profile.

Screens are performed to identify modulators of the breast cancer phenotype.
In one embodiment, screening is performed to identify modulators that can induce or
suppress a particular expression profile, thus preferably generating the associated phenotype.
In another embodiment, for diagnostic applications, having identified differentially
expressed genes important in a particular state, screens can be performed to identify
modulators that alter expression of individual genes. In another embodiment, screening is
performed to identify modulators that alter a biological function of the expression product of
a differentially expressed gene. Again, having identified the importance of a gene in a
particular state, screens are performed to identify agents that bind and/or modulate the
biological activity of the gene product.

In addition, screens can be done for genes that are induced in response to a
candidate agent. After identifying a modulator based upon its ability to suppress a breast
cancer expression pattern leading to a normal expression pattern, or to modulate a single
breast cancer gene expression profile so as to mimic the expression of the gene from normal
tissue, a screen as described above can be performed to identify genes that are specifically
modulated in response to the agent. Comparing expression profiles between normal tissue
and agent treated breast cancer tissue reveals genes that are not expressed in normal tissue or
breast cancer tissue, but are expressed in agent treated tissue. These agent-specific sequences
can be identified and used by methods described herein for breast cancer genes or proteins.
In particular, these sequences and the proteins they encode find use in marking or identifying
agent treated cells. In addition, antibodies can be raised against the agent induced proteins
and used to target novel therapeutics to the treated breast cancer tissue sample.
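
The comparison described above amounts to a set difference over expressed genes. The following sketch assumes each expression profile is available as a mapping from gene identifier to a measured level, and that a gene counts as "expressed" when its level exceeds a chosen threshold; both the data layout and the threshold are assumptions made for illustration.

```python
def expressed_genes(profile: dict, threshold: float = 0.0) -> set:
    """Genes whose measured level exceeds the detection threshold."""
    return {gene for gene, level in profile.items() if level > threshold}


def agent_specific_genes(
    normal: dict, cancer: dict, treated: dict, threshold: float = 0.0
) -> set:
    """Genes expressed in agent treated tissue but in neither normal
    tissue nor untreated breast cancer tissue."""
    return (
        expressed_genes(treated, threshold)
        - expressed_genes(normal, threshold)
        - expressed_genes(cancer, threshold)
    )
```

With hypothetical profiles in which gene `G3` is detected only after treatment, `agent_specific_genes` returns `{"G3"}`, i.e., the agent-specific sequence that could be used to mark treated cells.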

Thus, in one embodiment, a test compound is administered to a population of
breast cancer cells that have an associated breast cancer expression profile. By
"administration" or "contacting" herein is meant that the candidate agent is added to the cells
in such a manner as to allow the agent to act upon the cell, whether by uptake and
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid
encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct
such as an adenoviral or retroviral construct, and added to the cell, such that expression of
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems
can also be used.

Once the test compound has been administered to the cells, the cells can be
washed if desired and are allowed to incubate under preferably physiological conditions for
some period of time. The cells are then harvested and a new gene expression profile is
generated, as outlined herein.

Thus, e.g., breast cancer tissue may be screened for agents that modulate, e.g.,
induce or suppress, the breast cancer phenotype. A change in at least one gene, preferably
many, of the expression profile indicates that the agent has an effect on breast cancer activity.
By defining such a signature for the breast cancer phenotype, screens for new drugs that alter
the phenotype can be devised. With this approach, the drug target need not be known and
need not be represented in the original expression screening platform, nor does the level of
transcript for the target protein need to change.
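
One way to make "a change in at least one gene of the expression profile" concrete is to compare profiles taken before and after treatment and flag genes whose level changes by at least some fold. The two-fold cutoff, the small floor used to avoid division by zero, and the function names are assumptions of this sketch; the text above does not fix a particular threshold.

```python
import math


def changed_genes(before: dict, after: dict, min_fold: float = 2.0,
                  floor: float = 1e-9) -> dict:
    """Return {gene: after/before ratio} for genes present in both
    profiles whose level changed by at least `min_fold` in either
    direction. Levels below `floor` are clamped to `floor`."""
    changed = {}
    for gene in sorted(before.keys() & after.keys()):
        ratio = max(after[gene], floor) / max(before[gene], floor)
        if abs(math.log2(ratio)) >= math.log2(min_fold):
            changed[gene] = ratio
    return changed


def agent_has_effect(before: dict, after: dict, min_fold: float = 2.0) -> bool:
    """True if at least one gene of the profile changed."""
    return len(changed_genes(before, after, min_fold)) >= 1
```

Because the comparison is made on the whole profile, the sketch never needs to know which gene product the agent actually targets, in keeping with the point made in the text.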

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of
either the expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "breast cancer proteins" or
a "breast cancer modulatory protein". The breast cancer modulatory protein may be a
fragment, or alternatively, be the full length protein to the fragment encoded by the nucleic
acids of the Tables. Preferably, the breast cancer modulatory protein is a fragment. In a
preferred embodiment, the breast cancer amino acid sequence which is used to determine
sequence identity or similarity is encoded by a nucleic acid of Table 25. In another
embodiment, the sequences are naturally occurring allelic variants of a protein encoded by a
nucleic acid of Table 25. In another embodiment, the sequences are sequence variants as
further described herein.

Preferably, the breast cancer modulatory protein is a fragment of
approximately 14 to 24 amino acids long. More preferably the fragment is a soluble
fragment. Preferably, the fragment includes a non-transmembrane region. In a preferred
embodiment, the fragment has an N-terminal Cys to aid in solubility. In one embodiment, the
C-terminus of the fragment is kept as a free acid and the N-terminus is a free amine to aid in
coupling, i.e., to cysteine.

In one embodiment, the breast cancer proteins are conjugated to an
immunogenic agent as discussed herein. In one embodiment, the breast cancer protein is
conjugated to BSA.

Measurements of breast cancer polypeptide activity, or of breast cancer or the
breast cancer phenotype can be performed using a variety of assays. For example, the effects
of the test compounds upon the function of the breast cancer polypeptides can be measured
by examining parameters described above. A suitable physiological change that affects
activity can be used to assess the influence of a test compound on the polypeptides of this
invention. When the functional consequences are determined using intact cells or animals,
one can also measure a variety of effects such as, in the case of breast cancer associated with
tumors, tumor growth, tumor metastasis, neovascularization, hormone release, transcriptional
changes to both known and uncharacterized genetic markers (e.g., northern blots), changes in
cell metabolism such as cell growth or pH changes, and changes in intracellular second
messengers such as cGMP. In the assays of the invention, mammalian breast cancer
polypeptide is typically used, e.g., mouse, preferably human.

Assays to identify compounds with modulating activity can be performed in
vitro. For example, a breast cancer polypeptide is first contacted with a potential modulator
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment,
the breast cancer polypeptide levels are determined in vitro by measuring the level of protein
or mRNA. The level of protein is measured using immunoassays such as western blotting,
ELISA and the like with an antibody that selectively binds to the breast cancer polypeptide or
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are
preferred. The level of protein or mRNA is detected using directly or indirectly labeled
detection agents, e.g., fluorescently or radioactively labeled nucleic acids, radioactively or
enzymatically labeled antibodies, and the like, as described herein.

Alternatively, a reporter gene system can be devised using the breast cancer
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent
protein, CAT, or β-gal. The reporter construct is typically transfected into a cell. After
treatment with a potential modulator, the amount of reporter gene transcription, translation, or
activity is measured according to standard techniques known to those of skill in the art.

In a preferred embodiment, as outlined above, screens may be done on
individual genes and gene products (proteins). That is, having identified a particular
differentially expressed gene as important in a particular state, screening of modulators of the
expression of the gene or the gene product itself can be done. The gene products of
differentially expressed genes are sometimes referred to herein as "breast cancer proteins."
The breast cancer protein may be a fragment, or alternatively, be the full length protein to a
fragment shown herein.

In one embodiment, screening for modulators of expression of specific genes
is performed. Typically, the expression of only one or a few genes is evaluated. In another
embodiment, screens are designed to first find compounds that bind to differentially
expressed proteins. These compounds are then evaluated for the ability to modulate
differentially expressed activity. Moreover, once initial candidate compounds are identified,
variants can be further screened to better evaluate structure activity relationships.

In a preferred embodiment, binding assays are done. In general, purified or
isolated gene product is used; that is, the gene products of one or more differentially
expressed nucleic acids are made. For example, antibodies are generated to the protein gene
products, and standard immunoassays are run to determine the amount of protein present.
Alternatively, cells comprising the breast cancer proteins can be used in the assays.

Thus, in a preferred embodiment, the methods comprise combining a breast
cancer protein and a candidate compound, and determining the binding of the compound to
the breast cancer protein. Preferred embodiments utilize the human breast cancer protein,
although other mammalian proteins may also be used, e.g., for the development of animal
models of human disease. In some embodiments, as outlined herein, variant or derivative
breast cancer proteins may be used.

Generally, in a preferred embodiment of the methods herein, the breast cancer
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated
sample receiving areas (e.g., a microtiter plate, an array, etc.). The insoluble supports may be
made of any composition to which the compositions can be bound, that is readily separated from
soluble material, and that is otherwise compatible with the overall method of screening. The
surface of such supports may be solid or porous and of any convenient shape. Examples of
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose,
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number
of assays can be carried out simultaneously, using small amounts of reagents and samples.
The particular manner of binding of the composition is not crucial so long as it is compatible
with the reagents and overall methods of the invention, maintains the activity of the
composition and is nondiffusable. Preferred methods of binding include the use of antibodies
(which do not sterically block either the ligand binding site or activation sequence when the
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of
the protein or agent, excess unbound material is removed by washing. The sample receiving
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or
other innocuous protein or other moiety.

In a preferred embodiment, the breast cancer protein is bound to the support,
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the
support and the breast cancer protein is added. Novel binding agents include specific
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide
analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for
human cells. A wide variety of assays may be used for this purpose, including labeled in
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for
protein binding, functional assays (phosphorylation assays, etc.) and the like.

The determination of the binding of the test modulating compound to the
breast cancer protein may be done in a number of ways. In a preferred embodiment, the
compound is labeled, and binding determined directly, e.g., by attaching all or a portion of
the breast cancer protein to a solid support, adding a labeled candidate agent (e.g., a
fluorescent label), washing off excess reagent, and determining whether the label is present
on the solid support. Various blocking and washing steps may be utilized as appropriate.

In some embodiments, only one of the components is labeled, e.g., the
proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than
one component can be labeled with different labels, e.g., 125I for the proteins and a fluorophore
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents, are also
useful.

In one embodiment, the binding of the test compound is determined by a
competitive binding assay. The competitor is a binding moiety known to bind to the target
molecule (i.e., a breast cancer protein), such as an antibody, peptide, binding partner, ligand,
etc. Under certain circumstances, there may be competitive binding between the compound
and the binding moiety, with the binding moiety displacing the compound. In one
embodiment, the test compound is labeled. Either the compound, or the competitor, or both,
is added first to the protein for a time sufficient to allow binding, if present. Incubations may
be performed at a temperature which facilitates optimal activity, typically between 4 and
40°C. Incubation periods are typically optimized, e.g., to facilitate rapid high throughput
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally
removed or washed away. The second component is then added, and the presence or absence
of the labeled component is followed, to indicate binding.

In a preferred embodiment, the competitor is added first, followed by the test
compound. Displacement of the competitor is an indication that the test compound is binding
to the breast cancer protein and thus is capable of binding to, and potentially modulating, the
activity of the breast cancer protein. In this embodiment, either component can be labeled.
Thus, e.g., if the competitor is labeled, the presence of label in the wash solution indicates
displacement by the agent. Alternatively, if the test compound is labeled, the presence of the
label on the support indicates displacement.

In an alternative embodiment, the test compound is added first, with
incubation and washing, followed by the competitor. The absence of binding by the
competitor may indicate that the test compound is bound to the breast cancer protein with a
higher affinity. Thus, if the test compound is labeled, the presence of the label on the
support, coupled with a lack of competitor binding, may indicate that the test compound is
capable of binding to the breast cancer protein.

In a preferred embodiment, the methods comprise differential screening to
identify agents that are capable of modulating the activity of the breast cancer proteins. In
this embodiment, the methods comprise combining a breast cancer protein and a competitor
in a first sample. A second sample comprises a test compound, a breast cancer protein, and a
competitor. The binding of the competitor is determined for both samples, and a change, or
difference in binding between the two samples indicates the presence of an agent capable of
binding to the breast cancer protein and potentially modulating its activity. That is, if the
binding of the competitor is different in the second sample relative to the first sample, the
agent is capable of binding to the breast cancer protein.
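By way of illustration only, the comparison of competitor binding between the first and second samples can be sketched in Python as follows. The function name, the use of raw label counts as the binding measure, and the requirement of at least three replicates per sample are assumptions made for this sketch; no particular threshold for calling a difference meaningful is implied.

```python
from statistics import mean


def percent_change_in_competitor_binding(first_sample, second_sample):
    """Percent reduction in competitor binding in the second sample
    (protein + competitor + test compound) relative to the first sample
    (protein + competitor only).

    Both arguments are replicate measurements of bound labeled competitor,
    e.g., counts from a scintillation counter (hypothetical units).
    A positive result means less competitor bound when the test compound
    was present.
    """
    if len(first_sample) < 3 or len(second_sample) < 3:
        # Assumption for this sketch: replicates in at least triplicate.
        raise ValueError("at least three replicates per sample are expected")
    baseline = mean(first_sample)
    if baseline <= 0:
        raise ValueError("baseline competitor binding must be positive")
    return 100.0 * (baseline - mean(second_sample)) / baseline


if __name__ == "__main__":
    # Hypothetical replicate counts, for illustration only.
    without_compound = [1000, 1100, 900]
    with_compound = [500, 550, 450]
    change = percent_change_in_competitor_binding(without_compound, with_compound)
    print(f"competitor binding reduced by {change:.1f}%")
```

Whether a given difference is significant would still need to be judged against the positive and negative controls run alongside the samples.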

Alternatively, differential screening is used to identify drug candidates that
bind to the native breast cancer protein, but cannot bind to modified breast cancer proteins.
The structure of the breast cancer protein may be modeled, and used in rational drug design
to synthesize agents that interact with that site. Drug candidates that affect the activity of a
breast cancer protein are also identified by screening drugs for the ability to either enhance or
reduce the activity of the protein.

Positive controls and negative controls may be used in the assays. Preferably,
control and test samples are performed in at least triplicate to obtain statistically significant
results. Incubation of all samples is for a time sufficient for the binding of the agent to the
protein. Following incubation, samples are washed free of non-specifically bound material
and the amount of bound, generally labeled agent determined. For example, where a
radiolabel is employed, the samples may be counted in a scintillation counter to determine the
amount of bound compound.

A variety of other reagents may be included in the screening assays. These
include reagents like salts, neutral proteins, e.g., albumin, detergents, etc., which may be used
to facilitate optimal protein-protein binding and/or reduce non-specific or background
interactions. Also reagents that otherwise improve the efficiency of the assay, such as
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture
of components may be added in an order that provides for the requisite binding.

In a preferred embodiment, the invention provides methods for screening for a
compound capable of modulating the activity of a breast cancer protein. The methods
comprise adding a test compound, as defined above, to a cell comprising breast cancer
proteins. Preferred cell types include almost any cell. The cells contain a recombinant
nucleic acid that encodes a breast cancer protein. In a preferred embodiment, a library of
candidate agents is tested on a plurality of cells.

In one aspect, the assays are evaluated in the presence or absence of, or with previous
or subsequent exposure to, physiological signals, e.g., hormones, antibodies, peptides,
antigens, cytokines, growth factors, action potentials, pharmacological agents including
chemotherapeutics, radiation, carcinogenics, or other cells (i.e., cell-cell contacts). In another
example, the determinations are made at different stages of the cell cycle process.

In this way, compounds that modulate breast cancer agents are identified.
Compounds with pharmacological activity are able to enhance or interfere with the activity of
the breast cancer protein. Once identified, similar structures are evaluated to identify critical
structural features of the compound.

In one embodiment, a method of inhibiting breast cancer cell division is
provided. The method comprises administration of a breast cancer inhibitor. In another
embodiment, a method of inhibiting breast cancer is provided. The method comprises
administration of a breast cancer inhibitor. In a further embodiment, methods of treating cells
or individuals with breast cancer are provided. The method comprises administration of a
breast cancer inhibitor.

In one embodiment, a breast cancer inhibitor is an antibody as discussed
above. In another embodiment, the breast cancer inhibitor is an antisense molecule.

A variety of cell growth, proliferation, and metastasis assays are known to
those of skill in the art, as described below.

Soft agar growth or colony formation in suspension 
Normal cells require a solid substrate to attach and grow. When the cells are
transformed, they lose this phenotype and grow detached from the substrate. For example,
transformed cells can grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar. The transformed cells, when transfected with tumor
suppressor genes, regenerate normal phenotype and require a solid substrate to attach and
grow. Soft agar growth or colony formation in suspension assays can be used to identify
modulators of breast cancer sequences, which when expressed in host cells, inhibit abnormal
cellular proliferation and transformation. A therapeutic compound would reduce or eliminate
the host cells' ability to grow in stirred suspension culture or suspended in semi-solid media,
such as semi-solid or soft agar.

Techniques for soft agar growth or colony formation in suspension assays are
described in Freshney, Culture of Animal Cells: A Manual of Basic Technique (3rd ed., 1994),
herein incorporated by reference. See also, the methods section of Garkavtsev et al. (1996),
supra, herein incorporated by reference.

Contact inhibition and density limitation of growth 

Normal cells typically grow in a flat and organized pattern in a petri dish until
they touch other cells. When the cells touch one another, they are contact inhibited and stop
growing. When cells are transformed, however, the cells are not contact inhibited and
continue to grow to high densities in disorganized foci. Thus, the transformed cells grow to a
higher saturation density than normal cells. This can be detected morphologically by the
formation of a disoriented monolayer of cells or rounded cells in foci within the regular
pattern of normal surrounding cells. Alternatively, labeling index with (3H)-thymidine at
saturation density can be used to measure density limitation of growth. See Freshney (1994),
supra. The transformed cells, when transfected with tumor suppressor genes, regenerate a
normal phenotype and become contact inhibited and would grow to a lower density.

In this assay, labeling index with (3H)-thymidine at saturation density is a
preferred method of measuring density limitation of growth. Transformed host cells are
transfected with a breast cancer-associated sequence and are grown for 24 hours at saturation
density in non-limiting medium conditions. The percentage of cells labeling with (3H)-
thymidine is determined autoradiographically. See, Freshney (1994), supra.
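As a simple illustration, the labeling index described above is the percentage of scored cells that carry label. The following Python sketch assumes that labeled and total cell counts have already been obtained from the autoradiograph; the function name and the example counts are hypothetical.

```python
def labeling_index(labeled_cells, total_cells):
    """Percentage of scored cells that incorporated (3H)-thymidine.

    labeled_cells: number of cells scored as labeled on the autoradiograph
    total_cells:   total number of cells scored in the same fields
    """
    if total_cells <= 0:
        raise ValueError("total_cells must be positive")
    if not 0 <= labeled_cells <= total_cells:
        raise ValueError("labeled_cells must lie between 0 and total_cells")
    return 100.0 * labeled_cells / total_cells


if __name__ == "__main__":
    # Hypothetical counts for transfected cells and untransfected controls.
    print(labeling_index(45, 300))   # transfected host cells
    print(labeling_index(180, 300))  # control transformed cells
```

A lower index in the transfected cells than in the control at saturation density would be consistent with restored density limitation of growth.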

Growth factor or serum dependence 

Transformed cells have a lower serum dependence than their normal
counterparts (see, e.g., Temin, J. Natl. Cancer Inst. 37:167-175 (1966); Eagle et al., J. Exp.
Med. 131:836-879 (1970); Freshney, supra). This is in part due to release of various growth
factors by the transformed cells. Growth factor or serum dependence of transformed host
cells can be compared with that of control.

Tumor specific marker levels

Tumor cells release a greater amount of certain factors (hereinafter "tumor
specific markers") than their normal counterparts. For example, plasminogen activator (PA)
is released from human glioma at a higher level than from normal brain cells (see, e.g.,
Gullino, Angiogenesis, tumor vascularization, and potential interference with tumor growth,
in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985)). Similarly, tumor
angiogenesis factor (TAF) is released at a higher level in tumor cells than their normal
counterparts. See, e.g., Folkman, Angiogenesis and Cancer, Sem. Cancer Biol. (1992).

Various techniques which measure the release of these factors are described in
Freshney (1994), supra. Also, see, Unkless et al., J. Biol. Chem. 249:4295-4305 (1974);
Strickland & Beers, J. Biol. Chem. 251:5694-5702 (1976); Whur et al., Br. J. Cancer 42:305-312
(1980); Gullino, Angiogenesis, tumor vascularization, and potential interference with
tumor growth, in Biological Responses in Cancer, pp. 178-184 (Mihich (ed.) 1985);
Freshney, Anticancer Res. 5:111-130 (1985).

Invasiveness into Matrigel

The degree of invasiveness into Matrigel or some other extracellular matrix
constituent can be used as an assay to identify compounds that modulate breast
cancer-associated sequences. Tumor cells exhibit a good correlation between malignancy and
invasiveness of cells into Matrigel or some other extracellular matrix constituent. In this
assay, tumorigenic cells are typically used as host cells. Expression of a tumor suppressor
gene in these host cells would decrease invasiveness of the host cells.

Techniques described in Freshney (1994), supra, can be used. Briefly, the
level of invasion of host cells can be measured by using filters coated with Matrigel or some
other extracellular matrix constituent. Penetration into the gel, or through to the distal side of
the filter, is rated as invasiveness, and rated histologically by number of cells and distance
moved, or by prelabeling the cells with 125I and counting the radioactivity on the distal side of
the filter or bottom of the dish. See, e.g., Freshney (1984), supra.



Tumor growth in vivo 

Effects of breast cancer-associated sequences on cell growth can be tested in
transgenic or immune-suppressed mice. Knock-out transgenic mice can be made, in which
the breast cancer gene is disrupted or in which a breast cancer gene is inserted. Knock-out
transgenic mice can be made by insertion of a marker gene or other heterologous gene into
the endogenous breast cancer gene site in the mouse genome via homologous recombination.
Such mice can also be made by substituting the endogenous breast cancer gene with a
mutated version of the breast cancer gene, or by mutating the endogenous breast cancer gene,
e.g., by exposure to carcinogens.

A DNA construct is introduced into the nuclei of embryonic stem cells. Cells
containing the newly engineered genetic lesion are injected into a host mouse embryo, which
is re-implanted into a recipient female. Some of these embryos develop into chimeric mice
that possess germ cells partially derived from the mutant cell line. Therefore, by breeding the
chimeric mice it is possible to obtain a new line of mice containing the introduced genetic
lesion (see, e.g., Capecchi et al., Science 244:1288 (1989)). Chimeric targeted mice can be
derived according to Hogan et al., Manipulating the Mouse Embryo: A Laboratory Manual,
Cold Spring Harbor Laboratory (1988) and Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach, Robertson, ed., IRL Press, Washington, D.C., (1987).

Alternatively, various immune-suppressed or immune-deficient host animals
can be used. For example, a genetically athymic "nude" mouse (see, e.g., Giovanella et al., J.
Natl. Cancer Inst. 52:921 (1974)), a SCID mouse, a thymectomized mouse, or an irradiated
mouse (see, e.g., Bradley et al., Br. J. Cancer 38:263 (1978); Selby et al., Br. J. Cancer
41:52 (1980)) can be used as a host. Transplantable tumor cells (typically about 10^6 cells)
injected into isogenic hosts will produce invasive tumors in a high proportion of cases, while
normal cells of similar origin will not. In hosts which developed invasive tumors, cells
expressing a breast cancer-associated sequence are injected subcutaneously. After a suitable
length of time, preferably 4-8 weeks, tumor growth is measured (e.g., by volume or by its two
largest dimensions) and compared to the control. Tumors that have statistically significant
reduction (using, e.g., Student's t test) are said to have inhibited growth.
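The comparison of tumor growth against the control can be sketched in Python as follows. This is an illustrative sketch only: the volume approximation (length x width^2 / 2, computed from the two largest dimensions) is a commonly used caliper formula assumed here and is not specified in the text above, and the function names and measurements are hypothetical. The sketch computes the pooled-variance Student's t statistic; the corresponding p-value or critical value must be looked up separately for the returned degrees of freedom.

```python
import math
from statistics import mean, variance


def tumor_volume(length_mm, width_mm):
    """Approximate tumor volume (mm^3) from its two largest dimensions.

    Assumption: the caliper approximation V = length * width^2 / 2.
    """
    return length_mm * width_mm ** 2 / 2.0


def student_t(sample_a, sample_b):
    """Two-sample Student's t statistic with pooled variance.

    Returns (t, degrees_of_freedom). Each sample needs at least two values.
    """
    na, nb = len(sample_a), len(sample_b)
    if na < 2 or nb < 2:
        raise ValueError("each group needs at least two measurements")
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(sample_a) + (nb - 1) * variance(sample_b)) / df
    if pooled_var == 0:
        raise ValueError("pooled variance is zero; t is undefined")
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df


if __name__ == "__main__":
    # Hypothetical (length, width) caliper readings in mm after 6 weeks.
    control = [tumor_volume(l, w) for l, w in [(12, 9), (13, 10), (11, 9), (14, 10)]]
    treated = [tumor_volume(l, w) for l, w in [(8, 6), (9, 6), (7, 5), (8, 7)]]
    t, df = student_t(treated, control)
    print(f"t = {t:.2f} with {df} degrees of freedom")
```

A negative t statistic here indicates smaller tumors in the treated group; significance is judged by comparing t to the critical value for the chosen significance level.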

Polynucleotide modulators of breast cancer 

Antisense Polynucleotides

In certain embodiments, the activity of a breast cancer-associated protein is
down-regulated, or entirely inhibited, by the use of an antisense polynucleotide, i.e., a nucleic
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA
nucleic acid sequence, e.g., a breast cancer protein mRNA, or a subsequence thereof.
Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability
of the mRNA.

In the context of this invention, antisense polynucleotides can comprise
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring
subunits or their close homologs. Antisense polynucleotides may also have altered sugar
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other
sulfur containing species which are known for use in the art. Analogs are comprehended by
this invention so long as they function effectively to hybridize with the breast cancer protein
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA.

Such antisense polynucleotides can readily be synthesized using recombinant
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as
phosphorothioates and alkylated derivatives is also well known to those of skill in the art.

Antisense molecules as used herein include antisense or sense
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block transcription by
binding to the anti-sense strand. The antisense and sense oligonucleotides comprise a
single-stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA
(sense) or DNA (antisense) sequences for breast cancer molecules. A preferred antisense
molecule is for a breast cancer sequence in Tables 1-25, or for a ligand or activator thereof.
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence
encoding a given protein is described in, e.g., Stein & Cohen (Cancer Res. 48:2659 (1988)) and
van der Krol et al. (BioTechniques 6:958 (1988)).
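For illustration, deriving a candidate antisense oligonucleotide from a cDNA sequence amounts to taking the reverse complement of a window of about 14 to 30 nucleotides. The Python sketch below assumes the cDNA is supplied as the coding (sense) strand written 5' to 3'; the function name, the default length, and the example sequence are hypothetical and do not correspond to any sequence in Tables 1-25. The sketch does not address target-site selection, specificity, or chemical modification.

```python
# Translation table for DNA base complements (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense_oligo(cdna, start, length=20):
    """Return the DNA antisense oligonucleotide (5'->3') for a cDNA window.

    cdna:   coding-strand cDNA sequence, 5'->3' (assumed input convention)
    start:  0-based index of the first nucleotide of the target window
    length: oligonucleotide length; restricted here to 14-30 nucleotides
    """
    if not 14 <= length <= 30:
        raise ValueError("length should be from 14 to 30 nucleotides")
    if start < 0:
        raise ValueError("start must be non-negative")
    fragment = cdna[start:start + length]
    if len(fragment) != length:
        raise ValueError("window extends past the end of the cDNA")
    if set(fragment.upper()) - set("ACGT"):
        raise ValueError("cDNA window contains non-ACGT characters")
    # Complement each base, then reverse so the result reads 5'->3'.
    return fragment.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    example_cdna = "ATGGCCAAGTTCGTACGTTAGC"  # made-up sequence
    print(antisense_oligo(example_cdna, 0, 14))
```

A sense oligonucleotide for the same window would simply be the window itself.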

Ribozymes

In addition to antisense polynucleotides, ribozymes can be used to target and
inhibit transcription of breast cancer-associated nucleotide sequences. A ribozyme is an
RNA molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes
have been described, including group I ribozymes, hammerhead ribozymes, hairpin
ribozymes, RNase P, and axehead ribozymes (see, e.g., Castanotto et al., Adv. in
Pharmacology 25:289-317 (1994) for a general review of the properties of different
ribozymes).

The general features of hairpin ribozymes are described, e.g., in Hampel et al.,
Nucl. Acids Res. 18:299-304 (1990); European Patent Publication No. 0 360 257; U.S. Patent
No. 5,254,678. Methods of preparing are well known to those of skill in the art (see, e.g.,
WO 94/26877; Ojwang et al., Proc. Natl. Acad. Sci. USA 90:6340-6344 (1993); Yamada et
al., Human Gene Therapy 1:39-45 (1994); Leavitt et al., Proc. Natl. Acad. Sci. USA 92:699-703
(1995); Leavitt et al., Human Gene Therapy 5:1151-120 (1994); and Yamada et al.,
Virology 205:121-126 (1994)).

Polynucleotide modulators of breast cancer may be introduced into a cell
containing the target nucleotide sequence by formation of a conjugate with a ligand binding
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that
bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does
not substantially interfere with the ability of the ligand binding molecule to bind to its
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of breast
cancer may be introduced into a cell containing the target nucleic acid sequence, e.g., by
formation of a polynucleotide-lipid complex, as described in WO 90/10448. It is
understood that antisense molecules or knock out and knock in models may also be
used in screening assays as discussed above, in addition to methods of treatment.

Thus, in one embodiment, methods of modulating breast cancer in cells or
organisms are provided. In one embodiment, the methods comprise administering to a cell an
anti-breast cancer antibody that reduces or eliminates the biological activity of an endogenous
breast cancer protein. Alternatively, the methods comprise administering to a cell or
organism a recombinant nucleic acid encoding a breast cancer protein. This may be
accomplished in any number of ways. In a preferred embodiment, e.g., when the breast
cancer sequence is down-regulated in breast cancer, such state may be reversed by increasing
the amount of breast cancer gene product in the cell. This can be accomplished, e.g., by
overexpressing the endogenous breast cancer gene or administering a gene encoding the
breast cancer sequence, using known gene-therapy techniques. In a preferred
embodiment, the gene therapy techniques include the incorporation of the exogenous gene
using enhanced homologous recombination (EHR), e.g., as described in PCT/US93/03868,
hereby incorporated by reference in its entirety. Alternatively, e.g., when the breast cancer
sequence is up-regulated in breast cancer, the activity of the endogenous breast cancer gene is
decreased, e.g., by the administration of a breast cancer antisense nucleic acid.

In one embodiment, the breast cancer proteins of the present invention may be
used to generate polyclonal and monoclonal antibodies to breast cancer proteins. Similarly,
the breast cancer proteins can be coupled, using standard technology, to affinity
chromatography columns. These columns may then be used to purify breast cancer
antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred
embodiment, the antibodies are generated to epitopes unique to a breast cancer protein; that
is, the antibodies show little or no cross-reactivity to other proteins. The breast cancer
antibodies may be coupled to standard affinity chromatography columns and used to purify
breast cancer proteins. The antibodies may also be used as blocking polypeptides, as outlined
above, since they will specifically bind to the breast cancer protein.

Methods of identifying variant breast cancer-associated sequences 
Without being bound by theory, expression of various breast cancer sequences
is correlated with breast cancer. Accordingly, disorders based on mutant or variant breast
cancer genes may be determined. In one embodiment, the invention provides methods for
identifying cells containing variant breast cancer genes, e.g., by determining all or part of the
sequence of at least one endogenous breast cancer gene in a cell. This may be accomplished
using any number of sequencing techniques. In a preferred embodiment, the invention
provides methods of identifying the breast cancer genotype of an individual, e.g., by determining
all or part of the sequence of at least one breast cancer gene of the individual. This is
generally done in at least one tissue of the individual, and may include the evaluation of a
number of tissues or different samples of the same tissue. The method may include
comparing the sequence of the sequenced breast cancer gene to a known breast cancer gene,
i.e., a wild-type gene.

The sequence of all or part of the breast cancer gene can then be compared to
the sequence of a known breast cancer gene to determine if any differences exist. This can be
done using any number of known homology programs, such as Bestfit, etc. In a preferred
embodiment, the presence of a difference in the sequence between the breast cancer gene of
the patient and the known breast cancer gene correlates with a disease state or a propensity
for a disease state, as outlined herein.
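As a minimal illustration of the comparison step, the Python sketch below lists the positions at which a sequenced gene differs from a known (wild-type) gene. It is not a substitute for an alignment program such as Bestfit: it assumes the two sequences have already been aligned, are of equal length, and contain no gaps. The function name and example sequences are hypothetical.

```python
def find_differences(known_gene, sample_gene):
    """List single-position differences between two pre-aligned sequences.

    Returns a list of (position, known_base, sample_base) tuples, where
    position is 1-based. Comparison is case-insensitive.
    Assumption: sequences are already aligned, equal in length, and ungapped.
    """
    if len(known_gene) != len(sample_gene):
        raise ValueError("sequences must be pre-aligned to the same length")
    differences = []
    pairs = zip(known_gene.upper(), sample_gene.upper())
    for position, (known_base, sample_base) in enumerate(pairs, start=1):
        if known_base != sample_base:
            differences.append((position, known_base, sample_base))
    return differences


if __name__ == "__main__":
    wild_type = "ATGCCGTA"  # made-up reference fragment
    patient = "ATGACGTA"    # made-up patient fragment
    for position, ref, alt in find_differences(wild_type, patient):
        print(f"position {position}: {ref} -> {alt}")
```

Insertions and deletions would require a true alignment step before a comparison of this kind is meaningful.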

In a preferred embodiment, the breast cancer genes are used as probes to
determine the number of copies of the breast cancer gene in the genome.

In another preferred embodiment, the breast cancer genes are used as probes to
determine the chromosomal localization of the breast cancer genes. Information such as
chromosomal localization finds use in providing a diagnosis or prognosis in particular when
chromosomal abnormalities such as translocations, and the like are identified in the breast
cancer gene locus.

Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of a breast cancer protein
or modulator thereof is administered to a patient. By "therapeutically effective dose" herein
is meant a dose that produces effects for which it is administered. The exact dose will depend
on the purpose of the treatment, and will be ascertainable by one skilled in the art using
known techniques (e.g., Ansel et al., Pharmaceutical Dosage Forms and Drug Delivery;
Lieberman, Pharmaceutical Dosage Forms (vols. 1-3, 1992), Dekker, ISBN 0824770846,
082476918X, 0824712692, 0824716981; Lloyd, The Art, Science and Technology of
Pharmaceutical Compounding (1999); and Pickar, Dosage Calculations (1999)). As is
known in the art, adjustments for breast cancer degradation, systemic versus localized
delivery, and rate of new protease synthesis, as well as the age, body weight, general health,
sex, diet, time of administration, drug interaction and the severity of the condition may be
necessary, and will be ascertainable with routine experimentation by those skilled in the art.
U.S. Patent Application No. 09/687,576, which further discloses the use of compositions and
methods of diagnosis and treatment in breast cancer, is hereby expressly incorporated by
reference.

A "patient" for the purposes of the present invention includes both humans
and other animals, particularly mammals. Thus the methods are applicable to both human
therapy and veterinary applications. In the preferred embodiment the patient is a mammal,
preferably a primate, and in the most preferred embodiment the patient is human.

The administration of the breast cancer proteins and modulators thereof of the
present invention can be done in a variety of ways as discussed above, including, but not
limited to, orally, subcutaneously, intravenously, intranasally, transdermally,
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In
some instances, e.g., in the treatment of wounds and inflammation, the breast cancer proteins
and modulators may be directly applied as a solution or spray.

The pharmaceutical compositions of the present invention comprise a breast
cancer protein in a form suitable for administration to a patient. In the preferred embodiment,
the pharmaceutical compositions are in a water soluble form, such as being present as
pharmaceutically acceptable salts, which is meant to include both acid and base addition
salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain the
biological effectiveness of the free bases and that are not biologically or otherwise
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid,
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid,
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic
acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid,
methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like.
"Pharmaceutically acceptable base addition salts" include those derived from inorganic bases
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper,
manganese, aluminum salts and the like. Particularly preferred are the ammonium,
potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutically
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines,
substituted amines including naturally occurring substituted amines, cyclic amines and basic
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine,
tripropylamine, and ethanolamine.

The pharmaceutical compositions may also include one or more of the
following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring
agents; coloring agents; and polyethylene glycol.

The pharmaceutical compositions can be administered in a variety of unit
dosage forms depending upon the method of administration. For example, unit dosage forms
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules
and lozenges. It is recognized that breast cancer protein modulators (e.g., antibodies,
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally,
should be protected from digestion. This is typically accomplished either by complexing the
molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a
protection barrier. Means of protecting agents from digestion are well known in the art.

The compositions for administration will commonly comprise a breast cancer
protein modulator dissolved in a pharmaceutically acceptable carrier, preferably an aqueous
carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These
solutions are sterile and generally free of undesirable matter. These compositions may be
sterilized by conventional, well known sterilization techniques. The compositions may
contain pharmaceutically acceptable auxiliary substances as required to approximate
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents
and the like, e.g., sodium acetate, sodium chloride, potassium chloride, calcium chloride,
sodium lactate and the like. The concentration of active agent in these formulations can vary
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and
the like in accordance with the particular mode of administration selected and the patient's
needs (e.g., Remington's Pharmaceutical Science (15th ed., 1980) and Goodman & Gilman,
The Pharmacological Basis of Therapeutics (Hardman et al., eds., 1996)).

Thus, a typical pharmaceutical composition for intravenous administration
would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per
patient per day may be used, particularly when the drug is administered to a secluded site and
not into the blood stream, such as into a body cavity or into a lumen of an organ.
Substantially higher dosages are possible in topical administration. Actual methods for
preparing parenterally administrable compositions will be known or apparent to those skilled
in the art, e.g., Remington's Pharmaceutical Science and Goodman and Gilman, The
Pharmacological Basis of Therapeutics, supra.

The compositions containing modulators of breast cancer proteins can be
administered for therapeutic or prophylactic treatments. In therapeutic applications,
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an
amount sufficient to cure or at least partially arrest the disease and its complications. An
amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts
effective for this use will depend upon the severity of the disease and the general state of the
patient's health. Single or multiple administrations of the compositions may be administered
depending on the dosage and frequency as required and tolerated by the patient. In any event,
the composition should provide a sufficient quantity of the agents of this invention to
effectively treat the patient. An amount of modulator that is capable of preventing or slowing
the development of cancer in a mammal is referred to as a "prophylactically effective dose."
The particular dose required for a prophylactic treatment will depend upon the medical
condition and history of the mammal, the particular cancer being prevented, as well as other
factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood
of developing cancer.

It will be appreciated that the present breast cancer protein-modulating
compounds can be administered alone or in combination with additional breast cancer
modulating compounds or with other therapeutic agents, e.g., other anti-cancer agents or
treatments.

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides
comprising nucleic acid sequences set forth in Tables 1-25, such as antisense polynucleotides
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides
methods, reagents, vectors, and cells useful for expression of breast cancer-associated
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or
organism-based) recombinant expression systems.

The particular procedure used to introduce the nucleic acids into a host cell for
expression of a protein or nucleic acid is application specific. Many procedures for
introducing foreign nucleotide sequences into host cells may be used. These include the use
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection,
plasmid vectors, viral vectors and any of the other well known methods for introducing cloned
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see,
e.g., Berger & Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology
volume 152 (Berger), Ausubel et al., eds., Current Protocols (supplemented through 1999),
and Sambrook et al., Molecular Cloning - A Laboratory Manual (2nd ed., Vol. 1-3, 1989)).

In a preferred embodiment, breast cancer proteins and modulators are
administered as therapeutic agents, and can be formulated as outlined above. Similarly,
breast cancer genes (including the full-length sequence, partial sequences, or regulatory
sequences of the breast cancer coding regions) can be administered in a gene therapy
application. These breast cancer genes can include antisense applications, either as gene
therapy (i.e., for incorporation into the genome) or as antisense compositions, as will be
appreciated by those in the art.

Breast cancer polypeptides and polynucleotides can also be administered as
vaccine compositions to stimulate HTL, CTL and antibody responses. Such vaccine
compositions can include, e.g., lipidated peptides (see, e.g., Vitiello, A. et al., J. Clin. Invest.
95:341 (1995)), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) ("PLG")
microspheres (see, e.g., Eldridge, et al., Molec. Immunol. 28:287-294 (1991); Alonso et al.,
Vaccine 12:299-306 (1994); Jones et al., Vaccine 13:675-681 (1995)), peptide compositions
contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et al., Nature
344:873-875 (1990); Hu et al., Clin Exp Immunol. 113:235-243 (1998)), multiple antigen
peptide systems (MAPs) (see, e.g., Tam, Proc. Natl. Acad. Sci. U.S.A. 85:5409-5413 (1988);
Tam, J. Immunol. Methods 196:17-32 (1996)), peptides formulated as multivalent peptides;
peptides for use in ballistic delivery systems, typically crystallized peptides, viral delivery
vectors (Perkus, et al., In: Concepts in vaccine development (Kaufmann, ed., p. 379, 1996);
Chakrabarti, et al., Nature 320:535 (1986); Hu et al., Nature 320:537 (1986); Kieny, et al.,
AIDS Bio/Technology 4:790 (1986); Top et al., J. Infect. Dis. 124:148 (1971); Chanda et al.,
Virology 175:535 (1990)), particles of viral or synthetic origin (see, e.g., Kofler et al., J.
Immunol. Methods. 192:25 (1996); Eldridge et al., Sem. Hematol. 30:16 (1993); Falo et al.,
Nature Med. 1:649 (1995)), adjuvants (Warren et al., Annu. Rev. Immunol. 4:369 (1986);
Gupta et al., Vaccine 11:293 (1993)), liposomes (Reddy et al., J. Immunol. 148:1585 (1992);
Rock, Immunol. Today 17:131 (1996)), or naked or particle absorbed cDNA (Ulmer, et al.,
Science 259:1745 (1993); Robinson et al., Vaccine 11:957 (1993); Shiver et al., In: Concepts
in vaccine development (Kaufmann, ed., p. 423, 1996); Cease & Berzofsky, Annu. Rev.
Immunol. 12:923 (1994) and Eldridge et al., Sem. Hematol. 30:16 (1993)). Toxin-targeted
delivery technologies, also known as receptor mediated targeting, such as those of Avant
Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used.

Vaccine compositions often include adjuvants. Many adjuvants contain a
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide
or mineral oil, and a stimulator of immune responses, such as lipid A, Bordetella pertussis or
Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available
as, e.g., Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit,
MI); Merck Adjuvant 65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline
Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel (alum) or
aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of acylated
tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides;
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A.
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be
used as adjuvants.

Vaccines can be administered as nucleic acid compositions wherein DNA or
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a
patient. This approach is described, for instance, in Wolff et al., Science 247:1465 (1990) as
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647;
WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies
include "naked DNA", facilitated (bupivacaine, polymers, peptide-mediated) delivery,
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery
(see, e.g., U.S. Patent No. 5,922,687).

For therapeutic or prophylactic immunization purposes, the peptides of the
invention can be expressed by viral or bacterial vectors. Examples of expression vectors
include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of
vaccinia virus, e.g., as a vector to express nucleotide sequences that encode breast cancer
polypeptides or polypeptide fragments. Upon introduction into a host, the recombinant
vaccinia virus expresses the immunogenic peptide, and thereby elicits an immune response.
Vaccinia vectors and methods useful in immunization protocols are described in, e.g., U.S.
Patent No. 4,722,848. Another vector is BCG (Bacille Calmette Guerin). BCG vectors are
described in Stover et al., Nature 351:456-460 (1991). A wide variety of other vectors useful
for therapeutic administration or immunization, e.g., adeno and adeno-associated virus vectors,
retroviral vectors, Salmonella typhi vectors, detoxified anthrax toxin vectors, and the like,
will be apparent to those skilled in the art from the description herein (see, e.g., Shata et al.,
Mol Med Today 6:66-71 (2000); Shedlock et al., J Leukoc Biol 68:793-806 (2000); Hipp et
al., In Vivo 14:571-85 (2000)).

Methods for the use of genes as DNA vaccines are well known, and include
placing a breast cancer gene or portion of a breast cancer gene under the control of a
regulatable promoter or a tissue-specific promoter for expression in a breast cancer patient.
The breast cancer gene used for DNA vaccines can encode full-length breast cancer proteins,
but more preferably encodes portions of the breast cancer proteins including peptides derived
from the breast cancer protein. In one embodiment, a patient is immunized with a DNA
vaccine comprising a plurality of nucleotide sequences derived from a breast cancer gene. For
example, breast cancer-associated genes or sequences encoding subfragments of a breast
cancer protein are introduced into expression vectors and tested for their immunogenicity in
the context of Class I MHC and an ability to generate cytotoxic T cell responses. This
procedure provides for production of cytotoxic T cell responses against cells which present
antigen, including intracellular epitopes.

In a preferred embodiment, the DNA vaccines include a gene encoding an
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that
increase the immunogenic response to the breast cancer polypeptide encoded by the DNA
vaccine. Additional or alternative adjuvants are available.

In another preferred embodiment, breast cancer genes find use in generating
animal models of breast cancer. When the breast cancer gene identified is repressed or
diminished in cancer tissue, gene therapy technology, e.g., antisense RNA directed
to the breast cancer gene, will also diminish or repress expression of the gene. Animal models
of breast cancer find use in screening for modulators of a breast cancer-associated sequence
or modulators of breast cancer. Similarly, transgenic animal technology including gene
knockout technology, e.g., as a result of homologous recombination with an appropriate gene
targeting vector, will result in the absence or increased expression of the breast cancer
protein. When desired, tissue-specific expression or knockout of the breast cancer protein
may be necessary.

It is also possible that the breast cancer protein is overexpressed in breast
cancer. As such, transgenic animals can be generated that overexpress the breast cancer
protein. Depending on the desired expression level, promoters of various strengths can be
employed to express the transgene. Also, the number of copies of the integrated transgene
can be determined and compared for a determination of the expression level of the transgene.
Animals generated by such methods find use as animal models of breast cancer and are
additionally useful in screening for modulators to treat breast cancer.

Kits for Use in Diagnostic and/or Prognostic Applications 

For use in diagnostic, research, and therapeutic applications suggested above,
kits are also provided by the invention. In the diagnostic and research applications such kits
may include any or all of the following: assay reagents, buffers, breast cancer-specific nucleic
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides,
ribozymes, dominant negative breast cancer polypeptides or polynucleotides, small molecule
inhibitors of breast cancer-associated sequences, etc. A therapeutic product may include
sterile saline or another pharmaceutically acceptable emulsion and suspension base.

In addition, the kits may include instructional materials containing directions
(i.e., protocols) for the practice of the methods of this invention. While the instructional
materials typically comprise written or printed materials, they are not limited to such. Any
medium capable of storing such instructions and communicating them to an end user is
contemplated by this invention. Such media include, but are not limited to, electronic storage
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the
like. Such media may include addresses to internet sites that provide such instructional
materials.

The present invention also provides for kits for screening for modulators of
breast cancer-associated sequences. Such kits can be prepared from readily available
materials and reagents. For example, such kits can comprise one or more of the following
materials: a breast cancer-associated polypeptide or polynucleotide, reaction tubes, and
instructions for testing breast cancer-associated activity. Optionally, the kit contains
biologically active breast cancer protein. A wide variety of kits and components can be
prepared according to the present invention, depending upon the intended user of the kit and
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality
of genes or products. The genes will be selected based on correlations with important
parameters in disease which may be identified in historical or outcome data.


EXAMPLES

Example 1: Tissue Preparation, Labeling Chips, and Fingerprints

Purifying total RNA from tissue sample using TRIzol Reagent 

The sample weight is first estimated. The tissue samples are homogenized in
1 ml of TRIzol per 50 mg of tissue using a homogenizer (e.g., Polytron 3100). The size of
the generator/probe used depends upon the sample amount. A generator that is too large for
the amount of tissue to be homogenized will cause a loss of sample and lower RNA yield. A
larger generator (e.g., 20 mm) is suitable for tissue samples weighing more than 0.6 g.
Tubes should not be overfilled. If the working volume is greater than 2 ml and no greater than
10 ml, a 15 ml polypropylene tube (Falcon 2059) is suitable for homogenization.

Tissues should be kept frozen until homogenized. The TRIzol is added
directly to the frozen tissue before homogenization. Following homogenization, the insoluble
material is removed from the homogenate by centrifugation at 7500 x g for 15 min. in a
Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf centrifuge at 4°C. The cleared
homogenate is then transferred to a new tube(s). Samples may be frozen and stored at -60 to
-70°C for at least one month, or the purification may be continued directly.

The next process is phase separation. The homogenized samples are incubated
for 5 minutes at room temperature. Then, 0.2 ml of chloroform per 1 ml of TRIzol reagent is
added to the homogenization mixture. The tubes are securely capped and shaken vigorously
by hand (do not vortex) for 15 seconds. The samples are then incubated at room temperature for
2-3 minutes and next centrifuged at 6500 rpm in a Sorvall superspeed for 30 min. at 4°C.

The next process is RNA precipitation. The aqueous phase is transferred to a
fresh tube. The organic phase can be saved if isolation of DNA or protein is desired. Then
0.5 ml of isopropyl alcohol is added per 1 ml of TRIzol reagent used in the original
homogenization. Then, the tubes are securely capped and inverted to mix. The samples are
then incubated at room temperature for 10 minutes and centrifuged at 6500 rpm in a Sorvall
for 20 min. at 4°C.

The RNA is then washed. The supernatant is poured off and the pellet washed
with cold 75% ethanol. 1 ml of 75% ethanol is used per 1 ml of the TRIzol reagent used in
the initial homogenization. The tubes are capped securely and inverted several times to
loosen the pellet without vortexing. They are next centrifuged at <8000 rpm (<7500 x g) for 5
minutes at 4°C.
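The reagent volumes in the TRIzol steps above all scale with the amount of TRIzol used, which in turn scales with tissue weight. A minimal sketch of that arithmetic follows; the function and class names are hypothetical, and only the ratios stated in the text (1 ml TRIzol per 50 mg tissue; 0.2 ml chloroform, 0.5 ml isopropyl alcohol, and 1 ml 75% ethanol per 1 ml TRIzol) are used.

```python
from dataclasses import dataclass


@dataclass
class TrizolVolumes:
    """Reagent volumes in ml for one tissue sample (illustrative container)."""
    trizol_ml: float
    chloroform_ml: float
    isopropanol_ml: float
    ethanol_75_ml: float


def trizol_volumes(tissue_mg: float) -> TrizolVolumes:
    """Scale reagent volumes from tissue weight using the ratios in the text."""
    if tissue_mg <= 0:
        raise ValueError("tissue weight must be positive")
    trizol = tissue_mg / 50.0            # 1 ml TRIzol per 50 mg tissue
    return TrizolVolumes(
        trizol_ml=trizol,
        chloroform_ml=0.2 * trizol,      # 0.2 ml chloroform per 1 ml TRIzol
        isopropanol_ml=0.5 * trizol,     # 0.5 ml isopropyl alcohol per 1 ml TRIzol
        ethanol_75_ml=1.0 * trizol,      # 1 ml 75% ethanol per 1 ml TRIzol
    )


# Example: a 250 mg tissue sample.
print(trizol_volumes(250))
```

For a 250 mg sample this gives a 5 ml working volume, which falls in the 2 to 10 ml range for which the text suggests a 15 ml polypropylene tube.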

The RNA wash is decanted. The pellet is carefully transferred to an
Eppendorf tube (sliding down the tube into the new tube by use of a pipet tip to help guide it
in if necessary). Tube sizes for precipitating the RNA depend on the working volumes.
Larger tubes may take too long to dry. The pellet is dried. The RNA is then resuspended in an
appropriate volume of DEPC H2O (e.g., to 2-5 ug/ul). The absorbance is then measured.

The poly A+ mRNA may next be purified from total RNA by other methods
such as Qiagen's RNeasy kit. The poly A+ mRNA is purified from total RNA using the
Oligotex suspension, which is heated to 37°C and mixed immediately before it is added to the RNA.
The Elution Buffer is incubated at 70°C. If there is precipitate in the 2x Binding Buffer, it is
warmed at 65°C. The total RNA is mixed with DEPC-treated water, 2x Binding
Buffer, and Oligotex according to Table 2 on page 16 of the Oligotex Handbook and next
incubated for 3 minutes at 65°C and 10 minutes at room temperature.

The preparation is centrifuged for 2 minutes at 14,000 to 18,000 g, preferably
at a "soft setting." The supernatant is removed without disturbing the Oligotex pellet. A little bit
of solution can be left behind to reduce the loss of Oligotex. The supernatant is saved until
satisfactory binding and elution of poly A+ mRNA has been found.

Then, the preparation is gently resuspended in Wash Buffer OW2, pipetted
onto the spin column, and centrifuged at full speed (soft setting if possible) for 1 minute.

Next, the spin column is transferred to a new collection tube, and the resin is gently
resuspended in Wash Buffer OW2 and centrifuged as described herein.

Then, the spin column is transferred to a new tube and eluted with 20 to 100 ul
of preheated (70°C) Elution Buffer. The Oligotex resin is gently resuspended by pipetting up
and down. The centrifugation is repeated as above and the elution repeated with fresh elution
buffer or first eluate to keep the elution volume low.

The absorbance is next read to determine the yield, using diluted Elution
Buffer as the blank.

Before proceeding with cDNA synthesis, the mRNA is precipitated, as
components left over from or present in the Elution Buffer of the
Oligotex purification procedure will inhibit downstream enzymatic reactions of the mRNA.
0.4 vol. of 7.5 M NH4OAc + 2.5 vol. of cold 100% ethanol is added and the preparation
precipitated at -20°C for 1 hour to overnight (or 20-30 min. at -70°C), and centrifuged at
14,000-16,000 x g for 30 minutes at 4°C. Next, the pellet is washed with 0.5 ml of 80%
ethanol (-20°C) and then centrifuged at 14,000-16,000 x g for 5 minutes at room temperature.
The 80% ethanol wash is then repeated. The last bit of ethanol is then allowed to dry from the
pellet without use of a speed vacuum, and the pellet is then resuspended in DEPC H2O at 1 ug/ul
concentration.
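The precipitation step above is stated in relative volumes, so the amounts to pipette depend on the eluate volume. The following sketch computes them under one explicit assumption that the text does not settle: both the 0.4 vol. of NH4OAc and the 2.5 vol. of ethanol are taken relative to the starting sample volume. The function name is hypothetical.

```python
def precipitation_volumes(sample_ul: float) -> dict:
    """Volumes for ammonium acetate / ethanol precipitation.

    Assumption (not stated in the text): both ratios are relative to the
    starting sample volume, not to the sample-plus-salt volume.
    """
    if sample_ul <= 0:
        raise ValueError("sample volume must be positive")
    nh4oac_ul = 0.4 * sample_ul      # 0.4 vol. of 7.5 M NH4OAc
    ethanol_ul = 2.5 * sample_ul     # 2.5 vol. of cold 100% ethanol
    return {
        "nh4oac_7_5M_ul": nh4oac_ul,
        "ethanol_100pct_ul": ethanol_ul,
        "total_ul": sample_ul + nh4oac_ul + ethanol_ul,
    }


# Example: a 100 ul Oligotex eluate.
print(precipitation_volumes(100))
```

The total volume is useful for checking that the precipitation fits in the chosen tube before the salt and ethanol are added.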

Alternatively, the RNA may be purified using other methods (e.g., Qiagen's RNeasy kit).
No more than 100 ug is added to the RNeasy column. The sample volume is
adjusted to 100 ul with RNase-free water. 350 ul Buffer RLT and then 250 ul ethanol
(100%) are added to the sample. The preparation is then mixed by pipetting and applied to an
RNeasy mini spin column for centrifugation (15 sec at >10,000 rpm). If yield is low, reapply
the flowthrough to the column and centrifuge again.

Then, transfer the column to a new 2 ml collection tube, add 500 ul Buffer
RPE, and centrifuge for 15 sec at >10,000 rpm. The flowthrough is discarded. 500 ul Buffer
RPE is then added again and the preparation is centrifuged for 15 sec at >10,000 rpm. The
flowthrough is discarded, and the column membrane dried by centrifuging for 2 min at
maximum speed. The column is transferred to a new 1.5 ml collection tube. 30-50 ul of
RNase-free water is applied directly onto the column membrane. The column is then centrifuged
for 1 min at >10,000 rpm and the elution step repeated.

The absorbance is then read to determine yield. If necessary, the material may
be ethanol precipitated with ammonium acetate and 2.5X volume 100% ethanol.

First Strand cDNA Synthesis

The first strand can be made using Gibco's "Superscript Choice System
for cDNA Synthesis" kit. The starting material is 5 ug of total RNA or 1 ug of polyA+
mRNA. For total RNA, 2 ul of Superscript RT is used; for polyA+ mRNA, 1 ul of
Superscript RT is used. The final volume of first strand synthesis mix is 20 ul. The RNA
should be in a volume no greater than 10 ul. The RNA is incubated with 1 ul of 100 pmol
T7-T24 oligo for 10 min at 70°C followed by addition on ice of 7 ul of: 4 ul 5X 1st Strand
Buffer, 2 ul of 0.1 M DTT, and 1 ul of 10 mM dNTP mix. The preparation is then incubated at
37°C for 2 min before addition of the Superscript RT followed by incubation at 37°C for 1
hour.
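The first strand volumes above can be checked against the stated 20 ul final volume. The sketch below adds up the listed components and reports how much volume is left over; making up that remainder with water is an assumption of this example and is not stated in the text. The function name and the `rna_type` labels are hypothetical.

```python
FINAL_VOLUME_UL = 20.0
OLIGO_UL = 1.0                       # T7-T24 oligo
PREMIX_UL = 4.0 + 2.0 + 1.0          # 5X buffer + 0.1 M DTT + 10 mM dNTP mix = 7 ul
RT_UL = {"total": 2.0, "polyA": 1.0}  # Superscript RT by RNA type


def first_strand_budget(rna_ul: float, rna_type: str) -> dict:
    """Return component volumes and the remainder needed to reach 20 ul.

    Assumption: any remainder is made up with water (not stated in the text).
    """
    if rna_type not in RT_UL:
        raise ValueError("rna_type must be 'total' or 'polyA'")
    if not 0 < rna_ul <= 10:
        raise ValueError("RNA volume must be greater than 0 and at most 10 ul")
    used = rna_ul + OLIGO_UL + PREMIX_UL + RT_UL[rna_type]
    return {
        "rna_ul": rna_ul,
        "oligo_ul": OLIGO_UL,
        "premix_ul": PREMIX_UL,
        "rt_ul": RT_UL[rna_type],
        "water_ul": FINAL_VOLUME_UL - used,
    }


print(first_strand_budget(10, "total"))   # 10 + 1 + 7 + 2 fills the 20 ul exactly
print(first_strand_budget(8, "polyA"))
```

With total RNA at the 10 ul maximum the components account for the full 20 ul, which is consistent with the 10 ul limit on RNA volume given in the text.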

Second Strand Synthesis

For the second strand synthesis, place 1st strand reactions on ice and add: 91
ul DEPC H2O; 30 ul 5X 2nd Strand Buffer; 3 ul 10 mM dNTP mix; 1 ul 10 U/ul E. coli DNA
Ligase; 4 ul 10 U/ul E. coli DNA Polymerase; and 1 ul 2 U/ul RNase H. Mix and incubate 2
hours at 16°C. Add 2 ul T4 DNA Polymerase. Incubate 5 min at 16°C. Add 10 ul of 0.5 M
EDTA.
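As a consistency check on the second strand recipe above, the listed additions can be summed with the 20 ul first strand reaction. The sketch below does this and derives the working strength of the 5X buffer and the final EDTA concentration; the constant and function names are hypothetical, and all volumes are taken from the text.

```python
FIRST_STRAND_UL = 20.0

# Volumes (ul) added to the first strand reaction, as listed in the text.
SECOND_STRAND_ADDITIONS_UL = {
    "DEPC H2O": 91.0,
    "5X 2nd Strand Buffer": 30.0,
    "10 mM dNTP mix": 3.0,
    "E. coli DNA Ligase (10 U/ul)": 1.0,
    "E. coli DNA Polymerase (10 U/ul)": 4.0,
    "RNase H (2 U/ul)": 1.0,
}


def second_strand_volume_ul() -> float:
    """Reaction volume during the 2 hour incubation."""
    return FIRST_STRAND_UL + sum(SECOND_STRAND_ADDITIONS_UL.values())


def buffer_strength(total_ul: float) -> float:
    """Working strength (in X) of the 5X 2nd Strand Buffer."""
    return 5.0 * SECOND_STRAND_ADDITIONS_UL["5X 2nd Strand Buffer"] / total_ul


def edta_final_mM(total_ul: float, t4_ul: float = 2.0,
                  edta_ul: float = 10.0, edta_stock_M: float = 0.5) -> float:
    """Final EDTA concentration after adding T4 DNA Polymerase and EDTA."""
    final_ul = total_ul + t4_ul + edta_ul
    return 1000.0 * edta_stock_M * edta_ul / final_ul


total = second_strand_volume_ul()
print(total, buffer_strength(total), round(edta_final_mM(total), 1))
```

The additions bring the reaction to 150 ul, at which the 30 ul of 5X buffer is at 1X working strength.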


Cleaning up cDNA 

The cDNA is purified using Phenol:Chloroform:Isoamyl Alcohol (25:24:1)
and Phase-Lock gel tubes. The PLG tubes are centrifuged for 30 sec at maximum speed.
The cDNA mix is then transferred to a PLG tube. An equal volume of
phenol:chloroform:isoamyl alcohol is then added, the preparation shaken vigorously (no
vortexing), and centrifuged for 5 minutes at maximum speed. The top aqueous solution is
transferred to a new tube and ethanol precipitated by adding 7.5X 5M NH4OAc and 2.5X
volume of 100% ethanol. Next, it is centrifuged immediately at room temperature for 20
min at maximum speed. The supernatant is removed, and the pellet washed 2X with cold
80% ethanol. As much ethanol wash as possible should be removed before air drying the
pellet and resuspending it in 3 ul RNase-free water.

In vitro Transcription (IVT) and labeling with biotin

In vitro transcription (IVT) and labeling with biotin is performed as follows:
Pipet 1.5 ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2 ul
T7 10x ATP (75 mM) (Ambion); 2 ul T7 10x GTP (75 mM) (Ambion); 1.5 ul T7 10x CTP (75
mM) (Ambion); 1.5 ul T7 10x UTP (75 mM) (Ambion); 3.75 ul 10 mM Bio-11-UTP
(Boehringer-Mannheim/Roche or Enzo); 3.75 ul 10 mM Bio-16-CTP (Enzo); 2 ul 10x T7
transcription buffer (Ambion); and 2 ul 10x T7 enzyme mix (Ambion). The final volume is
20 ul. Incubate 6 hours at 37°C in a PCR machine. The RNA can be further cleaned.
Clean-up follows the previous instructions for RNeasy columns or Qiagen's RNeasy protocol
handbook. The cRNA often needs to be ethanol precipitated and resuspended in a volume
compatible with the fragmentation step.
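The labeling mix above lists nine components and a 20 ul final volume. A short sketch of the bookkeeping follows: it sums the listed volumes and computes the final concentration of each nucleotide from its stock concentration. The list and function names are hypothetical; volumes and stock concentrations are those given in the text.

```python
# (component, volume in ul, stock concentration in mM or None if not applicable)
IVT_COMPONENTS = [
    ("cDNA", 1.5, None),
    ("T7 10x ATP", 2.0, 75.0),
    ("T7 10x GTP", 2.0, 75.0),
    ("T7 10x CTP", 1.5, 75.0),
    ("T7 10x UTP", 1.5, 75.0),
    ("Bio-11-UTP", 3.75, 10.0),
    ("Bio-16-CTP", 3.75, 10.0),
    ("10x T7 transcription buffer", 2.0, None),
    ("10x T7 enzyme mix", 2.0, None),
]


def total_volume_ul(components) -> float:
    """Sum of all component volumes."""
    return sum(volume for _, volume, _ in components)


def final_concentrations_mM(components) -> dict:
    """Final concentration of each nucleotide: stock * volume / total."""
    total = total_volume_ul(components)
    return {
        name: stock * volume / total
        for name, volume, stock in components
        if stock is not None
    }


print(total_volume_ul(IVT_COMPONENTS))
for name, conc in final_concentrations_mM(IVT_COMPONENTS).items():
    print(f"{name}: {conc:.3f} mM")
```

The listed volumes do sum to the stated 20 ul, so no additional water is implied by the recipe.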

Fragmentation is performed as follows. 15 ug of labeled RNA is usually
fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment
RNA by incubation at 94°C for 35 minutes in 1x Fragmentation buffer (5x Fragmentation
buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea
of the transcript size range.
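Because the fragmentation buffer is supplied at 5x and used at 1x, one fifth of the reaction volume is buffer. The sketch below computes the buffer volume and the resulting 1x concentrations for a chosen reaction volume, and enforces the 20 ul ceiling given in the text. The function and constant names are hypothetical.

```python
# Composition of the 5x Fragmentation buffer, in mM, as given in the text.
FRAG_BUFFER_5X_MM = {
    "Tris-acetate, pH 8.1": 200.0,
    "KOAc": 500.0,
    "MgOAc": 150.0,
}


def fragmentation_setup(reaction_ul: float) -> dict:
    """Volume of 5x buffer and working (1x) concentrations for one reaction."""
    if not 0 < reaction_ul <= 20:
        raise ValueError("reaction volume must be greater than 0 and at most 20 ul")
    buffer_ul = reaction_ul / 5.0
    return {
        "buffer_5x_ul": buffer_ul,
        "rna_plus_water_ul": reaction_ul - buffer_ul,
        "working_mM": {k: v / 5.0 for k, v in FRAG_BUFFER_5X_MM.items()},
    }


print(fragmentation_setup(10))
```

For the recommended 10 ul reaction this leaves 8 ul for the labeled RNA, so 15 ug of RNA would need to be at roughly 1.9 ug/ul or more after clean-up.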

For hybridization, 200 ul (10 ug cRNA) of a hybridization mix is put on the
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it
is recommended that an initial hybridization mix of 300 ul or more be made. The
hybridization mix is: fragmented labeled RNA (50 ng/ul final conc.); 50 pM 948-b control
oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; 0.1 mg/ml herring sperm DNA;
0.5 mg/ml acetylated BSA; brought to 300 ul with 1x MES hyb buffer.
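The hybridization mix is specified by final concentrations, so the amounts to add depend on the volume prepared. A minimal sketch of the conversion follows; the function names are hypothetical, and the concentrations are the ones given in the text.

```python
def crna_needed_ug(mix_volume_ul: float, final_ng_per_ul: float = 50.0) -> float:
    """Mass of fragmented labeled RNA for a given mix volume."""
    return mix_volume_ul * final_ng_per_ul / 1000.0   # ng -> ug


def spike_amount_fmol(mix_volume_ul: float, final_pM: float) -> float:
    """Amount of a control oligo or transcript for a given mix volume.

    pM * ul = 1e-12 mol/l * 1e-6 l = 1e-18 mol (amol); divide by 1000 for fmol.
    """
    return mix_volume_ul * final_pM / 1000.0


# Final concentrations (pM) of the controls listed in the text.
CONTROLS_PM = {"948-b control oligo": 50.0, "BioB": 1.5, "BioC": 5.0,
               "BioD": 25.0, "CRE": 100.0}

volume_ul = 300.0
print(f"cRNA: {crna_needed_ug(volume_ul):.1f} ug")
for name, pm in CONTROLS_PM.items():
    print(f"{name}: {spike_amount_fmol(volume_ul, pm):.2f} fmol")
```

At 50 ng/ul, the 200 ul applied to the chip carries 10 ug of cRNA, matching the figure in the text, and a 300 ul mix requires the full 15 ug that is usually fragmented.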

The hybridization reaction is conducted with non-biotinylated IVT (purified
by RNeasy columns) (see Example 1 for steps from tissue to IVT). The following mixture is
prepared:

IVT antisense RNA, 4 ug:          ul
Random Hexamers (1 ug/ul):      4 ul
H2O:                              ul
Total:                         14 ul

Incubate the above 14 ul mixture at 70°C for 10 min.; then put on ice.

The reverse transcription procedure uses the following mixture:

0.1 M DTT:                      3 ul
50X dNTP mix:                 0.6 ul
H2O:                          2.4 ul
Cy3 or Cy5 dUTP (1 mM):         3 ul
SS RT II (BRL):                 1 ul
Total:                         16 ul

The above solution is added to the hybridization reaction and incubated for 30 min. at 42°C.
Then, 1 ul SSII is added and incubated for another hour before being placed on ice.

The 50X dNTP mix contains 25 mM of cold dATP, dCTP, and dGTP and 10 mM
of dTTP, and is made by adding 25 ul each of 100 mM dATP, dCTP, and dGTP and 10 ul of
100 mM dTTP to 15 ul H2O.
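The stated concentrations of the 50X dNTP mix follow from the dilution relation C1 x V1 = C2 x V2. The sketch below recomputes them from the listed volumes; the function name is hypothetical, and the volumes and the 100 mM stock concentration are taken from the text.

```python
def dntp_mix(stock_mM: float = 100.0) -> dict:
    """Total volume and final concentration of each dNTP in the 50X mix."""
    volumes_ul = {"dATP": 25.0, "dCTP": 25.0, "dGTP": 25.0, "dTTP": 10.0, "H2O": 15.0}
    total_ul = sum(volumes_ul.values())
    final_mM = {
        name: stock_mM * vol / total_ul      # C2 = C1 * V1 / V2
        for name, vol in volumes_ul.items()
        if name != "H2O"
    }
    return {"total_ul": total_ul, "final_mM": final_mM}


print(dntp_mix())
```

The reduced dTTP concentration leaves room for the labeled Cy3 or Cy5 dUTP in the reverse transcription reaction.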

RNA degradation is performed as follows. Add 86 nl H2O and 1.5 ul 1 M NaOH/
2 mM EDTA and incubate at 65°C for 10 min. For U-Con 30, add 500 ul TE per sample, spin at
7000 g for 10 min, and save the flow-through for purification. For Qiagen purification, suspend
the U-Con recovered material in 500 ul buffer PB and proceed using the Qiagen protocol. For
DNAse digestion, add 1 ul of a 1/100 dilution of DNAse per 30 ul reaction and incubate at 37°C
for 15 min. Incubate for 5 min at 95°C to denature the DNAse.



Sample preparation 

For sample preparation, add Cot-1 DNA, 10 ul; 50X dNTPs, 1 ul; 20X SSC, 2.3 ul; Na
pyrophosphate, 7.5 ul; and 10 mg/ml herring sperm DNA, 1 ul of a 1/10 dilution, to a final
volume of 21.8 ul. Dry in a speed vac. Resuspend in 15 ul H2O. Add 0.38 ul 10% SDS. Heat at
95°C for 2 min and slow cool at room temperature for 20 min. Put on slide and hybridize
overnight at 64°C. Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 ml 20X
SSC + 0.75 ml 10% SDS in 250 ml H2O; 1X SSC: 5 min., 12.5 ml 20X SSC in 250 ml H2O;
0.2X SSC: 5 min., 2.5 ml 20X SSC in 250 ml H2O. Dry slides and scan at appropriate PMTs
and channels.
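The wash recipes above can be verified with the same dilution arithmetic. The sketch below assumes that "in 250 ml H2O" means a final volume of 250 ml, which is the reading under which the stated strengths come out exactly; the function name and the recipe table are illustrative.

```python
def final_strength(stock_strength: float, stock_ml: float, final_ml: float) -> float:
    """Strength after dilution: stock strength * stock volume / final volume."""
    return stock_strength * stock_ml / final_ml


FINAL_ML = 250.0   # assumption: 250 ml is the final volume of each wash

# (wash name, ml of 20X SSC, ml of 10% SDS)
WASHES = [
    ("3X SSC / 0.03% SDS", 37.5, 0.75),
    ("1X SSC", 12.5, 0.0),
    ("0.2X SSC", 2.5, 0.0),
]

for name, ssc_ml, sds_ml in WASHES:
    ssc_x = final_strength(20.0, ssc_ml, FINAL_ML)
    sds_pct = final_strength(10.0, sds_ml, FINAL_ML)
    print(f"{name}: {ssc_x:.2f}X SSC, {sds_pct:.3f}% SDS")
```

Under that assumption each recipe reproduces its label: 3X SSC with 0.03% SDS, 1X SSC, and 0.2X SSC.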
TABLE 1: Figure 1 from BRCA 001 US

Table 1 shows genes (incorporated in their entirety here and throughout the application
where primekeys are provided) downregulated in tumor tissue compared to normal breast
tissue.

Pkey:          Unique Eos probeset identifier number
ExAccn:        Exemplar Accession number, Genbank accession number
UnigeneID:     Unigene number
Unigene Title: Unigene gene title
R1:            Ratio of normal breast tissue to tumor

Pkey    ExAccn     UnigeneID  UnigeneTitle                              R1

100472  D90084     Hs.1023    pyruvate dehydrogenase (lipoamide) alpha
100499  T51986     Hs.283108  hemoglobin, gamma G
100545  M55405                gb:Homo sapiens mucin (MUC-3) mRNA, part
100549  BE142019   Hs.222056  Homo sapiens cDNA FLJ11572 fis, clone HE
100613  X52078     Hs.101047  transcription factor 3 (E2A immunoglobul
100635  BE259039   Hs.129953  Ewing sarcoma breakpoint region 1
100645  X16641     Hs.167986  neural cell adhesion molecule 1
100654  A03758                NM_000477*:Homo sapiens albumin (ALB), m
100702  L27065                gb:Human neurofibromatosis 2 (NF2) mRNA,
100915  M60832     Hs.249239  collagen, type VIII, alpha 2
100971  BE379727   Hs.83213   fatty acid binding protein 4, adipocyte
101125  AJ250562   Hs.82749   transmembrane 4 superfamily member 2
101166  M90424     Hs.2099    lipocalin 1 (protein migrating faster th
101184  NM_001674  Hs.460     activating transcription factor 3
101336  NM_006732  Hs.75678   FBJ murine osteosarcoma viral oncogene h
101367  X03350     Hs.4       alcohol dehydrogenase 1B (class I), beta
101447  M21305                gb:Human alpha satellite and satellite 3
101461  N98569     Hs.76422   phospholipase A2, group IIA (platelets,
101511  M27826     Hs.267319  endogenous retroviral protease
101634  AV650262   Hs.75765   GRO2 oncogene
101736  M74447     Hs.502     transporter 2, ATP-binding cassette, sub
102208  U22961                gb:Human mRNA clone with similarity to L
102297  NM_001504  Hs.198252  G protein-coupled receptor 9
102450  U48251     Hs.75871   protein kinase C binding protein 1
102515  U89337     Hs.169886  tenascin XB
102571  U60115     Hs.239069  four and a half LIM domains 1
102800  AA313538              gb:EST185419 Colon carcinoma (HCC) cell
102857  NM_006744  Hs.76461   retinol-binding protein 4, interstitial
102990  AA829286   Hs.332053  serum amyloid A1
103434  X98085     Hs.54433   tenascin R (restrictin, janusin)
103747  AA081995              gb:zn26d06.fi Stratagene neuroepithelium
103750  AA126129              gb:zm78c07.r1 Stratagene neuroepithelium
103812  AA137107   Hs.326391  Homo sapiens, clone MGC:16638, mRNA, com
103851  AA326216   Hs.8719    hypothetical protein MGC1136
104080  AB041036   Hs.57771   kallikrein 11 (KLK11; TLSP; PRSS20; hipp
104093  R50727     Hs.336970  ESTs
104106  AA422123              gb:zv26h12.ii Soares_NhHMPu_S1 Homo sapi
104109  AL353957   Hs.284181  hypothetical protein DKFZp434P0531
104250  F06638     Hs.12440   Homo sapiens clone 24734 mRNA sequence
104340  AA426189              gb:zw11e09.r1 Soares_NhHMPu_S1 Homo sapi
104492  N73185     Hs.94285   EST
104506  N91071     Hs.109650  ESTs
104511  N99542     Hs.572     orosomucoid 1
104532  AI498763   Hs.203013  hypothetical protein FLJ12748

100 
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104536 

104572 

104659 

104677 

104711 

104731 

104764 

105005 

105036 

105105 

105231 

105239 

105921 

105957 

106052 

106119 

106181 

106194 

106283 

106379 

106451 

106491 

106700 

106782 

106851 

106870 

106892 

106954 

R24024 Hs.158101 Homo sapiens cDNA FLJ14673 fis, clone NT

Y11312 Hs.132463 phosphoinositide-3-kinase, class 2, beta

AW969769 Hs.105201 ESTs

AA009764 Hs.190380 ESTs

AA017245 Hs.32794 ESTs

AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene

AI039243 Hs.278585 ESTs

AI298208 Hs.28805 ESTs

AA130390 Hs.25549 hypothetical protein FLJ20898

R61532 Hs.87016 hypothetical protein FLJ22938

AW970043 Hs.238039 hypothetical protein FLJ11090

AA221036 gb:zr03f12.r1 Stratagene NT2 neuronal pr

AA421973 Hs.169119 ESTs, Weakly similar to T25731 hypotheti

BE242857 Hs.27021 hypothetical protein FLJ11159

N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti

AL359624 Hs.11387 KIAA1453 protein

AI803651 Hs.191608 ESTs

AW976171 Hs.286194 hypothetical protein FLJ22233

AS085848 Hs.25522 KIAA1808 protein

AL042069 Hs.119021 DKFZP434N061 protein

AW235928 Hs.313182 ESTs

AA135688 Hs.10083 Homo sapiens, clone IMAGE:4139786, mRNA,

AA906434 Hs.3776 zinc finger protein 216

AW054886 Hs.25682 Homo sapiens mRNA for KIAA1863 protein,

AI456623 gb:tk04g09a1 NCI_CGAP_Lu24 Homo sapiens

AI983730 Hs.26530 serum deprivation response (phosphatidyl

AI347578 Hs.124015 hypothetical protein MGC2605

AF128847 Hs.204038 indolethylamine N-methyltransferase

106991 AJ223811 Hs.30127 hypothetical protein

107103 AI446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY

107124 AB006532 Hs.31442 RecQ protein-like 4

107148 AI005036 Hs.334305 GS1999full

107214 AF127026 Hs.5394 myosin IA

107242 AB020672 Hs.175411 KIAA0865 protein

107331 AI905985 Hs.111805 ESTs

107351 U51704 Hs.323428 ESTs, Moderately similar to ALU8_HUMAN A

107423 W26652 Hs.6163 PTEN induced putative kinase 1

107447 W28516 Hs.19210 hypothetical protein MGC11308

107451 AL042425 Hs.283976 hypothetical protein PRO2389

107453 AI092790 Hs.334703 hypothetical protein FLJ14529

107459 W38002 Empirically selected from AFFX single pr

107683 N53167 Hs.47623 ESTs

107711 W96141 Hs.220687 ESTs

107754 AA017462 Hs.269244 ESTs

107757 BE621721 Hs.280792 hypothetical protein FLJ12387 similar to

107864 AA025060 Hs.61246 ESTs

107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha

107888 AA025836 Hs.191637 ESTs

107997 AL049176 Hs.82223 chordin-like

108056 AA043675 Hs.62633 ESTs

108081 AA093668 Hs.28578 muscleblind (Drosophila)-like

108113 AA012881 Hs.72531 hypothetical protein FLJ11838

108238 AA059473 Hs.66783 EST

108257 AA677927 Hs.144269 ESTs

108335 AA070500 gb:zm70h03.s1 Stratagene neuroepithelium

108351 AA071193 gb:zf79b12.s1 Soares_pineal_gland_N3HPG

108382 NM_006770 Hs.67726 macrophage receptor with collagenous str

108392 AA075124 gb:zm86a01.s1 Stratagene ovarian cancer

108441 AA079079 gb:zm97c09.s1 Stratagene colon HT29 (937

108446 AA085383 gb:zn13g03.s1 Stratagene hNT neuron (937

108497 AA074897 gb:zm85a05.r1 Stratagene ovarian cancer

108604 AA934589 Hs.49696 ESTs

108662 AF117646 Hs.156637 Cas-Br-M (murine) ecotropic retroviral tr

108706 AA121820 Hs.74569 KIAA0842 protein

108738 AA126583 Hs.158725 ESTs

108827 AI273692 Hs.110470 ESTs

109123 AI028376 Hs.73232 ESTs
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09389 AA101325 Hs.86154 hypothetical protein FU 12457 10 

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 5

109919 R40604 Hs.129539 ESTs, Weakly similar to MCAT_HUMAN MITOC 10

110006 AI094674 Hs.30524 ring finger protein 24 10

110141 H46749 Hs.31540 ESTs 10

110354 W22165 Hs.22586 ESTs 5

110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-D-galactosamine:polyp 10

110448 H51276 Hs.13526 hypothetical protein FLJ12688 10

110455 H52576 gb:yt85e04.r1 Soares_pineal_gland_N3HPG 5

110540 H72639 Hs.167608 ESTs 5

110553 H60593 Hs.124990 ESTs 10

110976 AL044174 Hs.159526 patched (Drosophila) homolog 10

110987 AI753316 Hs.26034 ESTs 5

111158 N66616 Hs.138629 H.sapiens mRNA for subtelomeric repeats 5

111168 AI798376 gb*34b07j(1 NCI_CGAP_Ov23 Homo sapiens 10

111187 AJ224864 Hs.9688 leukocyte membrane antigen 5

111307 AA641636 Hs.37477 ESTs, Weakly similar to T46908 hypotheti 5

111400 R00144 Hs.189771 ESTs 10

111498 AI168511 gb:ow90h09^1 Soares_fetal_liver_spleen_ 10

111651 R16733 Hs.20499 ESTs 10

111738 R26065 gb:yh39d03.s1 Soares placenta Nb2HP Homo 5

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10

111995 R42333 Hs.302292 ESTs 10

112071 AI117490 Hs.47225 Ras-associated protein Rap1 10

112204 NM_006668 Hs.25121 cytochrome P450, subfamily 46 (cholester 10

112258 R51889 Hs.24990 ESTs 5

112490 R31094 Hs.24378 ESTs 10

112588 R77302 gb:yi75h08.s1 Soares placenta Nb2HP Homo 10

112654 BE618629 Hs.268809 ESTs 5

112784 T98628 Hs.191290 ESTs 5

112817 AI057205 Hs.14584 ESTs 5

112885 AA581428 Hs.5021 EST 10

112913 T16837 Hs.4241 ESTs 5

113149 T51588 gb:yb27e06.s1 Stratagene fetal spleen (9 10

113174 T54659 Hs.301755 Homo sapiens cDNA FLJ11465 fis, clone HE 5

113203 AA743563 Hs.10305 ESTs 5

113299 AW207424 Hs.332594 ESTs 10

113367 N92359 Hs.14518 ESTs, Moderately similar to A48752 B-cel 10

113457 R16763 Hs.268679 ESTs 5

113563 AA913635 Hs.326413 Homo sapiens cDNA FLJ20812 fis, clone AD 10

113574 R06874 Hs.268628 ESTs, Moderately similar to ALU1_HUMAN A 5

113776 AI791905 Hs.95549 hypothetical protein 10

113790 AI244311 Hs.26912 ESTs 10

113807 W07586 Hs.8045 ESTs 3

113958 W86195 gb:zh54e05.s1 Soares_fetal_liver_spleen_ 10

114211 Z39319 Hs.27347 EST 10

114254 AB018263 Hs.180338 tumor necrosis factor receptor superfami 5

114349 AA745978 Hs.28273 ESTs 5

114449 AA020736 gb:ze63b11.s1 Soares retina N2b4HR Homo 5

114484 AA034378 Hs.267319 endogenous retroviral protease 5

114576 AA065096 gb:zm50a02.s1 Stratagene fibroblast (937 5

114624 AA081507 gb:zn05b10.r1 Stratagene hNT neuron (937 5

114844 AA234826 Hs.87386 EST 5

114906 AA234462 Hs.87350 ESTs 5

115624 AK000725 Hs.50579 hypothetical protein FLJ20718 3

115666 AF173081 Hs.178215 Vertebrate LIN7 homolog 1, Tax interacti 5

115712 AB020649 Hs.74569 KIAA0842 protein 5

115889 AA398841 Hs.39850 hypothetical protein FLJ20517 10

115949 AI478427 Hs.43125 esophageal cancer related gene 4 protein 10

116107 AL133916 Hs.172572 hypothetical protein FLJ20093 10

116180 AA463902 Hs.13522 ESTs, Weakly similar to I38022 hypotheti 5

116267 AW968703 Hs.30085 hypothetical protein FLJ23186 5

116291 AW410377 Hs.41502 hypothetical protein FLJ21276 5

116527 AW194253 Hs.68607 ESTs 10

116659 BE314852 Hs.168694 Homo sapiens clone 23763 unknown mRNA, p 5

116708 F10528 Hs.70001 ESTs, Moderately similar to JC6169 nucle 5

117058 AW801806 gb:IL5-UM0070-110400-062-g07 UM0070 Homo 5
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117151 AI803656 Hs.42373 ESTs 

1 172^6 N20468 gb:yx39b1 (Xs1 Soares melanocyte 2NbHM Ho 

117323 AI472863 Hs.43387 ESTs 

117671 N34417 Hs.44584 ESTs 

117624 N26627 Hs.82364 ESTs, Weakly similar to JC4124 pregnancy 

117673 N40551 Hs.184043 Homo sapiens Ets-1 binding protein (E1B) 

117847 N49285 Hs.182391 ESTs 

117877 AW263476 Hs.44268 myelin gene expression factor 2 

117919 BE222341 Hs.279472 ESTs 

118049 N53145 gb:yv55f09.s1 Soares fetal liver spleen 

118413 AW955696 Hs.90960 ESTs 

118613 AI078236 Hs.49688 ESTs 

118664 N70907 Hs.230619 EST 

118858 AL122040 Hs.102981 Homo sapiens mRNA; cDNA DKFZp434G1972 (f 

118902 AA993527 Hs.293907 hypothetical protein FLJ23403

119039 AI160570 Hs.252097 pregnancy specific beta-1-glycoprotein 6

119159 AF142419 Hs.15020 homolog of mouse quaking QKI (KH domain

119216 AA514422 Hs.221849 ESTs

119317 AK002001 Hs.51305 v-maf musculoaponeurotic fibrosarcoma (a

119366 T77892 gb^d20f04^1 Soares fetal liver spleen

119378 T81824 Hs.90949 EST

119528 W38051 Empirically selected from AFFX single pr

119792 AL049798 Hs.80552 dermatopontin

119800 AF086332 Hs.58314 ESTs

119817 AF088061 Hs.159690 ESTs

119835 AF086429 Hs.58429 ESTs

119923 AW803308 Hs.62954 ferritin, heavy polypeptide 1

119961 U34249 Hs.337461 Human putative zinc finger protein (ZNFB

120379 AL042725 gb:DKFZp434B1822_r1 434 (synonym: htes3)

120931 AW136934 Hs.97162 ESTs

121037 AA907743 Hs.142373 ESTs

121282 AA401695 Hs.97334 ESTs

121382 AA405763 Hs.111939 Homo sapiens cDNA FLJ20470 fis, clone KA

121764 AA421452 Hs.164851 ESTs, Weakly similar to KIAA0926 protein

122034 AK000229 Hs.98017 Homo sapiens cDNA FLJ20222 fis, clone CO

122441 AA447555 Hs.99116 EST

122756 AA456945 Hs.95898 ESTs

122771 AW135093 Hs.97282 ESTs, Highly similar to G100_HUMAN 110 K

123601 AA609122 Hs.112645 Homo sapiens mRNA; cDNA DKFZp434D2472 (f

123623 AI024595 Hs.97508 a disintegrin and metalloproteinase doma

123941 AA621529 gb:af47a02.s1 Soares_total_fetus_Nb2HF8_

124215 H62570 gb:yr44a01.r1 Soares fetal liver spleen

124276 H83465 gb:ys91a1 U1 Soares retina N2b5HR Homo

124680 AK001527 Hs.163953 hypothetical protein FLJ10665

125099 NM_014312 Hs.112377 cortical thymocyte receptor (X. laevis

125121 T98199 Hs.48403 hypothetical protein FLJ10847

125188 BE299567 Hs.271749 ESTs, Moderately similar to ALU8_HUMAN A

125284 NM_002666 Hs.103253 perilipin

125906 BE256206 Hs.17775 p75NTR-associated cell death executor; o

128484 AA485421 Hs.270503 ESTs, Weakly similar to ALU7_HUMAN ALU S

126511 NM_002250 Hs.10082 potassium intermediate/small conductance

128538 R44214 Hs.101189 ESTs

128606 C16161 Hs.283040 hypothetical protein PRO2543

128850 AA193106 Hs.180817 chromosome 11 open reading frame 23

128870 H39537 Hs.75309 eukaryotic translation elongation factor

128903 AW150717 Hs.296176 STAT induced STAT inhibitor 3

128931 N62889 Hs.107242 Homo sapiens cDNA FLJ12965 fis, clone NT

129001 AA443323 Hs.107812 BPOZ protein

129091 AA056483 Hs.301463 Human Chromosome 16 BAC clone CIT987SK-A

129101 NM_013403 Hs.108665 zinedin

129146 AL117472 Hs.108924 SH3-domain protein 5 (ponsin)

129213 AI146494 Hs.109525 ESTs, Weakly similar to IRX2_HUMAN IROQU

129228 U40714 Hsi39307 tyrosyl-tRNA synthetase

129265 AA530892 Hs.171695 dual specificity phosphatase 1

129285 BE617015 Hs.11006 ESTs, Moderately similar to T17372 plasm

129346 AF110141 Hs.288908 WAS protein family, member 2

129368 NM_003877 Hs.110776 STAT induced STAT inhibitor-2
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129371 X06828 Hs.1 10802 von Willebrand factor 5 

129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted 10

129440 W37944 Hs.4007 Sarcolemmal-associated protein 5

129441 BE061069 Hs.301943 KIAA0467 protein 10

129516 AF020038 Hs.11223 isocitrate dehydrogenase 1 (NADP+), solu 10

129554 BE222078 Hs.113069 ESTs 10

129684 BE622468 Hs.11924 ESTs, Weakly similar to I38022 hypotheti 5

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypotheti 5

129778 AK001676 Hs.12457 hypothetical protein FLJ10814 10

129893 AK000956 Hs.13209 hypothetical protein FLJ10094 5

129928 AI338993 Hs.134535 ESTs 5

129973 AJ251760 Hs.273385 guanine nucleotide binding protein (G pr 5

129977 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 5

130014 NM_001158 Hs.143102 amine oxidase, copper containing 2 (reti 5

130085 M62402 Hs.274313 insulin-like growth factor binding prote 10

130089 AA452006 Hs.333199 ESTs 5

130162 W80711 Hs.319946 Homo sapiens mRNA for KIAA1727 protein, 5

130243 D88435 Hs.153227 cyclin G associated kinase 10

130315 AI241084 Hs.154353 nonselective sodium potassium/proton exc 5

130339 AA435746 gb:zt79e03.s1 Soares_testis_NHT Homo sap 5

130400 V00517 Hs.283108 hemoglobin, gamma G 10

130436 NM_001928 Hs.155597 D component of complement (adipsin) 10

130478 X72308 Hs.251526 small inducible cytokine A7 (monocyte ch 5

130480 BE222978 Hs.15760 MYG1 protein 10

130494 AW390834 Hs.75874 pregnancy-associated plasma protein A 5

130563 BE270472 Hs.279900 HSPC015 protein 10

130589 AL110226 Hs.16441 DKFZP434H204 protein 10

130606 AI652143 Hs.288382 hypothetical protein FLJ13111 5

130634 AI769067 Hs.127824 ESTs, Weakly similar to T28770 hypotheti 3

130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA 10

130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10

130716 AA232075 Hs.18259 XPA binding protein 1; putative ATP(GTP) 5

130718 AF263462 Hs.18376 KIAA1319 protein 10

130722 N41322 Hs.18441 ESTs 5

130798 M81349 Hs.1955 serum amyloid A4, constitutive 10

130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10

131184 AB040935 Hs.23954 cerebral cell adhesion molecule 10

131261 AA360419 Hs.171776 inositol(myo)-1(or 4)-monophosphatase 1 10

131282 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10

131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 10

131340 AK000393 Hs.25817 BTB (POZ) domain containing 2 5

131341 AF110908 Hs.297660 TNF receptor-associated factor 3 5

131406 H83294 Hs.284122 Wnt inhibitory factor-1 5

131489 BE394648 Hs.27414 hypothetical protein 5

131543 AW966881 Hs.41639 programmed cell death 2 10

131692 BE559681 Hs.30736 KIAA0124 protein 5

131753 AA829286 Hs.332053 serum amyloid A1 10

131756 AA443966 Hs.31595 ESTs 10

131785 H69342 Hs.26320 TRABID protein 10

131815 AA021258 Hs.32753 ESTs 5

131819 BE244961 Hs.173103 FE65-LIKE2 5

131828 AJ000263 Hs.278658 keratin, hair, basic, 6 (monilethrix) 10

131888 AW294659 Hs.34054 Homo sapiens cDNA: FLJ22488 fis, clone H 5

131927 AJ003112 Hs.34780 doublecortex; lissencephaly, X-linked (d 5

131949 AK000010 Hs.258798 hypothetical protein FLJ20003 10

132115 H81604 Hs.178471 KIAA0798 gene product 5

132177 X80818 Hs.178078 glutamate receptor, metabotropic 4 5

132296 AA467752 Hs.195161 ESTs 5

132426 AW118072 Hs.89981 diacylglycerol kinase, zeta (104kD) 10

132477 S68874 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 5

132675 AI291498 Hs.5476 Homo sapiens, clone IMAGE:3530123, mRNA, 10

132796 NM_006283 Hs.173159 transforming, acidic coiled-coil contain 10

132898 W28548 Hs.224829 ESTs 10

132905 NM_004235 Hs.7934 Kruppel-like factor 4 (gut) 10

132953 BE175645 Hs.321264 LBP protein 32 5

133116 BE563966 Hs.6529 ESTs, Weakly similar to I78885 serine/th 5

133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10
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133139 AF052138 Hs.6580 Homo sapiens cDNA: FU23227 fis, clone C .5 

133103 AA668224 Hs.6634 Homo sapiens cDNA: FLJ22547 fis, clone H 5

133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKH 5

133272 NM_002776 Hs.69423 kallikrein 10 (KLK10) (PRSSL1) (neal) 5

133379 AA207059 gb:zq80h09.s1 Stratagene hNT neuron (937 5

133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5

133552 H21497 Hs.7471 BBP-like protein 1 5

133702 L02321 Hs.75652 glutathione S-transferase M5 5

133719 H26904 Hs.75736 apolipoprotein D 5

133731 N71725 Hs.272572 hemoglobin, alpha 2 10

133789 T85626 Hs.76239 hypothetical protein FLJ20608 5

134007 AF072441 Hs.7840 calcineurin binding protein 1 10

134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 10

134111 AI372588 Hs.8022 TU3A protein 10

134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr 10

134177 BE243319 Hs.79672 KIAA0652 gene product 5

134308 AW905827 Hs.81454 ketohexokinase (fructokinase) 10

134361 BE549343 Hs.62208 acyl-Coenzyme A dehydrogenase, very long 5

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5

134449 L34155 Hs.83450 laminin, alpha 3 (nicein (150kD), kalini 5

134467 AI190413 Hs.8?73 ESTs 10

134496 M64936 gb:Homo sapiens retinoic acid-inducible 10

134510 NM_002757 Hs.250870 mitogen-activated protein kinase kinase 10

134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 10

134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor 5

134591 U73394 Hs.166085 killer cell immunoglobulin-like receptor 5

134678 AL008583 Hs.182595 dynein, axonemal, light polypeptide 4 5

134728 D10216 Hs.89394 POU domain, class 1, transcription facto 5

134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10

134786 T29618 Hs.89640 TEK tyrosine kinase, endothelial (venous 10

134912 T87521 Hs.261457 ESTs 5

134963 NM_003394 Hs.91935 wingless-type MMTV integration site fami 10

134969 H22570 Hs.172572 hypothetical protein FLJ20093 5

135001 AA302517 Hs.92732 KIAA1444 protein 5

135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10

135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 10

135197 U76456 Hs.190787 tissue inhibitor of metalloproteinase 4 5

135219 AB002361 Hs.96633 KIAA0363 protein 5

135250 U83171 Hs.97203 small inducible cytokine subfamily A (Cy 5

135304 AA416829 Hs.191597 ESTs 5

135337 AA905406 Hs.9905 ESTs, Weakly similar to unnamed protein 3

135417 X55019 Hs.99975 cholinergic receptor, nicotinic, delta p 10

101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 5

128870 H39537 Hs.75309 eukaryotic translation elongation factor 5

129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted 5

130085 M62402 Hs.274313 insulin-like growth factor binding prote 5

130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10

133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 3

133407 AF017987 Hs.7306 secreted frizzled-related protein 1 5

133731 N71725 Hs.272572 hemoglobin, alpha 2 5

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 5

135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10

135173 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5

322580 AK001852 Hs.274151 ligatin 5

408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 10

418043 AW377752 Hs.83341 AXL receptor tyrosine kinase 5

427458 BE208364 Hs.29283 ESTs, Weakly similar to LKHU proteoglycan link 5

446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 10

449826 U85642 Hs.138506 ESTs 5

RC_H15814_s Human apM1 mRNA for GS3109 (novel adipose specific collagen

YEL024w/RIP1 EST - YEL024w/RIP1 3
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TABLE 1A 

Table 1A shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 1.
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT Number Accessions 



108446 112224_1 AA085383 AA126091 AA074174 AA075373 AA079120 AA070831 AA075978 AA075372 AA128503

108497 110079_2 AA074897 AA113914 AA064871 AA079329 AA071309 AA084710 AA129030 AA075042 AA074794 AA071453 AA078803
AA148628 AA122204 AA074159 AA126185 AA079117 AA127089 AA070912 AA079280 AA131372 AA078833 AA071087
AA076131 AA071047 AA079401 AA083070 AA102076 AA115163 AA074198 AA134725 AA113889 AA121103 AA075041
AA065148 AA071310 AA101144 AA079659 AA078931 AA079209 AA070928 AA068994 AA069817 AA076187 AA069053
AA131489 AA071308 AA063317 AA070156 AA071430 AA076056 AA075684 AA070053 AA126283 AA126078 AA075895
AA079208 AA074583 AA071086 AA079623 AA070627 AA078802 AA076622 AA065051 AA079143 AA071110 AA079434
AA148748 AA079230 AA085188 AA074485 AA070580 AA076151 AA083166 AA085118 AA079450 AA085044 AA120938
AA079200 AA100188 AA081472 AA122355 AA129031 AA085362 AA069220 AA070940 AA075968 AA074563 AA084027
AA115929

124215 1597154_1 H62570 H59063

117058 1219924_1 AW801606 H90434 BE086530

110455 46874_1 H52576 AF085971 H52172

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218587
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168
AA157718 AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566
C05254 AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374
H66215 AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703
AW817659 BE081531 H59570

111498 411008_1 AI168511 AI022712 AA700366 R07371 R07324

104340 46289_10 AA426189 F15201

103747 117944_1 AA081995 AA101099

134496 46501_1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311
AI684569 AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514

103750 118355_1 AA126129 AA126033 AA082561

105239 34624_1 AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627
AW869639 BE061833 BE000620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904
C16859

120379 34624_1 AL042725 BE063316 AW975610 AA457591 BE062092 AI655202 AA714296 AI267264 AI075321 AA223286 AA071122
AA227849 AA216700 AI696002 AA101857 AA099426 AA135997 AL041698 T02815 T51824 AA207189 T59230 T51868
AA663341 BE165757 AW818104 AW392886 AA584918 AA099408 AW856398 AW861859 AA053045

114624 AA081507 AA070071 AA070840 AA084362

106851 AI458623 AA639708 AA485409 R22066 AA485570

108392 AA075124 AA075208

100545 M55405 AW752552

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495
X51363 X51364 X51365

100702 L27065

102208 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101
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322947J 
113549J 
22955.11 
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N70806 AI141254 A1383191 AI401 237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388
T85706 AF075298 AI110799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710
AI133290 AI133304 AI174948 AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759
AI207568 AI064960 AI174753 AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468
AA332728 T51362 AI114589 R06691 AI110629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60512 AI114676 AI064778 AA035710 W52763 AI114786
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AJ913646
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289
AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330
W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158
T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918
T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300
T28321 T55864
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TABLE 2: Figure 2 from BRCA 001 US 

Table 2 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor
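By way of illustration only, the column layout described above can be read programmatically. The following minimal Python sketch assumes rows of the form shown in Table 2 (Pkey, ExAccn, an optional UnigeneID beginning with "Hs.", a title, and R1). The names Row, ROW_RE and parse_row, and the cut-off of 10, are hypothetical and form no part of the disclosure.

```python
import re
from typing import NamedTuple, Optional


class Row(NamedTuple):
    pkey: str
    ex_accn: str
    unigene_id: Optional[str]  # None for rows that carry a "gb:" title instead
    title: str
    r1: float


# Assumed layout: Pkey, ExAccn, optional "Hs.<digits>" UnigeneID, title, R1.
ROW_RE = re.compile(
    r"^(?P<pkey>\d+)\s+(?P<accn>\S+)\s+"
    r"(?:(?P<ug>Hs\.\d+)\s+)?"
    r"(?P<title>.+?)\s+(?P<r1>\d+(?:\.\d+)?)$"
)


def parse_row(line: str) -> Optional[Row]:
    """Split one table row into its fields; return None if it does not fit."""
    m = ROW_RE.match(line.strip())
    if m is None:
        return None
    return Row(m["pkey"], m["accn"], m["ug"], m["title"], float(m["r1"]))


rows = [parse_row(s) for s in (
    "100499 T51986 Hs.283108 hemoglobin, gamma G 10",
    "110976 AL044174 Hs.159526 patched (Drosophila) homolog 5",
    "101447 M21305 gb:Human alpha satellite and satellite 3 10",
)]

# Hypothetical cut-off: keep probesets whose normal-to-tumor ratio is >= 10.
strongly_down = [r.pkey for r in rows if r is not None and r.r1 >= 10]
```

Rows lacking a UnigeneID are returned with unigene_id set to None, so they can be looked up by Pkey in the accompanying accession table.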



Pkey ExAccn UnigeneID Unigene Title R1

100499 T51986 Hs.283108 hemoglobin, gamma G 10 

100549 BE142019 Hs.222056 Homo sapiens cDNA FLJ11572 fis, clone HE 10

100654 A03758 NM_000477:Homo sapiens albumin (ALB), m 10

100971 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 10

101184 NM_001674 Hs.460 activating transcription factor 3 10

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10

101367 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10

101447 M21305 gb:Human alpha satellite and satellite 3 10

101461 N98569 Hs.76422 phospholipase A2, group IIA (platelets, 10

101511 M27826 Hs.267319 endogenous retroviral protease 10

101736 M74447 Hs.502 transporter 2, ATP-binding cassette, sub-famil 10

102208 U22961 gb:Human mRNA clone with similarity to L 10

102450 U48251 Hs.75871 protein kinase C binding protein 1 10

102800 AA313538 gb:EST185419 Colon carcinoma (HCC) cell 10

102857 NM_006744 Hs.76461 retinol-binding protein 4, interstitial 10

102990 AA829286 Hs.332053 serum amyloid A1 10

103747 AA081995 gb:zn26d06.r1 Stratagene neuroepithelium 10

103812 AA137107 Hs.326391 Homo sapiens, clone MGC:16638, mRNA, com 10

104093 R50727 Hs.336970 ESTs 10 

104109 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 10 

104250 F06638 Hs.12440 Homo sapiens clone 24734 mRNA sequence 10 

104492 N73185 Hs.94285 EST 10 

104506 N91071 Hs.109650 ESTs 10 

104532 AI498763 Hs.203013 hypothetical protein FLJ12748 10

104677 AA009764 Hs.190380 ESTs 10

104711 AA017245 Hs.32794 ESTs 10

104731 AA019300 Hs.125070 ESTs, Moderately similar to I54374 gene 10

105005 AI298208 Hs.28805 ESTs 10

105036 AA130390 Hs.25549 hypothetical protein FLJ20898 10

105239 AA221036 gb:zr03f12.r1 Stratagene NT2 neuronal pr 10

106052 N79885 Hs.6382 ESTs, Highly similar to T00391 hypotheti 10

106181 AI803651 Hs.191608 ESTs 10

106283 AI085846 Hs.25522 KIAA1808 protein 10

106379 AL042069 Hs.119021 DKFZP434N061 protein 10

106451 AW235928 Hs.313182 ESTs 10

106491 AA135688 Hs.10083 Homo sapiens, clone IMAGE:4139786, mRNA, 10

106782 AW054886 Hs.25682 Homo sapiens mRNA for KIAA1863 protein, 10

107124 AB006532 Hs.31442 RecQ protein-like 4 10

107148 AI005036 Hs.334305 GS1999full 10

107214 AF127026 Hs.5394 myosin IA 10

107242 AB020672 Hs.175411 KIAA0865 protein 10

107331 AI905985 Hs.111805 ESTs 10

107447 W28516 Hs.19210 hypothetical protein MGC11308 10

107451 AL042425 Hs.283976 hypothetical protein PRO2389 10

107872 BE271708 Hs.95110 ESTs, Weakly similar to A55943 1-phospha 10

108351 AA071193 gb:zf79b12.s1 Soares_pineal_gland_N3HPG 10

109546 F01449 Hs.26954 Homo sapiens mRNA; cDNA DKFZp762G123 (fr 10

110433 AW294162 Hs.301062 UDP-N-acetyl-alpha-D-galactosamine:polyp 10

110976 AL044174 Hs.159526 patched (Drosophila) homolog 5 




WO 02/059377 PCT/US02/02242




111168 AI798376 gb:u34b07j(1 NCI_CGAP_Ov23 Homo sapiens 10

111651 R16733 Hs.20499 ESTs 10

111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10

114484 AA034378 Hs.267319 endogenous retroviral protease 10

125284 NM_002666 Hs.103253 perilipin 10

128850 AA193106 Hs.180817 chromosome 11 open reading frame 23 5

128903 AW150717 Hs.296176 STAT induced STAT inhibitor 3 10

129346 AF110141 Hs.288908 WAS protein family, member 2 10

129381 AW245805 Hs.110903 claudin 5 (transmembrane protein deleted 10

129516 AF020038 Hs.11223 isocitrate dehydrogenase 1 (NADP+), solu 10

129554 BE222078 Hs.113069 ESTs 10

130085 M62402 Hs.274313 insulin-like growth factor binding prote 10

130243 D88435 Hs.153227 cyclin G associated kinase 10

130400 V00517 Hs.283108 hemoglobin, gamma G 10

130436 NM_001928 Hs.155597 D component of complement (adipsin) 10

130563 BE270472 Hs.279900 HSPC015 protein 10

130589 AL110226 Hs.16441 DKFZP434H204 protein 10

130683 AA993269 Hs.17872 Homo sapiens, clone IMAGE:3875012, mRNA 10

130689 NM_006691 Hs.17917 extracellular link domain-containing 1 10

130689 AA046747 Hs.17917 extracellular link domain-containing 1 10

130718 N70196 Hs.18376 KIAA1319 protein 10

130798 M81349 Hs.1955 serum amyloid A4, constitutive 10

130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10

131184 AB040935 Hs.23954 cerebral cell adhesion molecule 10

131282 X03350 Hs.4 alcohol dehydrogenase 1B (class I), beta 10

131328 AW939251 Hs.25647 v-fos FBJ murine osteosarcoma viral onco 10

131543 AW956881 Hs.41639 programmed cell death 2 10

131753 AA829286 Hs.332053 serum amyloid A1 10

131785 H69342 Hs.26320 TRABID protein 10

131828 AJ000263 Hs.278658 keratin, hair, basic, 6 (monilethrix) 10

132426 AW118072 Hs.89981 diacylglycerol kinase, zeta (104kD) 10

132675 AI291496 Hs.5476 Homo sapiens, clone IMAGE:3530123, mRNA, 10

132898 W28548 Hs.224829 ESTs 10

132905 NM_004235 Hs.7934 Kruppel-like factor 4 (gut) 10

133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10

133407 AF017987 Hs.7306 secreted frizzled-related protein 1 10

133719 H26904 Hs.75736 apolipoprotein D 10

134007 AF072441 Hs.7840 calcineurin binding protein 1 10

134055 D86062 Hs.182423 ES1 (zebrafish) protein, human homolog o 10

134111 AI372588 Hs.8022 TU3A protein 5

134117 AA081846 Hs.7921 Homo sapiens mRNA; cDNA DKFZp566E183 (fr 5

134177 BE243319 Hs.79672 KIAA0652 gene product 10

134369 AF207664 Hs.8230 a disintegrin-like and metalloprotease ( 10

134496 M64936 gb:Homo sapiens retinoic acid-inducible 10

134510 NM_002757 Hs.250870 mitogen-activated protein kinase kinase 10

134550 M26315 Hs.85258 CD8 antigen, alpha polypeptide (p32) 5

134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 5

134963 NM_003394 Hs.91985 wingless-type MMTV integration site fami 10

135066 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 10

408790 AW580227 Hs.47860 neurotrophic tyrosine kinase, receptor, type 2 10

446674 AA563892 Hs.306000 solute carrier family 4 (anion exchanger), memb 10






TABLE 2A

Table 2A shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 2.
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

i 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 
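
As a minimal sketch of how a row of this table could be read programmatically, the hypothetical helpers below split a row into its Pkey, CAT number, and accession list, and flag tokens that do not look like Genbank accessions. The function names, the accession pattern (one letter plus five digits, two letters plus six digits, or an NM_ identifier), and the assumption that the CAT number is a single token such as 117944_1 are illustrative assumptions, not part of the described method.

```python
import re

# Assumed accession shapes: 1 letter + 5 digits, 2 letters + 6 digits,
# or an "NM_" prefix followed by 6 digits.
ACCESSION_RE = re.compile(r"^(?:[A-Z]\d{5}|[A-Z]{2}\d{6}|NM_\d{6})$")


def parse_cluster_row(line):
    """Split 'Pkey CAT_number acc1 acc2 ...' into its three parts."""
    pkey, cat_number, *accessions = line.split()
    return {"pkey": int(pkey), "cat_number": cat_number, "accessions": accessions}


def malformed_accessions(accessions):
    """Return the tokens that do not match the assumed accession shapes."""
    return [a for a in accessions if not ACCESSION_RE.match(a)]


if __name__ == "__main__":
    row = parse_cluster_row("103747 117944_1 AA081995 AA101099")
    print(row["pkey"], row["cat_number"], row["accessions"])
    # Tokens damaged in transcription are reported for manual review.
    print(malformed_accessions(["AA081995", "AWB17659", "T61"]))
```

A check of this kind only points at tokens worth a manual look; it cannot say what the intended accession was.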


Pkey CAT number Accessions 

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

103747 117944_1 AA081995 AA101099 

134496 46501_1 M64936 AI025512 AI382987 BE061777 AA089966 BE169930 T41176 AW594624 BE502415 AA121893 AI269283 T40311 
AI684569 AA257011 AI079277 AI241318 BE327710 AW975215 AW896268 AA884990 BE327514 

105239 34624_1 AA221036 R87170 BE537068 BE544757 C18935 AW812058 T92565 AA227415 AA233942 AA223237 AA668403 AA601627 
AW869639 BE061833 BE000620 AW961170 AW847519 AA308542 AW821833 AW945688 C04699 AA205504 AA377241 
AW821667 AA055720 AW817981 AW856468 AA155719 AA179928 T03007 AW754298 AA227407 AA113928 AA307904 
C16859 

100654 tigr_HT2969 A03758 A06977 A15293 D17029 D17107 D17171 L00132 L00133 M12523 M13075 M13076 M92816 U22961 V00494 V00495 
X51363 X51364 X51365 

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 
AI685092 AI093426 AI623873 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 
N70806 AI141254 AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 
AA877103 W84464 AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 
AI133272 V00494 M12523 M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388 
T85706 AF075298 AI110799 D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 
AI133290 AI133304 AI174948 AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759 
AI207568 AI064960 AI174753 AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 
H94399 AI133158 T74675 AA484750 T73413 T55909 R50261 T72081 N80533 T51189 T74936 AI207490 AI132925 AI064701 
AI174748 AI114663 AI133104 AI132999 AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 
AI444620 T52290 D16931 T40012 T48403 T58926 T69195 AI133061 T50850 AI400877 AI091136 AA334608 T57411 Z20979 
N56507 T87485 AI133622 AA343370 T40075 T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 
AA332728 T51362 AI114589 R06691 AI110629 AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 
AI114840 T57284 T39981 T61407 T72757 T51749 T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 
AI132926 R09237 AI064838 AI133660 T60398 T88753 T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106 
R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 AI133040 AI133017 AI064857 AI110730 AF074637 AI207567 
H71080 T73217 AA343950 AI174743 AA334224 AA334281 R06692 T64739 T40163 T60628 T81661 T73179 R01842 
AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 AI065049 T84512 T55918 AI207595 T39951 
AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 AI064778 AA035710 W52763 AI114786 
T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 AI110809 R28984 AI174854 AA305675 
AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 AF075308 W86731 T82851 T48269 
H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 R02576 T95666 AI203974 
AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 AI133117 AI766232 AI913646 
T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 AI114720 AI433289 
AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 T78943 AI823481 
AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 T67411 T68318 
AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 AA025050 T73330 
W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 T57232 AI336158 
T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 T67708 T70918 
T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 T69394 
AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 AI312890 
T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 T64300 
T28321 T55864 

102800 14782_20 AA313538 U88895 U88902 

108351 genbank_AA071193 AA071193 

101447 entrez_M21305 M21305 

TABLE 3: Figure 3 from BRCA 001 US 
Table 3 shows genes downregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of normal breast tissue to tumor 
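
To illustrate how a ratio such as R1 could be derived, the following sketch divides a normal-tissue expression value by a tumor-tissue value and keeps the probesets whose ratio meets a cutoff. The probeset keys, intensity values, cutoff of 2.0, and function names are all hypothetical; the source does not state how intensities were measured or normalized.

```python
def r1_normal_to_tumor(normal, tumor):
    """Ratio of normal-tissue expression to tumor-tissue expression."""
    if tumor <= 0:
        raise ValueError("tumor expression must be positive")
    return normal / tumor


def downregulated_pkeys(expression, cutoff=2.0):
    """Return, in sorted order, the keys whose normal/tumor ratio is >= cutoff."""
    return sorted(
        pkey
        for pkey, (normal, tumor) in expression.items()
        if r1_normal_to_tumor(normal, tumor) >= cutoff
    )


if __name__ == "__main__":
    # Hypothetical (normal, tumor) intensities keyed by made-up probeset ids.
    expression = {1: (500.0, 50.0), 2: (120.0, 100.0)}
    print(downregulated_pkeys(expression))  # [1]
```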



Pkey ExAccn UnigeneID Unigene Title R1 

101336 NM_006732 Hs.75678 FBJ murine osteosarcoma viral oncogene h 10.0 
102208 U22961 gb:Human mRNA clone with similarity to L 10.0 
102990 AA829286 Hs.332053 serum amyloid A1 10.0 
111168 AI798376 gb:tr34b07.x1 NCI_CGAP_Ov23 Homo sapiens 10.0 
111803 AA593731 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 10.0 
130085 M62402 Hs.274313 insulin-like growth factor binding prote 10.0 
130840 BE048821 Hs.20144 small inducible cytokine subfamily A (Cy 10.0 
131543 AW966881 Hs.41639 programmed cell death 2 10.0 
133120 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 10.0 
134758 NM_000078 Hs.89538 cholesteryl ester transfer protein, plas 10.0 

TABLE 3A 

Table 3A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 3. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
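
The relationship between an expression table and its companion accession table can be sketched as a lookup: rows without a UnigeneID are matched to their gene cluster by Pkey. The function name and data layout below are hypothetical, and the CAT numbers are written with an underscore on the assumption that this was the original separator.

```python
def clusters_for_unannotated(expression_rows, cluster_by_pkey):
    """Map each Pkey that lacks a UnigeneID to its CAT number (None if unlisted)."""
    return {
        pkey: cluster_by_pkey.get(pkey)
        for pkey, _accession, unigene_id in expression_rows
        if unigene_id is None
    }


if __name__ == "__main__":
    # (Pkey, ExAccn, UnigeneID) triples; None marks a missing UnigeneID.
    table3_rows = [
        (102208, "U22961", None),
        (102990, "AA829286", "Hs.332053"),
        (111168, "AI798376", None),
    ]
    cluster_by_pkey = {111168: "38585_1", 102208: "6735_9"}
    print(clusters_for_unannotated(table3_rows, cluster_by_pkey))
```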



Pkey CAT number Accessions 

111168 38585_1 AI798376 S46400 AW811617 AW811616 W00557 BE142245 AW858232 AW861851 AW858362 AA232351 AA218567 
AA055556 AW858231 AW857541 AW814172 H66214 AW814398 AF134164 AA243093 AA173345 AA199942 AA223384 
AA227092 AA227080 T12379 AA092174 T61139 AA149776 AA699829 AW879188 AW813567 AW813538 AI267168 AA157718 
AA157719 AA100472 AA100774 AA130756 AA157705 AA157730 AA157715 AA053524 AW849581 AW854566 C05254 
AW882836 T92637 AW812621 AA206583 AA209204 BE156909 AA226824 AI829309 AW991957 N66951 AA527374 H66215 
AA045564 AI694265 H60808 AA149726 AW195620 BE081333 BE073424 AW817662 AW817705 AW817703 AW817659 
BE081531 H59570 

102208 6735_9 U22961 AA203623 AA503337 AI174733 AI192802 C06092 AA035357 AI190619 AI199244 AI828450 AA602296 AI378195 
AI209170 AI186653 AI127795 AI183846 H77389 AI589465 AA629390 H94306 AI018388 R68584 AA027196 AI745413 AI685092 
AI093426 AI623673 AI074570 N50096 AA047486 N25060 AA327614 AI042512 AI383957 AA156873 AI333101 N70806 AI141254 
AI383191 AI401237 AI080709 AI093400 W84549 T90806 R00012 W01413 AA630557 AI378348 AI559265 AA877103 W84464 
AA625146 R68379 AI133207 AI132980 AI133214 AI064826 AI061615 AI133473 AI174852 AI133404 AI133272 V00494 M12523 
M12523 AI207526 AI133120 AI064802 AI174993 AI114729 AI061645 AI064716 AI064959 H77388 T85706 AF075298 AI110799 
D17107 NM_000477 AF190168 R50724 AI248416 AI207432 AI133684 AI133345 AI174710 AI133290 AI133304 AI174948 
AI207484 AI110717 AF074624 AI114515 AF063516 AI110642 AI114559 AI114498 AI114759 AI207568 AI064960 AI174753 
AI114666 R69184 R00011 AI064997 T60501 AI207701 T71735 AA385318 H73569 T60496 H94399 AI133158 T74675 AA484750 
T73413 T55909 R50261 T72061 N80533 T51189 T74936 AI207490 AI132925 AI064701 AI174748 AI114653 AI133104 AI132999 
AI133100 AI064925 AI064979 AI133063 AA343347 T69091 AA233989 T39772 AI444620 T52290 D16931 T40012 T48403 
T58926 T69195 AI133061 T50850 AI400677 AI091136 AA334608 T57411 Z20979 N56507 T87485 AI133622 AA343370 T40075 
T69671 T53849 T74820 AF075316 AI110818 T40121 T57381 AI114468 AA332728 T51362 AI114589 R06691 AI110629 
AF063503 AI140543 AA334661 AA332720 AA343262 T73513 T86549 AI114840 T57284 T39981 T61407 T72757 T51749 
T56630 AA343125 T72126 R94135 T83028 T39972 T39896 AI174786 AI132926 R09237 AI064838 AI133660 T60398 T88753 
T55930 T92126 AI444602 T60996 AI114792 H93911 AI133106 R10779 AI065020 T90925 T50889 D17029 AI133703 AA333805 
AI133040 AI133017 AI064857 AI110730 AF074637 AI207567 H71080 T73217 AA343950 AI174743 AA334224 AA334281 
R06692 T64739 T40163 T60628 T81661 T73179 R01842 AA501730 T39931 T39662 T40136 AA334904 T71425 H77784 R00874 
AI065049 T84512 T55918 AI207595 T39951 AA005016 T60361 T69176 T73356 T58795 T61233 T39955 T60612 AI114676 
AI064778 AA035710 W52763 AI114786 T83564 AA341859 T81684 T55769 AI114710 T51776 AA343213 AI114714 T58102 
AI110809 R28984 AI174854 AA305675 AA343592 T53836 T46869 T64721 T55508 W05241 T54019 T57945 T60513 T48364 
AF075308 W86731 T82851 T48269 H54053 T73211 AI114590 T48317 T55965 T74857 R84226 T56552 T52231 T74946 T76976 
R02576 T95666 AI203974 AI189471 AA005147 AI478102 AI207662 AI192792 AI768421 AI064737 AW051713 AA936693 
AI133117 AI766232 AI913646 T83962 AI065112 AI207689 AI174684 AI207702 T81475 AI133325 AI032512 AA701169 AI936354 
AI114720 AI433289 AA046980 AI823482 AI114536 AA860651 AW242644 R07469 AW300438 AI133416 AW271670 AI991363 
T78943 AI823481 AA845518 AA719124 AA883454 T68850 T69115 AI935509 AI150977 T62890 T71374 T68294 AI174774 
T67411 T68318 AI064689 T56624 T69010 T68982 T68302 AI332829 T72908 AI064819 AI205880 T62895 T69430 T95111 
AA025050 T73330 W52657 T71984 T69118 W92684 AI114860 T62093 T61797 AI522333 T73322 H92981 T56018 T61811 
T57232 AI336158 T61821 T69457 T62900 T62912 T72917 T46885 AI702448 T57212 T57203 R94581 T71311 T61819 T89358 
T67708 T70918 T59166 AI187111 T64308 T62071 T69427 AI114750 T60430 R09734 T69033 T69141 T69453 T67908 R16809 
T69394 AI207729 T55839 T90273 T73339 AW194909 T75486 T71850 T71305 T71287 T53877 T73452 T68852 N75290 
AI312890 T67751 AI174983 T51679 T54851 H69880 N73734 AA443453 T73466 H69672 N53869 T68447 D11809 D12412 
T64300 T28321 T55864 

TABLE 4: Figure 4 from BRCA 001 US 
Table 4 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of tumor to normal breast tissue 
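
A row with the columns defined above can be split back into fields with a short parser. This is only a sketch under stated assumptions: the function name is hypothetical, the row is taken to be whitespace-separated with R1 as the last token, and a UnigeneID is recognized solely by an "Hs." prefix, so rows without one (titles beginning "gb:") yield None.

```python
def parse_expression_row(line):
    """Split 'Pkey ExAccn [UnigeneID] title... R1' into named fields."""
    tokens = line.split()
    pkey, accession = int(tokens[0]), tokens[1]
    r1 = float(tokens[-1])
    middle = tokens[2:-1]
    # The UnigeneID column is optional; detect it by its "Hs." prefix.
    unigene_id = middle.pop(0) if middle and middle[0].startswith("Hs.") else None
    return {
        "pkey": pkey,
        "accession": accession,
        "unigene_id": unigene_id,
        "title": " ".join(middle),
        "r1": r1,
    }


if __name__ == "__main__":
    rows = [
        parse_expression_row("101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2"),
        parse_expression_row("101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4"),
    ]
    # Rank probesets from the largest tumor-to-normal ratio downward.
    for row in sorted(rows, key=lambda r: r["r1"], reverse=True):
        print(row["pkey"], row["title"], row["r1"])
```

Because a title may itself end in a number (for example "phosphoglucomutase 1"), the parser takes only the final token as R1 and leaves everything before it in the title.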



Pkey ExAccn UnigeneID Unigene Title R1 

100113 NM_001269 Hs.84746 chromosome condensation 1 2.3 
100114 X02308 Hs.82962 thymidylate synthetase 2.9 
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodiesterase 1 1.9 
100146 BE185499 Hs.2471 KIAA0020 gene product 1.9 
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin I-like) (periostin) 7.5 
100154 H60720 Hs.81892 KIAA0101 gene product 9.2 
100163 W44671 Hs.124 gene predicted from cDNA with a complete coding sequence 1.6 
100220 AW015534 Hs.217493 annexin A2 2.0 
100265 D38521 Hs.112396 KIAA0077 protein 1.5 
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgizzarin) 13.5 
100275 BE242802 Hs.154797 KIAA0090 protein 5.1 
100323 D50920 Hs.23106 KIAA0130 gene product 1.9 
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrolase, isoform Ib, gamma subunit (29kD) 2.7 
100364 NM_004341 Hs.154868 carbamoyl-phosphate synthetase 2, aspartate transcarbamylase, and dihydroorotase 2.0 
100372 NM_014791 Hs.184339 KIAA0175 gene product 2.6 
100393 D84145 Hs.39913 novel RGD-containing protein 3.2 
100400 AW954324 Hs.75790 phosphatidylinositol glycan, class C 1.5 
100418 D86978 Hs.84790 KIAA0225 protein 2.0 
100482 M65028 Hs.81361 heterogeneous nuclear ribonucleoprotein A/B 2.9 
100518 NM_004415 Hs.74316 desmoplakin (DPI, DPII) 1.9 
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 5.7 
100667 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 9.0 
100668 L05424 Hs.169610 CD44 antigen (homing function and Indian blood group system) 7.6 
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.5 
100685 AA328229 Hs.184582 ribosomal protein L24 1.8 
100690 AA383256 Hs.1657 estrogen receptor 1 1.6 
100783 AF078847 Hs.191356 general transcription factor IIH, polypeptide 2 (44kD subunit) 5.9 
100850 AA836472 Hs.297939 cathepsin B 1.7 
100892 BE245294 Hs.180789 S164 protein 1.7 
100945 AF002225 Hs.180686 ubiquitin protein ligase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 1.5 
100969 AA157634 Hs.79172 solute carrier family 25 (mitochondrial carrier; adenine nucleotide translocator), member 5 6.3 
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4 
100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 reductase) 1.6 
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B, 92kD gelatinase, 92kD type IV collagenase) 8.2 
101045 J05614 gb:Human proliferating cell nuclear antigen (PCNA) gene, promoter region. 5.0 
101077 N99692 Hs.75227 Empirically selected from AFFX single probeset 2.6 
101093 L06419 Hs.75093 procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase, Ehlers-Danlos syndrome type VI) 1.4 
101161 NM_006262 Hs.37044 peripherin 16.9 
101186 AA020956 Hs.179881 core-binding factor, beta subunit 2.0 
101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK2-associated dual specificity phosphatase) 1.8 
101228 AA333387 Hs.82916 chaperonin containing TCP1, subunit 6A (zeta 1) 1.7 
101247 M132666 Hs.78802 glycogen synthase kinase 3 beta 1.9 
101249 L18964 Hs.1904 protein kinase C, iota 1.5 
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 5.2 
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 3.4 
101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxidase assembly protein 6.3 
101396 BE267931 Hs.78996 proliferating cell nuclear antigen 4.2 
101445 M21259 gb:Human Alu repeats in the region 5' to the small nuclear rib 1.9 
101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 1.6 
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activating protein) 1 2.5 

101483 M24486 Hs.76768 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide I 5.5 
101540 J04977 Hs.84981 X-ray repair complementing defective repair in Chinese hamster cells 5 (double-strand-break rejoining 2.1 
101573 AW248421 Hs.250758 proteasome (prosome, macropain) 26S subunit, ATPase, 3 1.6 
101580 NM_012151 Hs.83363 coagulation factor VIII-associated (intronic transcript) 5.7 
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G protein), beta polypeptide 2 1.8 
101592 AF064853 Hs.91299 guanine nucleotide binding protein (G pr 5.6 
101621 BE391804 Hs.62661 guanylate binding protein 1, interferon-inducible, 67kD 2.4 
101702 AW504089 Hs.179574 protein phosphatase 2 (formerly 2A), regulatory subunit B (PR 52), alpha isoform 1.3 
101734 M74099 Hs.147049 cut (Drosophila)-like 1 (CCAAT displacement protein) 2.1 
101759 M80244 Hs.184601 solute carrier family 7 (cationic amino acid transporter, y+ system), member 5 5.0 
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 
101782 AA306495 Hs.1869 phosphoglucomutase 1 5.2 
101805 AW409747 Hs.75612 stress-induced-phosphoprotein 1 (Hsp70/Hsp90-organizing protein) 8.6 
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psoriasin 1) 8.9 
101810 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Zellweger syndrome) 3.2 
101879 AA176374 Hs.243886 nuclear autoantigenic sperm protein (histone-binding) 1.6 
101911 AA441787 Hs.119689 glycoprotein hormones, alpha polypeptide 31.3 
101920 AF182645 Hs.8024 IK cytokine, down-regulator of HLA II 1.8 
101973 U41514 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalacto 2.4 
101983 AI904232 Hs.75323 prohibitin 6.4 
102009 BE245149 Hs.82643 protein tyrosine kinase 9 1.3 
102036 BE250127 Hs.82906 CDC20 (cell division cycle 20, S. cerevisiae, homolog) 2.0 
102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 45kD 1.6 
102083 T35901 Hs.75117 interleukin enhancer binding factor 2, 4 1.3 
102107 BE258602 Hs.182366 heat shock protein 75 1.4 
102123 NM_001809 Hs.1594 centromere protein A (17kD) 1.8 
102165 BE313280 Hs.159627 death associated protein 3 4.6 
102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regulatory subunit (50kD) 4.3 
102217 AA829978 Hs.301613 JTV1 gene 6.7 
102220 U24389 Hs.65436 lysosomal 4.3 
102234 AW163390 Hs.278554 heterochromatin-like protein 1 1.9 
102260 AL039104 Hs.159557 karyopherin alpha 2 (RAG cohort 1, importin alpha 1) 4.4 
102302 AA306342 Hs.69171 protein kinase C-like 2 2.7 
102330 BE298063 Hs.77254 chromobox homolog 1 (Drosophila HP1 beta) 1.5 
102339 BE378432 Hs.95577 cyclin-dependent kinase 4 2.3 
102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member B2 2.0 
102349 AU077055 Hs.289107 baculoviral IAP repeat-containing 2 3.2 
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha 2.0 
102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 6.2 
102391 AA296874 Hs.77494 deoxyguanosine kinase 1.5 
102455 U48705 Hs.75562 discoidin domain receptor family, member 1 6.9 
102465 NM_001359 Hs.81548 2,4-dienoyl CoA reductase 1, mitochondrial 1.8 
102488 U50939 Hs.61828 amyloid beta precursor protein-binding protein 1, 59kD 1.5 
102489 AL080116 Hs.74420 origin recognition complex, subunit 3 (yeast homolog)-like 3.3 
102494 AI188137 Hs.75193 COP9 homolog 2.1 
102501 AF217197 Hs.74562 siah binding protein 1; FBP interacting repressor; pyrimidine tract binding splicing 3.2 
102522 BE250944 Hs.163556 solute carrier family 1 (neutral amino acid transporter), member 5 2.8 
102532 AF040253 Hs.70186 suppressor of Ty (S. cerevisiae) 5 homolog 5.7 
102564 U59423 Hs.79067 MAD (mothers against decapentaplegic, Drosophila) homolog 1 2.3 
102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3 
102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatidate cytidylyltransferase) 1 2.1 
102581 AU077228 Hs.77256 enhancer of zeste (Drosophila) homolog 2 1.6 
102582 U61232 Hs.32675 tubulin-specific chaperone e 2.1 
102617 AW161453 Hs.198767 COP9 (constitutive photomorphogenic, Arabidopsis, homolog) subunit 5 1.8 
102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 
102627 AL021918 Hs.158174 zinc finger protein 184 (Kruppel-like) 1.3 
102663 NM_002270 Hs.168075 karyopherin (importin) beta 2 1.8 
102676 BE262989 Hs.12045 putative protein 2.3 
102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 4.3 
102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, type II 6.0 
102696 BE540274 Hs.239 forkhead box M1 4.2 
102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 
102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cys-Cys), member 19 2.3 
102750 AB014460 Hs.66196 nth (E.coli endonuclease III)-like 1 1.2 
102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinase 6.4 
102812 U90549 Hs.236774 high-mobility group (nonhistone chromosomal) protein 17-like 3 1.6 
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (beta) 5.6 
102831 AA262170 Hs.80917 adaptor-related protein complex 3, sigma 1 subunit 2.0 
102844 AV653790 Hs.324275 WW domain-containing protein 1 1.3 
102868 X02419 Hs.77274 plasminogen activator, urokinase 4.4 
102925 BE440142 Hs.2943 signal recognition particle 19kD 1.9 
102935 BE561850 Hs.80508 small nuclear ribonucleoprotein polypeptide A' 2.4 
102968 AU076611 Hs.154672 methylene tetrahydrofolate dehydrogenase (NAD+ dependent), methenyltetrahydrofolate cyclohydrolase 2.7 
102983 BE387202 Hs.118638 non-metastatic cells 1, protein (NM23A) expressed in 3.1 
102985 U95742 Hs.2707 G1 to S phase transition 1 5.2 
103023 AW500470 Hs.117950 multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 1.6 
103038 AA926960 Hs.334883 CDC28 protein kinase 1 2.5 
103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stromelysin 3) 4.5 
103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatosis 1) 3.1 
103089 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaphyseal chondrodysplasia) 2.4 
103177 BE244377 Hs.48876 farnesyl-diphosphate farnesyltransferase 1 3.5 
103178 AA205475 Hs.275865 ribosomal protein S18 9.9 
103179 NM_001777 Hs.82685 CD47 antigen (Rh-related antigen, integrin-associated signal transducer) 1.3 
103181 X69636 Hs.334731 Homo sapiens, clone IMAGE:3448306, mRNA, partial cds 2.0 
103185 NM_006825 Hs.74368 transmembrane protein (63kD), endoplasmic reticulum/Golgi intermediate compartment 1.6 
103191 AA401039 Hs.2903 protein phosphatase 4 (formerly X), catalytic subunit 2.5 
103193 NM_004766 Hs.75724 coatomer protein complex, subunit beta 2 (beta prime) 2.2 
103194 NM_004939 Hs.78580 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 6.3 
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 
103223 BE275607 Hs.1708 chaperonin containing TCP1, subunit 3 (gamma) 3.0 
103232 X75962 Hs.129780 tumor necrosis factor receptor superfamily, member 4 1.8 
103238 AI369285 Hs.75189 death-associated protein 5.6 
103297 NM_001545 Hs.9078 immature colon carcinoma transcript 1 1.9 
103330 AI803447 Hs.77496 small nuclear ribonucleoprotein polypeptide G 2.5 
103349 X89059 gb:H.sapiens mRNA for unknown protein expressed in macrophage 1.6 
103376 AL036166 Hs.323378 coated vesicle membrane protein 1.8 
103391 X94453 Hs.114366 pyrroline-5-carboxylate synthetase (glu 2.3 
103392 X94563 gb:H.sapiens dbi/acbp gene exon 1 & 2. 4.0 
103430 BE564090 Hs.20716 translocase of inner mitochondrial membrane 17 (yeast) homolog A 1.3 
103491 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 5.6 
103505 AL031224 Hs.33102 transcription factor AP-2 beta (activating enhancer-binding protein 2 beta) 5.1 
103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit, beta type, 8 (large multifunctional protease 7) 9.7 
103588 NM_006218 Hs.85701 phosphoinositide-3-kinase, catalytic, alpha polypeptide 2.0 
103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (campomelic dysplasia, autosomal sex-reversal) 1.3 
103621 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 2.0 
103622 AA609685 Hs.278672 membrane component, chromosome 11, surface marker 1 2.3 
103727 AI878883 Hs.296381 growth factor receptor-bound protein 2 1.3 
103749 AL135301 Hs.8768 hypothetical protein FLJ10849 1.8 
103754 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (from clone DKFZp586I2022) 1.3 
103780 AA094752 Hs.169992 hypothetical 43.2 Kd protein 7.5 
103795 H26531 Hs.7367 Homo sapiens BTB domain protein (BDPL) mRNA, partial cds 1.2 
103797 AA080912 gb:zn04d03.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone 5' similar 1.5 
103813 AI042582 Hs.181271 CGI-20 protein 1.5 
103855 W02363 Hs.302267 hypothetical protein FLJ10330 1.5 
103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to constitutive photomorphogenic protein 1 6.5 
104052 NM_002407 Hs.97644 mammaglobin 2 2.9 
104079 AA251242 Hs.103238 ESTs 1.4 
104174 AA478984 Hs.6451 PRO0659 protein 5.6 
104227 AB002343 Hs.98938 protocadherin alpha 9 1.8 
104275 AI751970 Hs.101067 GCN5 (general control of amino-acid synthesis, yeast, homolog)-like 2 5.4 
104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polypeptide K (7.0kD) 6.3 
104370 AA324597 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 1.6 
104423 R83113 Hs.1432 protein kinase C substrate 80K-H 5.2 
104482 AB037762 Hs.44268 myelin gene expression factor 2 1.2 
104532 AI498763 Hs.203013 hypothetical protein FLJ12748 2.1 
104563 AL117403 Hs.306189 DKFZP434F1735 protein 1.2 
104667 AI239923 Hs.30098 ESTs 1.3 
104757 AI694413 Hs.332649 olfactory receptor, family 2, subfamily I, member 6 2.3 
104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapiens] 1.3 
104806 AB023175 Hs.22982 KIAA0958 protein 2.3 
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 10.9 
104846 AI250789 Hs.32478 ESTs 5.6 
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kinase-1 12.3 
104867 AA278898 Hs.225979 hypothetical protein similar to small G proteins, especially 2.0 

104871 T78044 Hs.28893 Homo sapiens mRNA; cDNA DKFZp564O2364 (from clone DKFZp564O2364) IJ 
104896 AW015318 Hs.23165 ESTs 17.7 
104909 AW408164 Hs.249184 transcription factor 19 (SC1) 5.0 
104916 AW958157 Hs.155489 NS1-associated protein 1 1.7 
104919 AA026880 Hs.25252 prolactin receptor 1.4 
104930 AF043467 Hs.32893 neurexophilin 2 2.2 
104973 NM_015310 Hs.6763 KIAA0942 protein 5.0 
104974 Y12059 Hs.276675 bromodomain-containing 4 1.4 
104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 2.4 
104975 AL136877 Hs.50758 SMC4 (structural maintenance of chromoso 2.3 
104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010317E24 gene, clone IMAGE:3502019, mRNA, partial cds 7^ 
104979 AA937934 Hs.321062 ESTs 1.3 
104994 AI499930 Hs.334885 mitochondrial GTP binding protein 3.5 
105009 BE379584 Hs.34789 dolichyl-diphosphooligosaccharide 5.5 
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3 
105028 AI050715 Hs.2331 E2F transcription factor 5, p130-binding 2.2 
105032 AA127818 gb:zl12a02.s1 Soares_pregnant_uterus_NbHPU Homo sapiens cDNA clone IMAGE:501674 3' 6.8 
105039 AA907305 Hs.36475 ESTs 2.5 
105041 AB037716 Hs.26204 KIAA1295 protein 2.2 
105045 BE242899 Hs.129951 speckle-type POZ protein 3.8 
105079 AA151342 Hs.12677 CGI-47 protein 9.5 
105087 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HEMBA1002716 5.6 
105088 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PLACE1004114 2.2 
105095 Z78407 Hs.27023 vesicle transport-related protein 2.2 
105110 BE387350 Hs.33122 KIAA1160 protein 1.6 
105126 AW975433 Hs.36288 ESTs &3 
105127 AA045648 Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 2.1 
105141 AA164687 Hs.177576 manncayHalpha-l^lycoproteto 2.7 
105158 AW976357 Hs.234545 hypothetical protein NUF2R 1.9 
105169 BE245294 Hs.180789 S164 protein 1.7 
105186 AA191512 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PLACE1010076 4.8 
105254 AA071276 Hs.19469 KIAA0859 protein 1.9 
105281 AA263143 Hs.24596 RAD51-interacting protein 2.8 
105288 N99673 Hs.3585 ESTs, Weakly similar to AF126743_1 DNAJ domain-containing protein MCJ [H.sapiens] 1.9 
105302 AA700122 Hs.3355 sentrin-specific protease 3.0 
105331 AW270037 Hs.179507 KIAA0779 protein 1.6 
105359 NM_016015 Hs.8054 CGI-68 protein &- 2 
105366 BE264645 Hs.282093 hypothetical protein FLJ21918 5.0 
105373 AW887701 Hs.32356 hypothetical protein FLJ20628 2.5 
105374 BE242803 Hs.262823 hypothetical protein FLJ10326 2.2 
105387 AW592146 Hs.108636 membrane protein CH1 2.3 
105393 AF167570 Hs.256583 interleukin enhancer binding factor 3, 90kD 5.4 
105399 BE386877 Hs.334811 Npw38-binding protein NpwBP 1.6 
105400 AF198620 Hs.65648 RNA binding motif protein 8A 1.6 
105445 AA252395 gb:zs12g10.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:685026 3', mRNA sequence. 5.0 
105507 BE268348 Hs.226318 CCR4-NOT transcription complex, subunit 7 1.6 
105529 AA113449 Hs.32471 hypothetical protein FLJ20364 1.3 
105530 AB023179 Hs.9059 KIAA0962 protein 3.4 
105547 AA262640 Hs.27445 unknown 9.3 
105564 BE616694 Hs.288042 hypothetical protein FLJ14299 1.4 
105596 AA579535 Hs.18490 hypothetical protein FLJ20452 10.9 
105597 AF054284 Hs.334826 splicing factor 3b, subunit 1, 155kD 2.9 
105608 AI808201 Hs.287863 hypothetical protein FLJ12475 1.7 
105610 AA280072 Hs.99872 fetal Alzheimer antigen 1.4 
105617 AK000892 Hs.4069 glucocorticoid modulatory element binding protein 1 1.7 
105620 AW302245 Hs.181390 casein kinase 1, gamma 2 5.5 
105658 AA985190 Hs.246875 hypothetical protein FLJ20059 9.4 
105697 AW499988 Hs.27801 zinc finger protein 278 2.0 
105708 R26944 Hs.180777 Homo sapiens mRNA; cDNA DKFZp564M0264 (from clone DKFZp564M0264) 1.7 
105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short 2.6 
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5 
105759 AI123118 Hs.15159 chemokine-like factor, alternatively spliced 1.3 
105771 AI267720 Hs.153221 synovial sarcoma, translocated to X chromosome 1.6 
105820 AA741336 Hs.152108 transcriptional unit N143 2.2 
105826 AA478756 Hs.194477 E3 ubiquitin ligase SMURF2 1.3 
105856 AI262106 Hs.12653 ESTs 2.4 
105858 AF151066 Hs.281428 hypothetical protein 2.9 

105875 AK001708 Hs.32271 hypothetical protein FU 10846 ™ 

105930 AF016371 Hs.9880 peptldyl prolyl Isomerase H {cydophifin H) 5.2 

106000 AW194426 Hs.20726 ESTs »= J-J 

5 ■ 106011 AW081202 Hs.12284 Homo saptens, done IMAGE:2989556, mRNA, partial cds 2.8 

106017 AA477956 Hs.26268 ESTs V \ 

106073 AL157441 Hs.17834 downstream neighbor of SON M 
106078 AA130158 Hs.19977 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.6 

108094 AA533491 Hs.23317 hypothetical protein FU14681 6.8 

10 106140 AB006624 Hs.14912 KIAA0286 protein 1* 

106271 AA251393 Hs.289052 Homo sapiens, Similar to RIKEN cDNA 6430429M05 gene, clone MGCH3155, mRNA, complete cds 10.8 

106288 AB037742 Hs.24336 KIAA1321 protein 1.3 

106300 Y10043 Hs.19114 high-mobility group (nonhistone chromosomal) protein 4 3.6 

106333 AL043114 Hs.22410 ESTs, Weakly similar to A54849 collagen alpha 1 (VII) chain precursor [H.saplens] 5.4 

15 106350 AK001404 Hs.194698 cyclin B2 5.7 

106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 6-3 

106381 AB040916 Hs.24106 KIAA1483 protein 6 -5 

106389 AW748420 Hs.6236 Homo sapiens cDNA: FU21487 fis, clone COL05419 22 . 

106457 AF1 19256 Hs.27801 zinc finger protein 278 2.7 

20 106470 D63078 Hs.1 86180 Homo sapiens cDNA: FLJ23038 fis, clone LNG02039 2.3 

106531 AA454036 Hs.8832 ESTs 1 -<J 

106586 AA243837 Hs.57787 ESTs IS 

106589 AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 fis, clone HEMBA1001702 2.4 

106610 AA458882 Hs.79732 fibulin 1 7.9

106624 NM_003595 Hs.26350 tyrosylprotein sulfotransferase 2 7.7

106650 AL049951 Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (from clone DKFZp564O0122) 1.8

106669 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 1.3
106713 BE614802 Hs.184352 hypothetical protein FU12549 

106717 AA600357 Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 1.3

30 106723 BE388094 Hs.21857 ESTs *■ W 

106795 AF174487 Hs.293753 Bcl-2-related ovarian killer protein-like 5.7

106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to ARL-6 interacting protein-2 16.2

106831 BE564871 Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast homolog) 1.5

106846 AB037744 Hs.34892 KIAA1323 protein 2.2

106852 AF151031 Hs.300631 hypothetical protein 1.3

106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, partial cds 16.8

106886 W79171 Hs.9567 GL002 protein 1.5

106906 AA861271 Hs.222024 transcription factor BMAL2 2.2

106920 AK001838 Hs.296323 serum/glucocorticoid regulated kinase 3.3

106945 AK000511 Hs.6294 hypothetical protein DKFZp434L1435 similar to valyl tRNA synthetase 6.8

106973 BE156256 Hs.11923 hypothetical protein 6.6

106977 AL043152 Hs.50421 KIAA0203 gene product 4.8 

106978 AW631480 Hs.8688 ESTs Jg 
107004 AA146872 Hs.300700 hypothetical protein FLJ20727 1.3

107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukemia 3 1.8

107071 AW385224 Hs.35198 ectonucleotide pyrophosphatase/phosphodiesterase 5 (putative function) 1.7

107113 AK000733 Hs.23900 GTPase activating protein 2.5

107125 AK000512 Hs.69388 hypothetical protein FLJ20505 1.7

107136 AV661958 Hs.8207 GK001 protein 4 j> 

107136 AV661958 Hs.8207 GK001 protein 3.3

107146 AK001455 Hs.5198 Down syndrome critical region gene 2 2.0 

107151 AW378065 Hs.8687 ESTs W 

107155 AW391927 Hs.7946 KIAA1288 protein 33.5

107174 BE122762 Hs.25338 ESTs 5.2

107197 W15477 Hs.64639 glioma pathogenesis-related protein 6.1

107221 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 (stathmin) 17.4

107243 BE219716 Hs.34727 ESTs, Moderately similar to I38759 zinc finger/leucine zipper protein [H.sapiens] 7.4

107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 complex subunit 1.8

107263 D60341 Hs.21198 translocase of outer mitochondrial membrane 70 (yeast) homolog A 6.6

107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 2.5

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL HYPOTHETICAL 63.5 KD PROTEIN ZK353.1 IN 2.5

107298 N95657 Hs.6820 ESTs, Moderately similar to YOJ1_CAEEL H 1.7

107299 BE277457 Hs.30661 hypothetical protein MGC4606 32 
107316 T63174 Hs.193700 Homo sapiens mRNA; cDNA DKFZp586l0324 (from clone DKFZp586l0324) 2.0

107354 NM_006299 Hs.96448 zinc finger protein 193 5.0

107392 AW299900 Hs.267632 TATA element modulatory factor 1 \2 

107481 AA307703 Hs.279766 Kinesin family member 4A 1.6
107529 BE515065 Hs.296585 nucleolar protein (KKE/D repeat) 3.0

107554 AA001386 Hs.59844" ESTs W 

107681 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATIONS 

107772 AA018587 Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 2.1 

107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed-rectifier, subfamily S, member 3 8.4

107901 L42612 Hs.335952 keratin 6B 2.5

107901 L42612 Hs.335952 keratin 6B 1.6

107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 2.2

107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 1 6.7

108040 AL121031 Hs.159971 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, member 1 15

108230 AA054224 Hs.59847 ESTs « 

108274 AF129535 Hs.272027 F-box only protein 5 7.1

108296 N31256 Hs.161623 ESTs 2.5

108496 AA083069 Hs.339659 ESTs U 

108607 BE300380 Hs.69476 Homo sapiens cDNA FLJ12758 fis, clone NT2RP2001328 3.4

108621 AA101809 Hs.182685 ESTs 1.6

108634 AW022410 Hs.69507 ESTs 17 

108647 BE546947 Hs.44276 homeo box C10 9.8

108695 AB029000 Hs.70823 KIAA1077 protein 7.2

108717 AA122393 Hs.70811 hypothetical protein FLJ20516 1.3

108740 AI089575 Hs.9071 progesterone membrane binding protein 2.7

108828 AK001693 Hs.273344 DKFZP564O0463 protein 1.8

108859 AL121500 Hs.178904 ESTs J«5 

108872 H06720 Hs.111680 endosulfine alpha 2.1

108891 AI801235 Hs.48480 ESTs 5.3

108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4.0

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system N2 (SN2) mRNA, complete cds 5.6

108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1 16

108987 AA152178 Hs.23467 hypothetical protein FLJ10633 6.2

109002 AB028987 Hs.72134 KIAA1064 protein 1.7

109011 AA156542 Hs.72127 ESTs ™ 

109026 AA157811 gb:zo35d07.s1 Stratagene colon (937204) Homo sapiens cDNA clone 3' similar to contains Alu repetitive 5

109068 AA164293 Hs.72545 ESTs 2.9

109101 AW608930 Hs.52184 hypothetical protein FLJ20618 1.6

109112 AW419196 Hs.257924 hypothetical protein FLJ13782 3.2

109124 AK000684 Hs.183887 hypothetical protein FLJ22104 1.7

109139 AJ132592 Hs.59757 zinc finger protein 281 2.6

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkinesin 6) 2.9

109198 BE566742 Hs.58169 highly expressed in cancer, rich in leucine heptad repeats 2.0

109213 NM_016603 Hs.82035 potential nuclear protein C5ORF5; GAP-like protein 5.3

109220 AW958181 Hs.189998 ESTs 5.7

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN) 5.3

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ domain-containing protein MCJ [H.sapiens] 1.4

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (from clone DKFZp586F1822) 2.9

109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein 1.3

109341 AA213506 Hs.115099 EST 25

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nuclear target (MINT) homolog 1.5

109420 H83603 Hs.40408 homeo box C9 2.2 

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subunit 6 3.0 

109429 AI160029 Hs.61438 ESTs . ]* 

109445 AA232103 Hs.189915 ESTs 1* 

109450 AB032969 Hs.173042 KIAA1143 protein 3.7

109468 NM_015310 Hs.6763 KIAA0942 protein 35

109478 AW074143 Hs.87134 ESTs 2.0

109570 L40027 Hs.118890 glycogen synthase kinase 3 alpha 2.1

109662 F02614 Hs.27319 ESTs M 

109825 R71264 Hs.1679B ESTs 1 ^ 

110039 H11938 Hs.21907 histone acetyltransferase 2.0

110056 AA503041 Hs.279009 matrix Gla protein 2.5

110085 AA603840 Hs.29956 KIAA0460 protein 1.7

110110 T07353 Hs.7948 ESTs 2.9

110129 R51853 Hs.226429 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 1.7

110154 NM_014521 Hs.17667 SH3-domain binding protein 4 4.2
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOCHROME P450 4A11 PRECURSOR [H.sapiens] 4.2

110242 N41744 Hs.19978 CGI-30 protein 1.3

110259 H28428 Hs.32406 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 22

110312 BE256986 Hs.11896 hypothetical protein FLJ12089 2.1
110330 AI288666 Hs.16621 DKFZP434I116 protein 6.2

110501 H55748 gb:yq94a01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:203400 3' 6.1

110504 H55915 Hs.210859 hypothetical protein FLJ11016 6.1

110525 H57330 Hs.37430 EST 6.3

110568 AK001160 Hs.5999 hypothetical protein FLJ10298 1.8

110699 T97586 Hs.18090 ESTs 1.8 

110705 AB007902 Hs.32168 KIAA0442 protein 1.6 

110742 AW190338 Hs.28029 hypothetical protein MGC11256 7.6

110761 AL138077 Hs.16157 hypothetical protein FLJ12707 2.5

110762 BE044245 Hs.30011 hypothetical protein MGC2963 9.3

110765 AK000322 Hs.18457 hypothetical protein FLJ20316 5.5

110769 BE000831 Hs.23837 Homo sapiens cDNA FLJ11812 fis, clone HEMBA1006364 2.1

110799 AI089660 Hs.323401 dpy-30-like protein 1.5

110805 T25829 Hs.24048 FK506 binding protein precursor 6.6

110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 5.7

110820 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 3.4

110840 N31598 Hs.12727 hypothetical protein FLJ21610 1.7

110844 AI740792 Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2 (beta) 1.7

110854 BE612992 Hs.27931 hypothetical protein FLJ10607 similar to glucosamine-phosphate N-acetyltransferase 4.7

110856 AA992380 gb:ot37g06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3' similar to contains element 23

110885 BE384447 Hs.16034 hypothetical protein MGC13186 35

110897 AL117430 Hs.6880 DKFZP434D156 protein 22

110915 BE092285 Hs.29724 hypothetical protein FLJ13187 2.6

110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expression in cancer [H.sapiens] 1.9

110958 NM_005664 Hs.24587 signal transduction protein (SH3 containing) 6.7

110963 AK002180 Hs.11449 DKFZP564O123 protein 2.0

110981 AK001980 Hs.24284 ADP-ribosyltransferase (NAD^ poty(ADP-ribose) polymerase)-lite 2 U 

110984 AW61 3287 Hs.801 20 UDP-N^cetyMprta-D^alartosamme;polyper^^ 118 

111125 N63823 Hs.269115 ESTs, Moderately similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 3.6

111132 AB037807 Hs.83293 hypothetical protein 2.1

111164 N46180 Hs.122489 Homo sapiens cDNA FLJ13289 fis, clone OVARC1001170 22

111172 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321 3.7

111174 AL050166 Hs.26295 Homo sapiens mRNA; cDNA DKFZp586D1122 (from clone DKFZp586D1122) 7.5

111179 AK000136 Hs.10760 asporin (LRR class 1) 7.1

111184 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HEP08257 6.7

111184 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.3

111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse transcriptase homolog [H.sapiens] 3.6

111216 AW139408 Hs.152940 ESTs 1.5

111221 AB037782 Hs.15119 KIAA1361 protein 2.6

111223 AA852773 Hs.334838 KIAA1866 protein 4.6

111239 N90956 Hs.17230 hypothetical protein FLJ22087 7.9

111285 AA778711 Hs.4310 eukaryotic translation initiation factor 1A 6.9

111299 AB033091 Hs.74313 KIAA1265 protein 5.0

111312 AI523913 Hs.34504 ESTs 3.8

111318 T99755 Hs.334728 ESTs 1.2

111337 AA837396 Hs.263925 LIS1-interacting protein NUDE1, rat homolog 5.1

111352 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PLACE1004114 22 

111370 AI478658 Hs.94631 brefeldin A-inhibited guanine nucleotide-exchange protein 1 2.8

111384 N94606 Hs.288969 HSCARG protein 2.2

111389 AK000987 Hs.169111 oxidation resistance 1 2.1

111391 NM_003896 Hs.225939 sialyltransferase 9 (CMP-NeuAc:lactosylceramide alpha-2,3-sialyltransferase; GM3 synthase) 5.1

111392 W46342 Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA, partial cds 8.4
111452 R02354 Hs.15999 ESTs 2.7
111486 AI051194 Hs.227978 EST 6.5

111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF1_HUMAN ZUOTIN RELATED FACTOR-1 (M-PHASE 1.4

111585 R10720 Hs.20670 EST 1.6

111627 R52656 Hs.21691 ESTs 1.6

111670 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein, partial cds 2.4

111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (from clone DKFZp564D016) 10.6

111944 AW083791 Hs.21263 suppressor of potassium transport defect 3 6.6

111987 NM_015310 Hs.6763 KIAA0942 protein 5.1

112134 R41823 Hs.7413 ESTs; calsyntenin-2 2.8

112244 AB029000 Hs.70823 KIAA1077 protein 14.6

112388 R46071 Hs.301693 Homo sapiens, clone IMAGE:3638994, mRNA, partial cds 9.0

112456 NM_016248 Hs.232076 A kinase (PRKA) anchor protein 11 1.4

112464 AW007287 Hs.28538 Homo sapiens cDNA: FLJ21086 fis, clone CAS03272 1.4

112506 AI742756 Hs.26079 ESTs 3.2
112513 R68425 Hs.13809 hypothetical protein FLJ10648 2.0

112752 AK001635 Hs.14838 hypothetical protein FLJ10773 1.8

112884 AK000004 Hs.5013 Homo sapiens mRNA for FLJ00004 protein, partial cds 6.6

112923 T10258 Hs.5037 EST 1.5

112938 AW970826 Hs.6185 KIAA1557 protein 3.2

112958 R61388 Hs.6724 ESTs 6.0

112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding factor 1 6.4

112978 AK000272 Hs.7099 hypothetical protein FLJ20265 1.2

112995 AA737033 Hs.7155 ESTs, Moderately similar to 2116357A TYKl protein [M.musculus] 5.6

112996 BE276112 Hs.7165 zinc finger protein 259 2.0
113047 AI571940 Hs.7549 ESTs 1.9
113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein, partial cds 2.4
113089 T40707 Hs.270862 ESTs 1.3
113196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (937205) Homo sapiens cDNA clone IMAGE:74668 3', 1.7
113248 T63857 gb:yc16e01.s1 Stratagene lung (937210) Homo sapiens cDNA clone 3', mRNA sequence 2.8
113254 AK002180 Hs.11449 DKFZP564O123 protein 1.3
113277 AW971049 Hs.11774 protein (peptidyl-prolyl cis/trans isomerase) NIMA-interacting, 4 (parvulin) 3.2
113429 AA688021 Hs.179808 ESTs 1.2
113499 AI467908 Hs.8882 ESTs 5.9
113547 H59588 Hs.15233 ESTs 1 20
113554 AW503990 Hs.142442 HP1-BP74 3.6
113647 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MAMMA1000831 US
113702 T97307 gb:ye53h05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121497 3' 4.4
113722 AV653556 Hs.184411 albumin 1.3
113759 AW499665 Hs.9456 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 M
113777 BE266947 Hs.10590 zinc finger protein 313 13.4
113783 AL359588 Hs.7041 hypothetical protein DKFZp762B226 1.7
113791 AI269096 Hs.135578 chitobiase, di-N-acetyl- 1.3
113808 W44735 Hs.9286 Homo sapiens cDNA: FLJ21278 fis, clone COL01832 3.3
113811 BE207480 Hs.6994 Homo sapiens cDNA: FLJ22044 fis, clone HEP09141 3.1
113817 H13325 Hs.332795 hypothetical protein DKFZp761O17121 3.2
113826 AW378212 Hs.24809 hypothetical protein FLJ10826 2.3
113834 T26483 Hs.6059 EGF-containing fibulin-like extracellular matrix protein 2 11.3
113868 W57902 Hs.90744 proteasome (prosome, macropain) 26S subunit, non-ATPase, 11 2.7
113870 AL079314 Hs.16537 hypothetical protein, similar to (U06944) PRAJA1 6.1
113885 AW959486 Hs.21732 ESTs 6.6
113923 AW953484 Hs.3849 hypothetical protein FLJ22041 similar to FK506 binding proteins 1.9
113989 W87544 Hs.268828 ESTs 1 2
114022 AI539519 Hs.120969 Homo sapiens cDNA FLJ11562 fis, clone HEMBA1003197 5.4
114030 AI825386 Hs.164478 hypothetical protein FLJ21939 similar to 5-azacytidine induced gene 2 9.4
114060 AB029551 Hs.7910 RING1 and YY1 binding protein 1.8
114196 AF017445 Hs.150926 fucose-1-phosphate guanylyltransferase 1.5
114226 AB028968 Hs.7989 KIAA1045 protein 1.8
114253 BE149866 Hs.14831 Homo sapiens, Similar to zinc finger protein 136 (clone pHZ-20), clone MGC:10647, mRNA, complete cds
114262 AL117518 Hs.3686 KIAA0978 protein 1.4
114275 AW515443 Hs.306117 KIAA0306 protein 15.8
114292 AI815395 Hs.184641 fatty acid desaturase 2 1.9
114309 AA332453 Hs.20824 CGI-85 protein 2.4
114392 AA249590 Hs.100748 ESTs, Weakly similar to A28996 proline-rich protein M14 precursor - mouse [M.musculus] 1.8
114407 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (from clone DKFZp434B0425) M
114455 H37908 Hs.271616 ESTs, Weakly similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE 5.5

114463 AL120247 Hs.40109 KIAA0872 protein 5.2

114464 AI091713 Hs.106597 Homo sapiens, Similar to RIKEN cDNA 1110012M11 gene, clone IMAGE:3688605, mRNA, partial cds 15
114471 AA028074 Hs.104613 RP42 homolog 1.8
114480 BE066778 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acety^ 13.4
114671 AA766268 Hs.266273 hypothetical protein FLJ13346 1.9
114698 AA476966 Hs.110857 polymerase (RNA) III (DNA directed) polypeptide K (12.3 kDa) 3.5
114730 AI373544 Hs.331328 intermediate filament protein syncoilin 3.8
114767 AI859865 Hs.154443 minichromosome maintenance deficient (S. cerevisiae) 4 1.6
114774 AV656017 Hs.184325 CGI-76 protein 3.1
114798 AA159181 Hs.54900 serologically defined colon cancer antigen 1 3.5
114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.3

114895 AA236177 Hs.76591 KIAA0887 protein 7.1

114896 BE539101 Hs.5324 hypothetical protein 1.3
114911 AA236672 gb:zt29f02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:723771 3', mRNA sequence.
114930 AA237022 Hs.188717 ESTs 2.0
114938 AA242834 Hs.58384 ESTs 2.9
114965 AI733881 Hs.72472 BMP-R1B 2.3

115023 AF102546 Hs.63931 dachshund (Drosophila) homolog 1.3

115038 AA252360 Hs.87968 toll-like receptor 9 1.6

115081 AI751438 Hs.41271 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 1913076 11.8

5 115082 AA253314 Hs.154103 UM protein (similar to rat protein Wna 1-5 

115117 AI670847 Hs.5324 hypothetical protein .. . 16 

115121 AI634549 Hs.88155 ESTs J» 

115206 AW183695 Hs.186572 ESTs f! 

115221 AW365434 Hs.79741 hypothetical protein FU 101 16 1-J 

10 115239 BE251328 Hs.73291 hypothetical protein FU 10881 W 

115242 AI368236 Hs.283732 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 1.4

115278 AK002163 Hs.301724 hypothetical protein FLJ11301 1.5

115285 AW972872.HS.293736 ESTs J* 

115291 BE545072 Hs.1 22579 hypothetical protein FU10461 • 

15 115400 AI215069 Hs.89113 ESTs W 

115468 AA314349 Hs.48499 tumor antigen SLP-8p 7A 

115471 AK001376 Hs.59346 hypothetical protein FU10514 ! 

115479 AW301608 Hs.278188 ESTs, Moderately similar to I54374 gene NF2 protein [H.sapiens] 4.0

115496 AW247593 Hs.71819 eukaryotic translation initiation factor 4E binding protein 1 16.3

115500 Y14443 Hs.88219 zinc finger protein 200

115553 AJ275986 Hs.71414 transcription factor (SMIF gene) 2.5

115581 AI540842 Hs.61082 ESTs J J 

115587 BE081342 Hs.283037 HSPC039 protein 2.9

115590 AA399477 Hs.67896 7-60 protein 5.3

115646 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member 10 4.7

115652 BE093589 Hs.38176 hypothetical protein FU23468 Jjw 

115655 AL048269 Hs.288544 Homo sapiens, clone MGC:16063, mRNA, complete cds 1Z.7. 

115663 AI138785 Hs.40507 ESTs 2.0 

115676 AA953006 Hs.88143 ESTs ■ 

30 115690 AA625132 Hs.44159 hypothetical protein FLJ21615 y 

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type receptor 3, flamingo (Drosophila) homolog 6.8

115715 BE395161 Hs.1390 proteasome (prosome, macropain) subunit beta type, 2 1.7 

115734 AI950339 Hs.40782 ESTs -5 

115811 NM_015434 Hs.48604 DKFZP434B168 protein 2.1

115823 AI732742 Hs.87440 ESTs 2.1

115837 AI675217 Hs.42761 ESTs \* 

115844 AI373062 Hs.332938 hypothetical protein MGC5370 JJ 

115866 AW062629 Hs.52081 KIAA0867 protein 7.2 

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13 1.2

40 115941 AI867451 Hs.46679 hypothetical protein FU20739 5'o 

115968 AB037753 Hs.62767 KIAA1332 protein JJ. 

116003 BE275469 Hs.66493 Down syndrome critical region gene 5 i- 4 

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2005735 2.4 

116108 AA770688 Hs.28777 H2Ahistone family, member L 1-{j 

45 116134 BE243834 Hs.50441 CGI.04 protein ]•« 

116189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypothetical protein KIAA0684 [H.sapiens] 1.2

116195 AW821113 Hs.72402 ESTs \\ 

116238 AV660717 Hs.47144 DKFZP586N0819 protein 

116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 J-J 

50 116262 AI936442 Hs;59838 hypothetical protein FU10808 y 

116298 AI955411 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PLACE1011133 1.9

116318 AF097645 Hs.58570 deleted in cancer 1; RNA helicase HDB/DICE1 4.9

116325 AI472106 Hs.49303 Homo sapiens cDNA FLJ11663 fis, clone HEMBA1004631 1.4

116336 AL133033 Hs.4084 KIAA1025 protein \\ 

55 116339 AK000290 Hs.44033 dipeptidyt peptidase 8 1 «j 

116350 AA497129 Hs.184771 nuclear factor I/C (CCAAT-binding transcription factor) 1.9

116358 AM49586 Hs.38125 interferon-induced protein 75, 52kD 1.9

116365 N50174 Hs.46765 ESTs , 7k 

116368 N90466 Hs.71109 KIAA1229 protein 1* 

116417 AW499664 Hs.12484 Human clone 23826 mRNA sequence 7.4

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57 2.1

116462 AF218313 Hs.236828 putative helicase RUVBL 1.5

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 2.1

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 1.2

116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory subunit, polypeptide 1 (p85 alpha) 1.5

116637 AK001043 Hs.92033 integrin-linked kinase-associated serine/threonine phosphatase 2C 2.7

116640 X89984 Hs.211563 B-cell CLL/lymphoma 7A 2.3
116700 AI800202 Hs.317589 hypothetical protein MGC10765 1.4

116706 AW074819 Hs.12313 hypothetical protein FLJ14566 3.4

116732 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 2.9

116921 AW068115 Hs.821 biglycan 8.3

5 116926 H73608 Hs.290830 ESTs ™ 

117034 U72209 Hs.180324 YY1-associated factor 2 3.4

117132 AI393666 Hs.42315 p10-binding protein 5.2

117247 N21032 gb:yx46f06.s1 Soares melanocyte 2NbHM Homo sapiens cDNA clone IMAGE:264803 3', mRNA sequence. 5.5

117276 N71183 Hs.121806 Homo sapiens cDNA FLJ11971 fis, clone HEMBB1001208 15

117284 AK001701 Hs.183779 Homo sapiens cDNA FLJ10590 fis, clone NT2RP2004392, weakly similar to MNN4 PROTEIN 2.0

117367 AI041793 Hs.42502 ESTs 2.0

117368 AI878942 Hs.90336 ATPase, H+ transporting, lysosomal (vacuolar proton pump), member J 2.1
117382 AF150275 Hs.40173 ESTs 2.7
117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic acid transporters), member 6 1.4

117557 AF123050 Hs.44532 diubiquitin 3.4

117588 N34895 Hs.44648 ESTs 3.4

117745 BE294925 Hs.46680 CGI-12 protein 3.0

117754 AA121673 Hs.59757 zinc finger protein 281 1.9

117879 N54706 Hs.303025 chromosome 11 open reading frame 24 1.8

117881 AF161470 Hs.260622 butyrate-induced transcript 1 5.7

117904 BE540675 Hs.332938 hypothetical protein MGC5370 5.9

117911 AL137379 Hs.47125 hypothetical protein FLJ13912 1.7

117933 Y10518 Hs.116470 hypothetical protein FLJ20048 1.7

117983 AL110246 Hs.47367 KIAA1785 protein 5.4

118078 N54321 Hs.47790 EST 5.2

118301 AA453902 Hs.293264 ESTs 2.6 

118429 AA243332 Hs.74649 cytochrome c oxidase subunit Vic 2.5 

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 4.1 

118488 AJ277275 Hs.50102 rapa-2 (rapa gene) 1.2

118509 N22617 Hs.43228 Homo sapiens cDNA FLJ11835 fis, clone HEMBA1006595 15

118528 AI949952 Hs.49397 ESTs 7.4

118656 AI458020 Hs.293287 ESTs 2.5

118670 AA332845 Hs.152618 ESTs, Moderately similar to ZN91_HUMAN ZINC FINGER PROTEIN 91 [H.sapiens] 1.2

118698 AB033113 Hs.50187 KIAA1287 protein 2.1

118737 AA199686 gb:zq75g09.r1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647488 5' 5.2

118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 1.4

118984 AI688709 Hs.240722 ESTs, Moderately similar to ALU8_HUMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 3.6

118986 AF148713 Hs.125830 bladder cancer overexpressed protein 4.8

119206 W24781 Hs.293798 KIAA1710 protein 1.7

119235 AW453069 Hs.3657 activity-dependent neuroprotective protein 2.2

119235 AW453069 Hs.3657 activity-dependent neuroprotective prote 1.6

119265 BE539706 Hs.285363 ESTs 1.4

119279 N57568 Hs.48028 EST 25.1

119298 NM_001241 Hs.155478 cyclin T2 1.6

119338 AI417240 Hs.320836 ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 1.3

119349 T65004 Hs.163561 ESTs 8.4

119403 AL117554 Hs.119908 nucleolar protein NOP5/NOP58 6.7

119478 AI624342 Hs.170042 ESTs 2.4

119486 AI796730 Hs.55513 ESTs 2.1

119513 W37933 Empirically selected from AFFX single probeset 1.9

119601 AK000155 Hs.91684 Homo sapiens mRNA; cDNA DKFZp667l103 (from clone DKFZp667l103) 3.7

119602 AW675298 Hs.233694 hypothetical protein FLJ11350 3.0
119676 AA243837 Hs.57787 ESTs M 
119682 W61019 Hs.57811 ESTs 12 

119774 AB032977 Hs.6298 KIAA1151 protein 1.8

119780 NM_016625 Hs.191381 hypothetical protein 3.1

119789 BE393948 Hs.50915 kallikrein 5 (KLK5; KLK-L2; stratum corneum tryptic enzyme) 92

119805 AJ223810 Hs.43213 ESTs, Weakly similar to IEFS_HUMAN TRANSFORMATION-SENSITIVE PROTEIN IEF SSP 3.6

119818 AA130970 Hs.58382 hypothetical protein FLJ11101 25

119863 AA081218 Hs.58608 Homo sapiens cDNA FLJ14206 fis, clone NT2RP3003157 2.7

119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danlos syndrome type IV, autosomal dominant) 2.6

119966 AA703129 Hs.58963 ESTs 2.7

120132 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 1.2

120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7

120248 AI924294 Hs.173259 uncharacterized bone marrow protein BM033 1.2

120253 AA131376 Hs.326401 fibroblast growth factor 12B 385 

120269 AW131940 Hs.104030 ESTs 9.6
120274 AA177051 gb:nc02a02.s1 NCI_CGAP_Pr3 Homo sapiens cDNA clone IMAGE:194 similar to contains Alu 4.6

120280 AA190577 gb:zp52g02.s1 Stratagene HeLa cell s3 937216 Homo sapiens cDNA clone 3', mRNA sequence 2.0



120296 AW995911 Hs.299883' hypothetical protein FU23399 1J 

120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC FINGER PROTEIN 195 [H.sapiens] 15.2

120324 AA195517 Hs.191643 ESTs 5.5

120325 AA195651 Hs.104106 ESTs 6.4
120327 AK000292 Hs.278732 hypothetical protein FLJ20285 16.1
120336 N85785 Hs.181165 eukaryotic translation elongation factor 1 alpha 1 2.9
120342 AW450669 Hs.45068 hypothetical protein DKFZp434I143 5.7
120345 AA210722 Hs.104158 ESTs 4.5
120349 AW969481 Hs.55189 hypothetical protein 16.8
120352 R06859 Hs.193172 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 5.0
120356 AF000545 Hs.296433 putative purinergic receptor 28.1
120371 AA219305 Hs.104196 EST 12.4 

120382 AA228026 Hs.38774 ESTs 4.0

120383 AL109963 Hs.123122 FSH primary response (LRPR1, rat) homolog 1 9.7 
120386 AW969665 Hs.154848 hypothetical protein DKFZp434D0127 32.6 

120388 AA232874 Hs.104245 ESTs 3.1 

120389 AW967985 Hs.325572 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 21.7
120396 AA134006 Hs.79305 eukaryotic translation initiation factor 4E 12£ 
120404 AB023230 Hs.96427 KIAA1013 protein 72 
120418 AW966893 Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (from clone DKFZp586F1323) 11.4 
120423 AA236453 Hs.18978 Homo sapiens cDNA: FU22822 fis, clone KAIA3968 1.9 

120472 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence 19.4

120473 AA251973 Hs.269988 ESTs 5.4 
120484 AA253170 Hs.96473 EST 10.4 
120504 AA256837 gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:682387 3', mRNA sequence. 3.9
120509 BE047718 Hs.96545 ESTs 9.4
120520 AA258601 Hs.161731 EST 2.4
120535 BE350244 Hs.96547 ESTs 2.5
120551 AA279160 Hs.111407 Homo sapiens, clone IMAGE:3613029, mRNA, partial cds 5.2
120570 AA280679 Hs.271445 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 14.4
120582 BE244830 Hs.284228 ZNF135-like protein 10.2
120590 AW372799 Hs.125790 leucine-rich repeat-containing 2 2.1
120596 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase 7.5
120619 AW965339 Hs.111471 ESTs 2.5
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52.0
120639 AA286942 gb:zs56f05.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:701505 3' similar to contains Alu 2.4
120648 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA, partial cds 5.0 
120653 AW063659 Hs.191649 ESTs 22

120668 AW969638 Hs.112318 6.2 kd protein 22

120669 BE536739 Hs.109909 ESTs 19 

120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens cDNA clone 3' similar to contains PTR7.t1 PTR7 46.8 

120696 AI821539 Hs.97249 ESTs 2.5 
120713 AW449855 Hs.96557 Homo sapiens cDNA FU12727 fis, done NT2RP2000027 5.9 
120718 AA292747 Hs.97296 ESTs 2.9 
120750 AI191410 Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor [H.saplens] 7.0 
120774 AI608909 Hs.193985 ESTs 7.8 
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 protein 6.8
120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing 0 family repeat, 4.4
120938 AA386260 Hs.104632 EST 4.4 
120977 AA398155 Hs.97600 ESTs 4.4 

120984 BE262951 Hs.99052 ESTs 5.6 

120985 AI219896 Hs.97592 ESTs 1.2 
121011 AA398360 Hs.97608 EST 3.1 
121026 AI439713 Hs.165295 ESTs 3.5
121081 AA398721 Hs.186749 ESTs, Highly similar to I37550 mismatch repair protein MSH2 [H.sapiens] 5.4
121133 AA363307 Hs.97032 ESTs 3.7
121176 AL121523 Hs.97774 ESTs 1.7
121223 AI002110 Hs.97169 ESTs, Weakly similar to dJ667H12.2.1 [H.sapiens] 2.9
121320 AA403008 Hs.301927 C6.1A 1.9
121340 AW956981 Hs.97910 Homo sapiens cDNA FLJ13383 fis, clone PLACE1001024 3.5
121408 AA406137 Hs.98019 EST 6.0
121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 7.4
121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, complete cds 6.9 
121452 AW971063 Hs.292882 ESTs 1.8 
121455 H58306 Hs.15165 retinoic acid induced 14 10.5
121457 W07404 Hs.144502 hypothetical protein FLJ22055 3.4

121498 AA442224 Hs.97900 ESTs 14.4

121505 AA494172 Hs.194417 ESTs 13.1

121508 AA402515 Hs.97887 ESTs 28.0

121513 AA416653 Hs.181510 ESTs 6.2

121514 AA412112 gb:zt69b02.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:727563 3', mRNA sequence, 2.6
121549 AA412477 Hs.98142 EST 7.4
121558 AA412497 gb:zt95g12.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:730150 3' similar to contains 2.8
121577 AA411970 Hs.98096 EST 3.5
121581 AA416568 gb:zu05c10.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence 6.1
121589 AD001528 Hs.89718 spermine synthase 3.9
121594 AA626010 Hs.98247 ESTs 22
121622 AA416931 Hs.126085 ESTs 4.2
121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (from clone DKFZp434B1023) 7.8
121682 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PLACE1007743 20
121690 AV660305 Hs.110286 ESTs 4.7
121706 U55184 Hs.154145 hypothetical protein FLJ11585 12.7
121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ11953 fis, clone HEMBB1000883 8.1
121729 AI949597 Hs.98325 ESTs 1.8
121731 AA421041 Hs.180744 ESTs 4.0
121744 AA398784 Hs.97514 ESTs 7.1
121748 BE536911 Hs.234545 hypothetical protein NUF2R 19.5
121773 AB033022 Hs.158654 KIAA1196 protein 7.9

121775 AA421773 Hs.161008 ESTs 1.7

121776 AA292579 Hs.125133 hypothetical protein FLJ22501 6.6
21786 AI810774 Hs.98376 ESTs 10.5 , 
21832 AW340797 Hs.98434 ESTs 5.8 
21836 AA328348 Hs.218289 ESTs 3.8 
21839 AA425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein [H^aplens] . 5.0 
21842 AF027406 Hs. 104865 serine/mreonlne kinase 23 27 
21847 AA446628 Hs.2799 cartilage linking protein 1 23 
21871 AW972668 Hs.293044 ESTs 2.9 
21882 AA426376 Hs.98459 ESTs 5.0 
21911 AA427950 gb:zw50t02.s1 Soares_totaLfetus.Nb2HF8.9w Homo sapiens cDNA clone IMAGE:773499 3' 72 
21915 AA428179 Hs.223405 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H^aplens] 25 
21935 AA428647 Hs.98611 EST 23 
21983 AA298760 Hs.180191 hypothetical protein FU14904 3.4 
21985 A1862570 Hs.299214 Homo sapiens, clone IMAGE:2822295, mRNA, partial cds 11.4 
21995 AA210863 Hs.3532 nemo-like kinase 3.8 
21999 AA430211 Hs.98668 EST . 6.4 
22009 AW292763 Hs.160822 Homo sapiens cDNA: FU20863 fis, clone ADKA01804 22 
22013 AA431085 Hs.98706 ESTs 6.5 
22036 W92142 Hs.271963 ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 13.1 
22050 AI453076 Hs.166109 ELAV (embryonic lethal, abnormal vision, DrosophilaHike 2 9.1 
22060 AA431738 Hs.98750 EST 13.1 
22114 AW161023 Hs.104921 ESTs 1.5 
22188 AA398838 gb:zt80d01.r1 SoaresJestisJIHT Homo sapiens cDNA clone 5*, mRNA sequence 3.3 
22204 AA435936 Hs.98842 EST 5.6 
22246 AA329550 Hs.29417 HCF-binding transcription factor Zhangfei 5.1 
22257 AA436819 Hs.98899 ESTs 5.6 
22302 AA441801 Hs.104947 ESTs 5.8 
22341 AW601969 Hs.99010 hypothetical protein FLJ22263 similar to nuclear receptor-binding SET-domain protein 1 2.0
22356 AA443794 Hs.98390 ESTs 7.3
22369 AA443985 Hs.303222 ESTs 112

22371 AA868555 Hs.178222 ESTs 5.0 

22372 AA446008 Hs.336677 EST 7.6 
22378 AB032948 Hs.21356 hypothetical protein DKFZp762K2015 25
22405 AA446572 Hs.303223 EST 2.8 
22412 AA446869 Hs.119316 ESTs 7.3
22415 AA446918 Hs.99088 EST 1.9 
22418 AA446966 Hs.99090 ESTs, Moderately similar to similar to KIAA0766 [H.sapiens] 6.8
22440 AW505139 Hs.9460 Homo sapiens mRNA; cDNA DKFZp547C244 (from clone DKFZp547C244) 2.6 
22446 AA447603 Hs.99123 EST 1.8 
22448 AA447626 Hs.99127 EST 3.5 
22458 AI266159 Hs.104980 ESTs 1.5
22460 AW418788 Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 protein - Caenorhabditis elegans [C.elegans] 9.7 
22464 AA448158 Hs.99152 EST 4.8 
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AB040893 
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AL355841 
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AA453630 

AA453638 

AI681535

AA453641 
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AA456859 
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AA460584 
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AF005216 
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AW081394 
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AA470074 

AA470140 

AA478951 
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AW338067 

AL359571 

AI382600

AA485360
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AA486256 

BE304942 

T52027 

AI061582 

AW451999 
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AW601773 

AA490929 

AA496369 

AA504757 

AA731404 

AW450922 

AA599042 

AW303285 

AB021644 
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N95059 

BE019072 

AW380388 

AA609170 
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AA609364 

AI269609
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Hs.99195 

Hs.40368 

Hs.164589 

Hs.293858 

Hs.262907 
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Hs.318722 

Hs.97103 

Hs.128654

Hs.169896

Hs.229170 

Hs.105629 

Hs.17384 

Hs.323231 

Hs.44054 

Hs.104308 

Hs.105661 

Hs.105667 

Hs.193510 

Hs.265848 

Hs.271795 

Hs.324179 

Hs.194024 

Hs.105676

Hs.270259 

Hs.105274 
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Hs.1 12478 
Hs.112503
Hs.303632 
Hs.197219 
Hs.250528 
Hs.55098 
Hs.334802 
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EST 
ESTs 



Homo sapiens cDNA FLJ12082 fis, clone HEMBB1002492
ESTs 

adaptor-related protein complex 1, sigma 2 subunit
ESTs 
ESTs 
ESTs 
EST 

Homo sapiens cDNA FLJ11048 fis, clone PLACE1004516
KIAA1460 protein 
ESTs 

hypothetical protein FLJ23588
ESTs 
ESTs 
EST 
ESTs 

serine/threonine kinase 33 

gb:zx48e06.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence
ESTs 1 
ESTs 
ESTs 
ESTs 

Homo sapiens cDNA FLJ10658 fis, clone NT2RP2006052
ESTs 

ESTs, Weakly similar to putative p150 [H.sapiens]
ESTs 

NIMA (never in mitosis gene a)-related kinase 6 
Src-like-adapter
ESTs 
ESTs 

Janus kinase 2 (a protein tyrosine kinase) 
Homo sapiens cDNA: FLJ21766 fis, clone COLF7179
ESTs 
ESTs 
ESTs 
EST 
ESTs 
ESTs 

Homo sapiens cDNA FLJ11946 fis, clone HEMBB1000709
ninein (GSK3B interacting protein)
ESTs, Weakly similar to KIAA1395 protein [H.sapiens]
ESTs 
ESTs 
EST 

myomegalin

ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 
Homo sapiens cDNA FLJ12371 fis, clone MAMMA1002434
ESTs 
ESTs 
ESTs 

ESTs, Weakly similar to RMS1_HUMAN REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 [H.sapiens]
gb:zv37d10.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3' similar to 4.1
ESTs Jf 
ESTs 36 
ESTs 3.7 

EST 74 

Human DNA sequence from clone RP11-110H4 on chromosome 5 Contains a pseudogene similar to 3.5



6.1 
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I. 3 

II. 2 
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I. 9 
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II. 0 
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1.7 
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4.6 
615 
10.7 
107.3 
121.4 
31.1 
5.6 
6.5 
10.4 
81.8 
3.6 
4.5 
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75.3 
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1.3 
4.1 
5.3 
9.9 
5.3 
13.9 
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15.4 
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8.7 
8.8 
3.9 
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7.4 
2.8 
2.4 
15.6 
5.1 
23.8 
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zinc finger protein 14 (KOX 6) 
Homo sapiens, clone IMAGE:4098694, mRNA, partial cds
ESTs 

Homo sapiens cDNA FLJ14680 fis, clone NT2RP2004242, weakly similar to
KIAA1080 protein; Golgi-associated, gamma-adaptin ear containing, ARF-binding protein 2
gb:af12a12.s1 Soares_testis_NHT Homo sapiens cDNA clone 3', mRNA sequence
gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens cDNA clone, mRNA sequence
gb:zu71d09.s1 Soares_testis_NHT Homo sapiens cDNA clone IMAGE:743441 3' similar to contains Alu.
Hs.105187 kinesin protein 9 gene 5 - 7 
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123735 N&/L013241HS.95231 FH1/FH2 domain-containing protein .10.0 

123738 AA609891 Hs.112777 EST 52

123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6

123804 AA620464 Hs.261915 EST, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 2.1

123811 AA620586 gb:ae60g05.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone IMAGE:951320 3' 2.7

123951 AB012922 Hs.173043 metastasis-associated 1-like 1 6.2

123983 AJ272267 Hs.146178 choline dehydrogenase 4.4 

124001 L42542 Hs.75447 ralA binding protein 1 7.0

124006 AM47155 Hs.270016 ESTs 6.1 

124070 AI950314 Hs.154762 HIV-1 rev binding protein 2 3.7

124074 H05635 Hs^94030 topoisomerase-related function protein 4-2 1.2

124178 BE463721 Hs.97101 putative G protein-coupled receptor 3.1 

124203 AA372796 Hs^69339 ESTs, Weakly similar to AF161356 1 HSPC093 [H.sapiens] 5.7

124352 AA640891 Hs.102406 ESTs 3.1 

124375 D87454 Hs.192966 KIAA0265 protein 3.5 

124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 Homo sapiens cDNA clone similar to contains 57.1

124390 AA317338 Hs.7535 COBW-like protein 2.8

124391 AF155099 Hs.279780 NY-REN-18 antigen 7.1
124417 N34059 gb:yv28h09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3' similar to contains Alu 3.3
124428 H13540 Hs.82202 ribosomal protein L17 2.9 
124440 AA532519 Hs.129043 Human DNA sequence from clone 989H11 on chromosome 22q13.1-13.2 Contains part of a 7.8
124466 R10084 Hs.113319 kinesin heavy chain member 2 2.6
124482 N53935 gb:yv59d09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3', mRNA sequence 7.9 
124498 H79433 Hs.268997 ESTs 7.8
124515 AA669097 Hs.109370 ESTs 3.3 
124608 N71076 Hs.102800 ESTs, Weakly similar to neuronal thread protein AD7C-NTP [H.sapiens] 4.5
124631 NM_014053 Hs.270594 aVCR protein 3.2
124634 AI765123 Hs.143671 Homo sapiens cDNA FLJ13533 fis, clone PLACE1006371 5.8
124637 AA160474 Hs.75798 hypothetical protein 9.3 
124642 AW968856 Hs.278569 sorting nexin 17 3.5
124649 N92593 Hs.313054 ESTs 6.1 
124656 AW297702 Hs.102915 ESTs 8.3 
124661 R48170 Hs.78436 EphB1 5.6 
124683 AA381661 Hs.119878 ESTs, Weakly similar to M3K9_HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE 7.9 
124712 R09166 Hs.191148 ESTs 5.7 
124735 R22952 Hs.268685 ESTs 11.3 
124761 AA374756 Hs.93560 Homo sapiens mRNA for KIAA1771 protein, partial cds 9.0 
124768 AW368528 Hs.100855 ESTs 8.1 
124775 R41772 Hs.100878 ESTs 4.9 
124777 R41933 Hs.140237 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE 2.8
124788 R43543 Hs.100912 Homo sapiens cDNA: FLJ22726 fis, clone HSI15005 5.1 
124809 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full insert 4.2

124811 R46088 Hs.288912 hypothetical protein FLJ22604 145

124812 R47948 Hs.188732 ESTs 7.9 
124822 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PLACE1007743 6.6
124825 AA501669 Hs.336693 ESTs 23 
124833 AW975868 Hs.294100 ESTs 2.7 
124857 R63652 Hs.137190 ESTs 2.3 
124860 R65763 Hs.101477 EST 23.9 
124863 AI382555 Hs.127950 bromodomain-containing 1 2.0
124876 AF135422 Hs.27059 GDP-mannose pyrophosphorylase A 4.4
124878 BE397530 Hs.288057 hypothetical protein FLJ22242 2.7

124902 H37941 Hs.101883 ESTs 5.7 

124903 AW296713 Hs.221441 ESTs 32.4
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!! [H.sapiens] 22.8
124942 R99978 Hs.268892 ESTs, Moderately similar to B34087 hypothetical protein [H.sapiens] 6.1
124958 AJ078645 Hs.431 murine leukemia viral (bmi-1) oncogene homolog 1.9
124980 T40841 Hs.98681 ESTs 4.5 
125002 T59338 Hs.269463 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 4.9
125047 T79815 Hs.279793 ESTs 5.0 
125051 T79956 Hs.100588 EST 135.3
125056 T81310 Hs.100592 ESTs 5.4 
125101 AI472068 Hs.286236 KIAA1856 protein 5.6
125113 T96595 Hs.302270 ESTs, Weakly similar to ALUF_HUMAN !!!! ALU CLASS F WARNING ENTRY !!! [H.sapiens] 1.8
125115 T97341 gb:ye57e05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:121856 3' similar to
125125 AI222382 Hs.240767 Human DNA sequence from clone RP1-12G14 on chromosome 6q24.1-25.2 Contains the 5' end of the gene
125147 W38150 Empirically selected from AFFX single probeset 1.7 
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125161 W44657 Hs.144232 EST • 10.7 

125249 AA630863 Hs.131375 ESTs, Moderately similar to ALUB_HUMAN !!!! ALU CLASS B WARNING ENTRY !!! [H.sapiens] U



125255 AF098162 Hs.118631 timeless (Drosophila) homolog 

125279 AW401809 Hs.4779 KIAA1150 protein

125280 AI123705 Hs.106932 ESTs 

125298 AW972542 Hs.289008 Homo sapiens cDNA: FLJ21814 fis, clone HEP01068
125660 AW292171 Hs.23978 scaffold attachment factor B 
125827 NM_003403 Hs.97496 YY1 transcription factor
125891 U29589 Hs.7138 cholinergic receptor, muscarinic 3 
126005 AW409701 Hs.1578 baculoviral IAP repeat-containing 5 (survivin) 
126202 AA157632 Hs.272630 vacuolar proton pump delta polypeptide 
126695 AA643322 Hs.172028 a disintegrin and metalloproteinase domain 10
127050 AW411066 Hs.274351 CGI-89 protein 

127274 AW966158 Hs.58582 Homo sapiens cDNA FLJ12789 fis, clone NT2RP2001947
128355 AW293012 Hs.161623 ESTs 
128493 D87466 Hs.240112 KIAA0276 protein 
128493 D87466 Hs.240112 KIAA0276 protein 
128522 BE173977 Hs.10098 putative nucleolar RNA helicase

128527 AA504583 Hs.101047 transcription factor 3 (E2A immunoglobulin enhancer binding factors E12/E47)

128528 R39234 Hs.251699 ESTs, Weakly similar to IDN4-GGTR14 [H.sapiens] 
128595 U31875 Hs.272499 short-chain alcohol dehydrogenase family member
128599 NM_015366 Hs.102336 Rho GTPase activating protein 8
128604 AI879099 Hs.102397 GIOT-3 for gonadotropin inducible transcription repressor
128608 BE267994 Hs.102419 zinc finger protein 
128625 AB037841 Hs.102652 hypothetical protein ASH1 
128629 AL096748 Hs.102708 DKFZP434A043 protein 
128639 AW582962 Hs.102897 CGI-47 protein 
128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilon
128656 AA458542 Hs.10326 coatomer protein complex, subunit epsilo
128658 BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis (Saccharomyces)-like 2 2.4
128670 AA975486 Hs.103441 Homo sapiens, Similar to RIKEN cDNA 1700010L19 gene, clone MGC:16214, mRNA, complete cds 7.1



9.4 
1.5 
8.0 
15 
5.9 
1.2 
6.4 
14.3 
2.4 
9.1 
17.0 
128 
7.3 
3.1 
1.3 
9.4 
15 
U 
12.1 
2.3 
1.3 
7.1 
U 
3.2 
ZQ 
1.4 
U 



128691 W27939 Hs.103834 hypothetical protein MGC5576 

128696 BE081143 Hs.225977 nuclear receptor coactivator 3

128700 Y15221 Hs.103982 small inducible cytokine subfamily B (Cys-X-Cys), member 11

128714 T85231 Hs.179661 tubulin, beta 5 

128717 AK001564 Hs.104222 hypothetical protein FLJ10702

128733 BE147740 Hs.104558 ESTs, Moderately similar to I38022 hypothetical protein [H.sapiens]

128737 AF292100 Hs.104613 RP42 homolog 

128742 AA307211 Hs.251531 proteasome (prosome, macropain) subunit, alpha type, 4

128746 AI470163 Hs.323342 actin related protein 2/3 complex, subunit 4 (20 kD)

128747 AB027249 Hs.104741 PDZ-binding kinase; T-cell originated protein kinase
128772 BE302796 Hs.105097 thymidine kinase 1, soluble 

128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypeptide F 

128797 NM_002975 Hs.105927 stem cell growth factor; lymphocyte secreted C-type lectin

128806 AW630942 Hs.106061 RD RNA-binding protein 

128814 AW248431 Hs.256526 nuclear prelamin A recognition factor 

128830 BE281170 Hs.106357 valosin-containing protein

128835 AK001731 Hs.106390 Homo sapiens mRNA; cDNA DKFZp586H0924 (from clone DKFZp586H0924) 

128854 BE159181 Hs.168232 hypothetical protein FLJ13855

128854 BE159181 Hs.168232 hypothetical protein FLJ13855 

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3

128868 AA419008 Hs.106730 chromosome 22 open reading frame 3 

128871 AF189723 Hs.106778 ATPase, Ca++ transporting, type 2C, member 1 

128891 F34856 Hs.292457 Homo sapiens, clone MGC:16362, mRNA, complete cds 

128906 R57988 Hs.10706 epithelial protein lost in neoplasm beta

128920 AA622037 Hs.166468 programmed cell death 5

128925 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT2RP2004321

128946 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine hydroxylase)

128949 AA009647 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha) (ADAM-12)

128958 AW139032 Hs.107376 hypothetical protein DKFZp434N035 

128959 AI580127 Hs.107381 hypothetical protein FLJ11200
128965 AW150697 Hs.107418 ESTs 

128970 AI375672 Hs.165028 ESTs 

128975 BE560779 Hs.284233 NICE-6 protein 

128979 AW271217 Hs.281434 Homo sapiens cDNA FLJ14028 fis, clone HEMBA1003838

128995 AI816224 Hs.107747 DKFZP566C243 protein

129019 AI950087 gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapiens cDNA clone 3', mRNA sequence

128 



7.7 
3.8 
1.6 
7.6 
5.5 
2.7 
2.8 
4.4 
22 
2.8 
5.3 
53.9 
13.3 
2.6 
2.2
5.9 
1.6 
22 
1.9 
3.0 
22 
1.5 
13.3 
4.7 
1.4 
1.9 
72 
2.4 
1.3 

1.4 

1.3 

14.0 

1.6 

1.9 

2.9 






129021 AL044675 Hs.173081 KIAA0530 protein 
129021 AL044675 Hs.173081 KIAA0530 protein 
129032 R80088 Hs.108104 ubiquitin-conjugating enzyme E2L 3 

129076 AW296806 Hs.326234 ESTs, Highly similar to T46422 hypothetical protein DKFZp434M2023.1 [H.sapiens]
129078 AI351010 Hs.102267 lysosomal 
129088 AA744610 Hs.194431 palladin

129095 L12350 Hs.108623 thrombospondin 2 

129096 AA463189 Hs.288906 WW Domain-Containing Gene 

129097 BE243933 Hs.108642 zinc finger protein 22 (KOX 15) 

129099 AF146074 Hs.108660 ATP-binding cassette, sub-family C (CFTR/MRP), member 5

129136 W93048 Hs.250723 hypothetical protein MGC2747 

129149 AA356620 Hs.108947 KIAA0050 gene product

129172 AW162916 Hs.241576 hypothetical protein PRO2577

129192 AA286914 Hs.183299 ESTs 

129194 AA150797 Hs.109276 latexin protein

129198 N57532 Hs.109315 KIAA1415 protein 

129207 AI934365 Hs.109439 osteoglycin (osteoinductive factor, mimecan)

129228 U40714 Hs.239307 tyrosyl-tRNA synthetase 

129229 AF013758 Hs.109643 polyadenylate binding protein-interacting protein 1 

129254 AA252468 Hs.1098 DKFZp434J1813 protein 

129255 AI961727 Hs.109Q04 H1 histone family, member X

129288 W26392 Hs.110080 ESTs, Weakly similar to S13495 pregnancy zone protein [H.sapiens]
129296 AI051967 Hs.110122 ESTs 

129323 AA287239 Hs.5518 Homo sapiens cDNA FLJ11311 fis, clone PLACE1010102
129340 H75334 Hs.11050 F-box only protein 9 

129347 BE614192 Hs.279869 melanoma-associated antigen recognised by cytotoxic T lymphocytes 
129362 U30246 Hs.110736 solute carrier family 12 (sodium/potassium/chloride transporters), member 2
129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 
129370 AI686379 Hs.110796 SAR1 protein 
129372 NM_016039 Hs.110803 CGI-99 protein

129403 AF149785 Hs.111126 pituitary tumor-transforming 1 interacting protein

129404 AI267700 Hs.317584 ESTs 
129404 AI267700 Hs.317584 ESTs 

129423 AA204686 Hs.234149 hypothetical protein FLJ20647

129449 AI096988 Hs.111554 ADP-ribosylation factor-like 7

129453 AW974265 Hs.111632 Lsm3 protein

129482 AA188185 Hs.289043 spindlin

129482 AA188185 Hs.289043 spindlin

129513 AW843633 Hs.306163 hypothetical protein AL110115

129515 AF255303 Hs.112227 membrane-associated nucleic acid binding protein

129527 AA769221 Hs.270847 delta-tubulin 

129559 W01296 Hs.11360 hypothetical protein FLJ14784

129560 AA317841 Hs.7845 hypothetical protein MGC2752 
129570 AJ923097 Hs.11441 chromosome 1 open reading frame 8
129575 F08282 Hs.278428 progestin induced protein 

129587 H14718 Hs.11506 Human clone 23589 mRNA sequence

129588 BE408300 Hs.301862 postmeiotic segregation increased 2-like 9
129591 N57423 Hs.179898 HSPC055 protein 

129594 AW403724 Hs.36989 coagulation factor VII (serum prothrombin conversion accelerator) 
129596 AF035537 Hs.115521 REV3 (yeast homolog)-like, catalytic subunit of DNA polymerase zeta
129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (melanoma, p16, inhibits CDK4)

129628 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 

129629 AK000398 Hs.11747 hypothetical protein FLJ20391
129649 AD000092 Hs.16488 calreticulin 

129675 NM_015556 Hs.172180 KIAA0440 protein

129680 U03749 gb:Human chromogranin A (CHGA) gene, promoter an

129689 AW748482 Hs.77873 B7 homolog 3

129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens]

129720 AA156214 Hs.12152 APMCF1 protein 

129721 NM_001415 Hs.211539 eukaryotic translation initiation factor 2, subunit 3 (gamma, 52kD)
129726 H15474 Hs.132898 fatty acid desaturase 1

129778 AK001676 Hs.12457 hypothetical protein FLJ10814

129779 AA394090 Hs.12460 Homo sapiens clone 23870 mRNA sequence 
129800 AF052112 Hs.12540 lysosomal 

129806 AB023148 Hs.173373 KIAA0931 protein 

129815 BE565817 Hs.26498 hypothetical protein FLJ21657 

129840 NM_006590 Hs.12820 SnRNP assembly defective 1 homolog
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129861 AL049999 Hs.85963 DKFZP564M1 82 protein 2.2 

129864 AI393237 Hs.129914 runt-related transcription factor 1 (acute myeloid leukemia 1; aml1 oncogene) 1.7

129869 AI222069 Hs.13015 hypothetical protein similar to mouse DnajH 2.7 

129922 AF042379 Hs.13386 gamma-tubulin complex protein 2 4.5

129945 BE514376 Hs.165998 PAM mRNA-binding protein 1* 

129953 AA412195 Hs.13740 ESTs 2.5 

129972 AW753185 Hs.180628 dynamin 1-like 1.8

129983 U09848 Hs.132390 zinc finger protein 36 (KOX 18) 1.3

129989 AB015856 Hs.247433 activating transcription factor 6 4.0

130010 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 1.6 

130081 AA287325 Hs.14713 ESTs 4.0 

130082 S73265 Hs.1473 gastrin-releasing peptide 1.8
130097 AL046962 Hs.14845 forkhead box O3A 2.8
130100 AL135561 Hs.14891 hypothetical protein FLJ21047 2.3

130111 X53002 Hs.149846 integrin, beta 5 2.3 

130112 AA916785 Hs.180610 splicing factor proline/glutamine rich (polypyrimidine tract-binding protein-associated) 3.0
130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( 2.1
130128 L76937 Hs.150477 Werner syndrome 1.8 
130135 AA311426 Hs.21635 tubulin, gamma 1 6.1

130211 NM_003358 Hs.23703 ESTs, Moderately similar to CEGT_HUMAN CERAMIDE GLUCOSYLTRANSFERASE [H.sapiens] 1.6

130212 D80001 Hs.152629 KIAA0179 protein 1.3
130236 R85367 Hs.51957 splicing factor, arginine/serine-rich 2, interacting protein 2.0

130241 AL035588 Hs.153203 MyoD family inhibitor 3.2 

130242 X79201 Hs.153221 synovial sarcoma, translocated to X chromosome 5.4 
130249 D81983 Hs.322852 GAS2-related on chromosome 22 4.8 
130263 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related kinase 2 1.4
130287 AA479005 Hs.154036 tumor suppressing subtransferable candidate 3 2.6 
130310 AB011121 Hs.154248 amyotrophic lateral sclerosis 2 (juvenile) chromosome region, candidate 3 6.3
130353 Z19084 Hs.172210 MUF1 protein 6.2

130356 AF127577 Hs.155017 nuclear receptor interacting protein 1 2.4

130357 AJ224442 Hs.155020 putative methyltransferase 3.4 
130359 NM_013449 Hs.277401 bromodomain adjacent to zinc finger domain, 2A 8.5
130367 AL135301 Hs.8768 hypothetical protein FLJ10849 1.4
130372 AI077464 Hs.5011 RNA binding motif protein 9 3.3 
130393 N89487 Hs.155291 KIAA0005 gene product 1.8 
130399 AW374106 Hs.155356 hypothetical protein MGC2840 similar to a putative glucosyltransferase 3.4 
130407 BE385099 Hs.334727 hypothetical protein MGC3017 2.3 
130409 NM_001197 Hs.155419 BCL2-interacting killer (apoptosis-inducing) 2.7
130419 AF037448 Hs.155489 NS1-associated protein 1 1.8
130441 U63630 Hs.155637 protein kinase, DNA-activated, catalytic polypeptide 2.3
130448 BE513202 Hs.15589 PPAR binding protein 3.9 
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acetyltransferase) 33.6
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 4.6
130471 AL121438 Hs.183706 adducin 1 (alpha) 2.7
130485 BE245851 Hs.180779 H2B histone family, member B 5.0 
130487 U49844 Hs.77613 ataxia telangiectasia and Rad3 related 4.3 
130498 L38951 Hs.180446 karyopherin (importin) beta 1 1.6
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pseudoachondroplasia, epiphyseal dysplasia 1, multiple) 6.1
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 5.3
130526 AW876523 Hs.15929 hypothetical protein FLJ12910 2.1
130542 U64675 Hs.179825 RAN binding protein 2-like 1 7.8
130544 AA321238 Hs.4310 eukaryotic translation initiation factor 1A 1.5
130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4
130556 AI907018 Hs.15977 Empirically selected from AFFX single probeset 4.7 

130567 AA383092 Hs.1608 replication protein A3 (14kD) 73 

130568 AA232119 Hs.16085 putative G-protein coupled receptor 33 
130574 AF083208 Hs.16178 apoptosis antagonizing transcription factor 1.2 
130586 AB007891 Hs.16349 KIAA0431 protein 5.6 
130598 AL042210 Hs.16493 hypothetical protein DKFZp762N2316; KIAA1803 protein 1.4
130601 AA609738 Hs.16525 ESTs 1.5 
130614 AI354355 Hs.16697 down-regulator of transcription 1, TBP-binding (negative cofactor 2) 13 
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transaminase 1 12.1 

130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 2.4

130618 AA383439 Hs.16758 Spir-1 protein 15.9 
130667 BE246961 Hs.17639 Homo sapiens ubiquitin protein ligase (UBE3B) mRNA, partial cds 13.9 
130674 AL048842 Hs.194019 attractin 1.5 
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130675 AA442233 Hs.17731 hypothetical protein FU12892 5.4 

130692 AA652501 Hs.13561 hypothetical protein MGC4692 5.0

130693 R68537 Hs.17962 ESTs 10 
130712 AJ271881 Hs.279762 bromodomain-containing 7 1.8

130714 AI348274 Hs.18212 DNA segment on chromosome X (unique) 9879 expressed sequence 2.0

130730 AB007920 Hs.18586 KIAA0451 gene product 3.7

130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevisiae) homolog 3.1

130751 AF052105 Hs.18879 chromosome 12 open reading frame 1.4 

130757 AL036067 Hs.18925 protein x 0001 5.7 

130768 AF258627 Hs.211562 ATP-binding cassette, sub-family A (ABC1), member 1 5.1

130789 AK000355 Hs.8899 sirtuin (silent mating type information regulation 2, S.cerevisiae, homolog) 5 5.2

130815 AB018298 Hs.19822 SEC24 (S. cerevisiae) related gene family, member D 1.5 

130836 J05068 Hs.2012 transcobalamin I (vitamin B12 binding protein, R binder family) 15.7

130841 AL157468 Hs.325825 Homo sapiens cDNA FLJ20848 fis, clone ADKA01732 2.8

130843 AA447492 Hs.20183 ESTs, Weakly similar to AF1647931 protein x 013 [H.sapiens] 1.5

130844 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.4

130855 AJ243706 Hs.143323 putative DNA/chromatin binding motif 1.7

130861 NM_016578 Hs.20509 HBV pX associated protein-8 1.9

130879 NM_003416 Hs.2076 zinc finger protein 7 (KOX 4, clone HF.16) 14

130880 BE514434 Hs.20830 kinesin-like 2 2.1

130892 AL120837 Hs.20993 high-glucose-regulated protein 8 2.4 

130898 AB033078 Hs.186613 sphingosine-1-phosphate lyase 1 1.7

130911 BE409769 Hs.21189 DnaJ (Hsp40) homolog, subfamily A, member 2 1.8

130919 N79110 Hs.21276 collagen, type IV, alpha 3 (Goodpasture antigen) binding protein 2.3 

130944 BE382657 Hs.21486 signal transducer and activator of transcription 1, 91kD 5.4

130971 N39842 Hs.301444 KIAA1673 2.2 

130992 BE398091 Hs.74316 desmoplakin (DPI, DPII) 1.8

130993 T97401 Hs.21929 ESTs 1.6
131005 AV658308 Hs.2210 thyroid hormone receptor interactor 3 1.6 

131028 AI879165 Hs.2227 CCAAT/enhancer binding protein (C/EBP), gamma 1.2

131042 AI826288 Hs.171637 hypothetical protein MGC2628 1.6 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cys-X-Cys), member 10 7.4 

131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 3.0

131047 H23230 Hs.22481 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 1.7
131060 AA194422 Hs.22564 myosin VI 5.1

131060 AA194422 Hs.22564 myosin VI 2.5 

131070 N53344 HS.226Q7 ESTs 7.1 

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosamine) N-acetylglucosaminephosphotransferase 1 ( 2.0

131076 AA749230 Hs.26433 dolichyl-phosphate (UDP-N-acetylglucosam 1.9 

131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxidase assembly protein 7.0

131174 NM_006540 Hs.29131 nuclear receptor coactivator 2 1.9

131185 BE280074 Hs.23960 cyclin B1 5.8

131206 AW138839 Hs.24210 ESTs 2.0 

131213 AA885699 Hs.24332 CGI-26 protein 7.0 

131225 H62087 Hs.31659 thyroid hormone receptor-associated protein, 95-kD subunit 7.5

131231 N47468 Hs.59757 zinc finger protein 281 2.9 

131233 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 3 3.5

131243 AW383256 Hs.24752 spectrin SH3 domain binding protein 1 2.8 

131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8

131247 AL043100 Hs.326190 fatty acid amide hydrolase 5.6

131281 AA251716 Hs.25227 ESTs 5.7 

131283 X80038 Hs.339713 Homo sapiens clone F19374 APO E-C2 gene cluster U 

131305 AV656017 Hs.184325 CGI-76 protein 5.0 

131320 AA505691 Hs.145696 splicing factor (CC1.3) 1.8 

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6

131339 AF058696 Hs.25812 Nijmegen breakage syndrome 1 (nibrin) 2.6

131375 AW293165 Hs.143134 ESTs 5.4 

131390 BE269388 Hs.182698 mitochondrial ribosomal protein L20 5.3 

131410 BE259110 Hs.279836 HSPC166 protein 2.2 

131412 NM_012247 Hs.124027 SELENOPHOSPHATE SYNTHETASE; Human selenium donor protein 2.0

131429 AL046302 Hs.26750 hypothetical protein FLJ21908 1.4

131458 BE297567 Hs.27047 hypothetical protein FLJ20392 1.7

131475 AA992841 Hs.27263 KIAA1458 protein 2.0 

131501 AV661958 Hs.8207 GK001 protein 2.6 

131501 AV661958 Hs.8207 GK001 protein 1.6

131511 AA732153 Hs.27865 Homo sapiens cDNA: FLJ21333 fis, clone COL02535 ZO

131528 AU076408 Hs.28309 UDP-glucose dehydrogenase 1.6
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131532 BE268278 Hs.28393 hypothetical protein MGC2592 7.4 

131543 AW966881 Hs.41639 programmed cell death 2 2.2 

131544 AL355715 Hs.28555 programmed cell death 9 (PDCD9) 2.1 
131562 NM_003512 Hs.28777 H2A histone family, member L 1.7

131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PLACE1004405 5.1
131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 1.8

131569 AL389951 Hs.271623 nucleoporin 50kD 5.0

131618 BE393822 Hs.29645 Homo sapiens mRNA; cDNA DKFZp761C029 (from clone DKFZp761C029); partial cds 1.8

131622 R78195 Hs.29692 Homo sapiens cDNA FLJ11436 fis, clone HEMBA1001213 1.3
131623 AB037791 Hs.29716 hypothetical protein FLJ10980 2^

131623 AB037791 Hs.29716 hypothetical protein FLJ10980 1.9
131643 AW410601 Hs.30026 HSPC182 protein 2.9 
131653 AW960597 Hs.30164 ESTs W 
131656 AI218918 Hs.30209 KIAA0854 protein 2.8 

131669 X52486 Hs.3041 uracil-DNA glycosylase 2 2.8

131692 BE559681 Hs.30736 KIAA0124 protein 5.6

131714 AA642831 Hs.31016 putative DNA binding protein 2.9

131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransferase 3.4
131737 AK001641 Hs.31323 inhibitor of kappa light polypeptide gene enhancer in B-cells, kinase complex-associated protein 3.8 

131760 X76732 Hs.3164 nucleobindin 2 2.9

131760 X76732 Hs.3164 nucleobindin 2 2.8 

131763 AI878932 Hs.317 topoisomerase (DNA) I 3.4 

131772 AA382590 Hs.170980 KIAA0948 protein 255

131774 BE267158 Hs.169474 DKFZP586J0119 protein 5.5

131787 D87077 Hs.196275 KIAA0240 protein 2.4

131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT2RP2002439 7.9

131795 BE501849 Hs.32317 high-mobility group 20B 1.4 

131798 X86098 Hs.301449 adenovirus 5 E1A binding protein 4.1

131817 U20536 Hs.3280 caspase 6, apoptosis-related cysteine protease 4.2

131824 U28838 Hs.32935 TATA box binding protein (TBP)-associated factor, RNA polymerase III, GTF3B subunit 2 3.5

131850 AI251317 Hs.33184 ESTs 5.1 

131878 AA083764 Hs.6101 hypothetical protein MGC3178 5.8

131885 BE502341 Hs.3402 ESTs 13.7 

131885 BE502341 Hs.3402 ESTs 2.4 

131887 W17064 Hs.332848 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, member 1 3.2

131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, complete cds 8.7

131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 2.0 

131904 AF078866 Hs.284296 Homo sapiens cDNA: FLJ22993 fis, clone KAT11914 5.5

131905 AA179298 Hs.3439 stomaMke 2 11 - 3 
131913 AW207440 Hs.185973 degenerative spermatocyte (homolog Drosophila; lipid desaturase) 1.7

131916 AA025976 Hs.34569 ESTs 5.2 

131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (yeast APC11 homolog) 2.7

131929 BE541211 Hs.34804 Homo sapiens cDNA FLJ11472 fis, clone HEMBA1001711 5.3

131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3

131950 AA355113 Hs.35380 x 001 protein 15

131962 AK000046 Hs.267448 hypothetical protein FLJ20039 2.3

131965 W79283 Hs.35962 ESTs 1.4 
131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5
131977 U90441 Hs.3622 procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline 4-hydroxylase), alpha polypeptide II 6.5 

131985 AA503020 Hs.36563 hypothetical protein FLJ22418 2.4

131991 AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 2.1 

132019 H56995 Hs.37372 Homo sapiens DNA binding peptide mRNA, partial cds 3.2 

132031 AF193844 Hs.3758 COP9 complex subunit 7a 5.8 

132062 BE266155 Hs.3832 clathrin-associated protein AP47 1.5 

132084 NM_002267 Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.7

132103 BE171921 Hs.3991 ESTs 1.4 

132105 AV646076 Hs.39959 ESTs 5.8 

132116 AW960474 Hs.40289 ESTs 1.7

132176 AA857025 Hs.8878 kinesin-like 1 3.3 

132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7

132192 AA206153 Hs.4209 mitochondrial ribosomal protein L37 5.5 

132194 R42432 Hs.4212 ESTs 4.4

132203 NM_004782 Hs.194714 synaptosomal-associated protein, 29kD 2.2

132207 BE206939 Hs.42287 E2F transcription factor 6 22 

132235 AV658411 Hs.42656 KIAA1681 protein ™

132240 AB018324 Hs.42676 KIAA0781 protein 1.5

132252 AI566004 Hs.141269 Homo sapiens cDNA: FLJ21550 fis, clone COL06258 U.
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132266 AA301228 Hs.43299 hypothetical protein FU1 2890 . ■ 5.7 

132273 AA227710 Hs.43658 DKFZP586L151 protein 4.2

132276 AA653507 Hs.285711 hypothetical protein FLJ13089 2.1
132288 N36110 Hs.305971 solute carrier family 2 (facilitated glucose transporter), member .1.5

132294 AB023191 Hs.44131 KIAA0974 protein 10.0

132298 NM_015986 Hs.7120 cytokine receptor-like molecule 9 1.9

132299 AW405882 Hs.44205 cortistatin 9.2 
132325 N37065 Hs.44856 hypothetical protein FLJ12116 20
132348 AW06770B Hs.170311 heterogeneous nuclear ribonudeoproteln Wike 6.5 

132370 AW572805 Hs.46645 ESTs 3.8

132374 AF155582 Hs.46744 core1 UDP-galactose:N-acetylgalactosamine-alpha-R beta 1,3-galactosyltransferase 1.5

132376 AI279892 Hs.46801 sorting nexin 14 125 

132384 AA312135 Hs.46967 HSPC034 protein 283 
132393 AL135094 Hs.47334 hypothetical protein FLJ14495 1.9

132450 AA100012 Hs.48827 hypothetical protein FLJ12085 1.9

132452 AW973521 Hs.247324 mitochondrial ribosomal protein S14 6.1 

132456 AB011084 Hs.48924 KIAA0512 gene product; ALEX2 17 

132465 AW169847 Hs.49169 KIAA1634 protein 8.6

132470 AI224456 Hs.4934 H.sapiens polyA site DNA 5.2

132484 X16660 Hs.119007 RAB4, member RAS oncogene family 14

132518 AW885606 Hs.5064 ESTs 6.1

132528 T78736 Hs.50758 SMC4 (structural maintenance of chromosomes 4, yeast)-like 1 3.3

132530 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. cerevisiae)-like 1 2.0

132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 2.9

132534 BE388673 Hs.5086 hypothetical protein MGC10433 2.2

132543 BE568452 Hs.5101 protein regulator of cytokinesis 1 7.3

132571 AW674699 Hs.5169 suppressor of G2 allele of SKP1, S. cerevisiae, homolog of 17

132574 AW631437 Hs.5184 TH1 drosophila homolog 7.1 

132596 AK001484 Hs.5298 CGI-45 protein 2.2 

132611 AA345547 Hs.53263 hypothetical protein FLJ13287 2.2

132612 H12751 Hs.5327 PRO1914 protein 6.8

132616 BE262677 Hs.283558 hypothetical protein PRO1855 14.0

132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 9928 expressed sequence 114 

132648 U51127 Hs.54434 hypothetical protein MGC1715 18 

132668 AB018319 Hs.5460 KIAA0776 protein 2.6

132692 AW191962 Hs.249239 collagen, type VIII, alpha 2 2.0

132715 F11875 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT2RP2005645 15

132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribonucleoprotein autoantigen SS-A/Ro) 3.0

132724 AI142265 Hs.55498 geranylgeranyl diphosphate synthase 1 2.4

132731 AI189075 Hs.301872 hypothetical protein MGC4840 12.4

132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 14.6 

132760 AA125985 Hs.56145 thymosin, beta, identified in neuroblastoma cells 2.7

132771 Y10275 Hs.56407 phosphoserine phosphatase 3.0

132773 AA459713 Hs.295901 KIAA0493 protein 2.3

132784 AI142133 Hs.56845 GDP dissociation inhibitor 2 18

132798 AI026701 Hs.5716 KIAA0310 gene product 3.7 

132807 U07418 Hs.57301 mutL (E. coli) homolog 1 (colon cancer, nonpolyposis type 2) 18

132810 AB007944 Hs.5737 KIAA0475 gene product 5.9

132813 BE313625 Hs.57435 solute carrier family 11 (proton-coupled divalent metal ion transporters), member 2 8.7

132815 AI815189 Hs.57475 sex comb on midleg homolog 1 6.4

132817 N27852 Hs.57553 tousled-like kinase 2 3.6 

132821 AJ251595 Hs.169610 CD44 antigen (homing function and Indian blood group system) 2.8 

132833 U78525 Hs.57783 eukaryotic translation initiation factor 3, subunit 9 (eta, 116kD) 14.8 

132842 NM_016154 Hs.279771 Homo sapiens clone PP1596 unknown mRNA 16

132844 F12200 Hs.5811 chromosome 21 open reading frame 59 2.5

132851 U09716 Hs.287912 lectin, mannose-binding, 1 14 

132863 BE268048 Hs.236494 RAB10, member RAS oncogene family 4.2 
132869 AW963217 Hs.203961 ESTs, Moderately similar to AF116721 89 PRO2168 [H.sapiens] 2.8

132873 AW007683 Hs.58598 KIAA1266 protein 2.0

132875 NM_004850 Hs.58617 Rho-associated, coiled-coil containing protein kinase 2 16

132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary factor 1 (nonstandard symbol) 1.4 

132897 AW503667 Hs.59545 ring finger protein 15 5.4

132902 AI936442 Hs.59838 hypothetical protein FLJ10808 6.1

132912 AW732760 Hs.167578 Homo sapiens cDNA FLJ11095 fis, clone PLACE1005374 7.1

132913 W78714 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PLACE1009921 2.8

132940 T79136 Hs.127243 Homo sapiens mRNA for KIAA1724 protein, partial cds 6.1 

132941 AI817165 Hs.6120 hypothetical protein FLJ13222 10.3
132942 AA554458 Hs.197751 KIAA0666 protein 1.8

132952 AI658580 Hs.61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 12

132962 AA576635 Hs.6153 CGM8 protein 4.9 

132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HEMBA1000575 2.7

132973 AA035446 Hs.323277 ESTs 5.3
132977 AA093322 Hs.301404 RNA binding motif protein 3 3.2
132980 AA040696 Hs.62016 ESTs U
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 3.0
133012 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 10.3 

133015 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine 2.1

133016 AI439688 Hs.6289 hypothetical protein FLJ20886 1.3
133053 AI065016 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, complete cds 6.0
133062 AW500374 Hs.64056 PRO0149 protein 5.3
133069 BE247441 Hs.6430 protein with polyglutamine repeat; calcium (ca2+) homeostasis endoplasmic reticulum protein 4.9
133091 AK001628 Hs.64691 KIAA0483 protein 3.5
133110 AA808177 Hs.65228 ESTs 13.1 
133134 AF198620 Hs.65648 RNA binding motif protein 8A 1.3 
133145 H94227 Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds 2.2
133152 Z11695 Hs.324473 mitogen-activated protein kinase 1 13 

133174 AA431620 Hs.324178 hypothetical protein MGC2745 17.1 

133175 AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-rich protein MP4 - mouse [M.musculus] 1.8 
133177 X97795 Hs.66718 RAD54 (S.cerevisiae)-like 4.9
133197 AI275243 Hs.180201 hypothetical protein FLJ20671 3.1
133208 AI801777 Hs.6774 ESTs 44
133226 AW954569 Hs.296287 Homo sapiens, Similar to bromodomain-containing 4, clone IMAGE:3542455, mRNA, partial cds 1.7
133228 AI492924 Hs.6831 golgi phosphoprotein 1 6.0
133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1 1.5
133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, partial cds 1.4 
133266 AI160873 Hs.69233 zinc finger protein 5.6 
133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKHEAD BOX PROTEIN D2 [H.sapiens] 1.9
133285 M76477 Hs.289082 GM2 ganglioside activator protein 4.7
133291 BE297855 Hs.69855 NRAS-related gene 5.0
133314 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A receptor, pi 2.7
133321 T79526 Hs.179516 integral type I protein 9.3
133327 AL390127 Hs.7104 Kruppel-like factor 13 4.4
133347 BE257758 Hs.71475 acid cluster protein 33 1.8 
133360 AI016521 Hs.71816 v-akt murine thymoma viral oncogene homolog 1 5.5 

133366 AA292811 Hs.72050 non-metastatic cells 5, protein expressed in (nucleoside-diphosphate kinase) 2.7 

133367 AF231919 Hs.18759 KIAA0539 gene product 1.7 
133370 AF245505 Hs.72157 DKFZP564I1922 protein 18 
133383 BE313555 Hs.7252 KIAA1224 protein 1.7

133390 AI950382 Hs.72660 phosphatidylserine receptor 1.3 

133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB alpha polypeptide) 16.1
133394 AA305127 Hs.237225 hypothetical protein HT023 12.2
133437 AL031591 Hs.7370 phosphatidylinositol transfer protein, beta 10.4

133452 NM_002759 Hs.274382 protein kinase, interferon-inducible double stranded RNA dependent 1.2

133453 AI659306 Hs.73826 protein tyrosine phosphatase, non-receptor type 4 (megakaryocyte) 1.7 
133500 AW964804 Hs.74280 hypothetical protein FLJ22237 11.1
133529 W45623 Hs.74571 ADP-ribosylation factor 1 2.8
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subunit, non-ATPase, 2 2.9
133543 AU077073 Hs.108327 damage-specific DNA binding protein 1 (127kD) 2.5

133578 AU077050 Hs.75066 translin 1.5

133579 X75346 Hs.75074 mitogen-activated protein kinase-activated protein kinase 2 2.1 
133582 BE391579 Hs.75087 Fas-activated serine/threonine kinase 1.3

133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cerevisiae PWP1 2.2

133595 AA393273 Hs.75133 transcription factor 6-like 1 (mitochondrial transcription factor 1-like) 1.5
133599 NM_002885 Hs.75151 RAP1, GTPase activating protein 1 57
133621 NM_004893 Hs.75258 H2A histone family, member Y 255
133627 NM_002047 Hs.75280 glycyl-tRNA synthetase 15.8
133631 NM_000401 Hs.75334 exostoses (multiple) 2 3.3
133649 U25849 Hs.75393 acid phosphatase 1, soluble 1.6 
133690 AV661185 Hs.75574 mitochondrial ribosomal protein L19 4.1
133720 L27841 Hs.75737 pericentriolar material 1 1.5
133722 AW969976 Hs.279009 matrix Gla protein 6.3
133751 AW402048 Hs.334787 Homo sapiens, Similar to likely ortholog of yeast ARV1, clone IMAGE:3506392, mRNA 3.9
133757 T52946 Hs.196209 RAE1 (RNA export 1, S.pombe) homolog 1.7
133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal protein SA) 1.8 
133765 M62194 Hs.75929 cadherin 11, type 2, OB-cadherin (osteoblast) u

133780 AA557660 Hs.76152 decorin 3.5

133784 BE622743 Hs.301064 arfaptin 1 6.8

133791 M34338 Hs.76244 spermidine synthase 2.6

133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 \A

133822 D50525 Hs.699 peptidylprolyl isomerase B (cyclophilin B) 8.0

133842 AW797468 Hs.285013 putative human HLA class II associated protein I 135 

133845 AA147026 Hs.76704 ESTs 22 

133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 1.8

133859 U86782 Hs.178761 26S proteasome-associated pad1 homolog 2.0

133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 2.8

133867 AW340125 Hs.76989 KIAA0097 gene product 6.7

133868 AB012193 Hs.183874 cullin 4A 2.5
133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin) 3.0
133922 U30825 Hs.77608 splicing factor, arginine/serine-rich 9 1.4
133924 D86326 Hs.325948 vesicle docking protein p115 5.4
133929 NM_006306 Hs.211602 SMC1 (structural maintenance of chromosomes 1, yeast)-like 1 4.9
133936 L17128 Hs.77719 gamma-glutamyl carboxylase 3.7 
133941 BE244332 Hs.77770 adaptor-related protein complex 3, mu 2 subunit 12.1 
133959 X81769 Hs.77897 splicing factor 3a, subunit 3, 60kD 9.7 
133976 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor gene activator) 3.1
133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, actin dependent regulator of chromatin 1.3
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 9.7 
134010 AB016092 Hs.197114 RNA binding protein; AT-rich element binding factor 2.4 
134015 D31764 Hs.278569 sorting nexin 17 2.5 
134070 NM_003590 Hs.78946 cullin 3 1.3
134110 U41060 Hs.79136 LIV-1 protein, estrogen regulated 4.2
134129 NM_014742 Hs.79305 KIAA0255 gene product 2.2
134134 H86504 Hs.173328 protein phosphatase 2, regulatory subunit B (B56), epsilon isoform 5.0
134200 BE559598 Hs.197803 KIAA0160 protein 3.2
134206 AF107463 Hs.79988 splicing factor 30, survival of motor neuron-related 2.5 
134208 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 2.1
134219 NM_000402 Hs.80206 glucose-phosphate dehydrogenase 9.1
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, partial cds 2.8 
134275 AI878910 Hs.3688 cisplatin resistance-associated overexpressed protein 1.8
134292 AI906291 Hs.81234 immunoglobulin superfamily, member 3 2.0
134301 AW502505 Hs.81360 Homo sapiens cDNA: FLJ21927 fis, clone HEP04178, highly similar to HSU90909 2.5
134305 U61397 Hs.81424 ubiquitin-like 1 (sentrin) 2.8
134324 AB029023 Hs.179946 KIAA1100 protein 10.4
134326 AW903838 Hs.81800 chondroitin sulfate proteoglycan 2 (versican) 1.9
134329 N92036 Hs.81848 RAD21 (S. pombe) homolog 2.6
134337 NM_004922 Hs.81964 SEC24 (S. cerevisiae) related gene family, member C 2.3
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, oncostatin M receptor) 13.0
134367 AA339449 Hs.82285 phosphoribosylglycinamide formyltransferase, phosphoribosylglycinamide synthetase, 8.8
134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46 kD) 15
134379 AW362124 Hs.323193 hypothetical protein MGC3222 8.1 
134384 AI589941 Hs.8254 Homo sapiens, Similar to tumor differentially expressed 1, clone IMAGE:3639252, mRNA, partial cds 2.6
134391 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like repeat domains) 4.1
134395 AA456539 Hs.8262 lysosomal 17
134403 AA334551 Hs.82767 sperm specific antigen 2 2.6
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 1.3
134411 BE272095 Hs.167791 reticulocalbin 1, EF-hand calcium binding domain 3.2
134415 AI750762 Hs.82911 protein tyrosine phosphatase type IVA, member 2 1.9 
134421 AU077198 Hs.82985 collagen, type V, alpha 2 10.3 
134424 Z44190 Hs.83023 peroxisomal biogenesis factor 11 B 2.4 

134446 AA112036 Hs.83419 KIAA0252 protein 1.2

134447 M58603 Hs.83428 nuclear factor of kappa light polypeptide gene enhancer in B-cells 1 (p105) 1.6
134470 X54942 Hs.83758 CDC28 protein kinase 2 2.1
134480 NM_005000 Hs.83916 Empirically selected from AFFX single probeset 5.3
134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 23
134498 AW246273 Hs.84131 threonyl-tRNA synthetase 2.1
134513 AA425473 Hs.84429 KIAA0971 protein 3.8
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 2.4
134520 BE091005 Hs.74861 activated RNA polymerase II transcription cofactor 4 6.7
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.3 
134577 BE244323 Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 5.5 
134582 AA927177 Hs.86041 CGG triplet repeat binding protein 1 5.8 
134612 AW068223 Hs.171581 ubiquitin C-terminal hydrolase UCH37
134624 AF035119 Hs.8700 deleted in liver cancer 1
134632 X78520 Hs.174139 chloride channel 3
134654 AK001741 Hs.8739 hypothetical protein FLJ10879
134664 AA256106 Hs.87507 ESTs
134666 BE391929 Hs.8752 transmembrane protein 4
134687 U62317 Hs.88251
134692 NM_003474 Hs.8850 a disintegrin and metalloproteinase domain 12 (meltrin alpha)
134705 BE161887 Hs.88799 anaphase-promoting complex subunit 10
134714 Y14768 Hs.890 lysosomal
134719 M852985 Hs.89232 chromobox homolog 5 (Drosophila HP1 alpha)
134722 AF129536 Hs.284226 F-box only protein 6
134724 AF045239 Hs.321576 ring finger protein 22
134746 X07871 Hs.89476 CD2 antigen (p50), sheep red blood cell receptor
134751 AW630803 Hs.89497 lamin B1
134790 BE002798 Hs.287850 integral membrane protein 1
134806 AD001528 Hs.89718 spermine synthase
134834 AW451370 Hs.8991 adaptor-related protein complex 1, gamma 2 subunit
134850 AI701162 Hs.90207 hypothetical protein MGC11138
134853 BE268326 Hs.90280 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase
134859 D26488 Hs.90315 KIAA0007 protein
134880 AI879195 Hs.90606 15 kDa selenoprotein
134910 AA532963 Hs.9100 Homo sapiens cDNA FLJ13100 fis, clone NT2RP3002255
134925 AW885909 Hs.6975 PRO1073 protein
134955 AW401361 Hs.91773 protein phosphatase 2 (formerly 2A), catalytic subunit, alpha isoform
134971 AI097346 Hs.286049 phosphoserine aminotransferase
134975 R50333 Hs.92186 Leman coiled-coil protein
135011 AB037835 Hs.92991 KIAA1414 protein
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mitochondrial)
135032 AW301984 Hs.173685 hypothetical protein FLJ12619
135077 AW503733 Hs.9414 KIAA1488 protein
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase small subunit 2 homolog
135095 AF027219 Hs.9443 zinc finger protein 202
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18)
135153 AI093155 Hs.95420 JM27 protein
135181 BE250865 Hs.279529 px19-like protein
135199 AA477514 Hs.96247 translin-associated factor X
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATIVE C10 PROTEIN [H.sapiens]
135214 T78802 Hs.96560 hypothetical protein FLJ11656
135243 BE463721 Hs.97101 putative G protein-coupled receptor
135245 AI028767 Hs.262603 ESTs
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens]
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1
135274 AA448460 Hs.112017 GE36 gene
135294 AA150320 Hs.9800 protein kinase Njmu-R1
135295 AI090838 Hs.98006 ESTs
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein [H.sapiens]
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD homolog)
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE proteins)
135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HEMBA1003959
135389 U05237 Hs.99872 fetal Alzheimer antigen
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone receptor; testicular feminization; spinal and bulbar

134975 R50333 Hs.92186 Leman coiled-coil protein
135011 AB037835 Hs.92991 KIAA1414 protein
135022 NM_000408 Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mi
135032 AW301984 Hs.173685 hypothetical protein FLJ12619
135077 AW503733 Hs.9414 KIAA1488 protein
135083 AB036063 Hs.94262 p53-inducible ribonucleotide reductase s
135095 AF027219 Hs.9443 zinc finger protein 202
135096 AA081258 Hs.132390 zinc finger protein 36 (KOX 18)
135153 AI093155 Hs.95420 JM27 protein
135181 BE250865 Hs.279529 px19-like protein
135199 AA477514 Hs.96247 translin-associated factor X
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI
135214 T78802 Hs.96560 hypothetical protein FLJ11656
135243 BE463721 Hs.97101 putative G protein-coupled receptor
135245 AI028767 Hs.262603 ESTs
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked
135263 AI088775 Hs.55498 geranylgeranyl diphosphate synthase 1
135274 AA448460 Hs.112017 GE36 gene

135294 AA150320 Hs.9800 protein kinase Njmu-R1 

135295 AI090838 Hs.98006 ESTs 

135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein . 

135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho

135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr

135361 AA373452 Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HE

135389 U05237 Hs.99872 fetal Alzheimer antigen 

135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 

302256 AA857131 Hs.171595 HIV TAT specific factor 1 

302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2,

303135 AW592789 Hs.279474 HSPC070 protein 

303686 AK000714 Hs.109441 MSTP033 protein 

310085 R43191 Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq

315518 AA808229 Hs.167771 ESTs 

317781 NM_007057 Hs.42650 ZW10 interactor

320836 AI268997 Hs.197289 rab3 GTPase-activating protein, non-cata

321114 AA902256 Hs.78979 Golgi apparatus protein 1

322221 N24236 Hs.179662 nucleosome assembly protein 1-like 1

322474 AF118083 Hs.29494 PRO1912 protein

322556 BE041451 Hs.177507 hypothetical protein

323541 AF292100 Hs.104613 RP42 homolog

407827 BE278431 Hs.40323 BUB3 (budding uninhibited by benzimidazo 

408196 AL034548 Hs.43627 SRY (sex determining region Y)-box 22 

408813 AI580090 Hs.48295 RNA helicase family

409176 R73727 Hs.101617 ESTs, Weakly similar to T32527 hypotheti 

413670 AB000115 Hs.75470 hypothetical protein, expressed in osteo 

414108 AI267592 Hs.75761 SFRS protein kinase 1 

414846 AW304454 Hs.77495 UBX domain-containing 1 

416980 AA381133 Hs.80684 high-mobility group (nonhistone chromoso 

417378 R57256 Hs.82037 TATA box binding protein (TBP)-associate

418283 S79895 Hs.83942 cathepsin K (pycnodysostosis)

418467 NM_006910 Hs.85273 retinoblastoma-binding protein 6

420269 U72937 Hs.96264 alpha thalassemia/mental retardation syn

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 

421225 AA463798 Hs.102696 MCT-1 protein 

421642 AF172066 Hs.106346 retinoic acid repressible protein 

421828 AW891965 Hs.279789 histone deacetylase 3 

421983 AI252640 Hs.110364 peptidylprolyl isomerase C (cyclophilin

422052 AA302744 Hs.104518 ESTs

422055 NM_014320 Hs.111029 putative heme-binding protein

423750 AF165883 Hs.298229 prefoldin 2 

424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105

425182 AF041259 Hs.155040 zinc finger protein 217

425284 AF155568 Hs.155489 NS1-associated protein 1

426372 BE304680 Hs.169531 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 

428049 AW183765 Hs.182238 GW128 protein 

428477 AW500533 Hs.11482 splicing factor, arginine/serine-rich 11

437562 AB001638 Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep

438449 AK001333 Hs.6216 Homo sapiens hepatocellular carcinoma-as

441560 F13386 Hs.7888 Homo sapiens clone 23736 mRNA sequence

445580 AF167572 Hs.12912 skb1 (S. pombe) homolog

446999 AA151520 Hs.334822 hypothetical protein MGC4485 

447111 AI017574 Hs.17409 cysteine-rich protein 1 (intestinal) 

447778 BE620592 Hs.71190 ESTs, Weakly similar to S16506 hypotheti 

448873 NM_003677 Hs.22393 density-regulated protein

449687 W68520 Hs.331328 intermediate filament protein syncoilin

450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA

450703 AA011202 Hs.184771 nuclear factor I/C (CCAAT-binding transc

452461 N78223 Hs.108106 transcription factor

452511 BE408178 Hs.285165 Homo sapiens cDNA FLJ20845 fis, clone AD

453157 AF077036 Hs.31989 DKFZP586G1722 protein 

453658 BE541906 Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp

100833 AF135168 Hs.108602 N-ethylmaleimide-sensitive factor

102481 U50360 gb:Human calcium, calmodulin-dependent p
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 7.9

103549 BE270465 Hs.78793 protein kinase C, zeta 2.0

104331 AB040450 Hs.279862 cdk inhibitor p21 binding protein 5.3

110018 AW579842 Hs.104557 hypothetical protein FLJ10697 2.0

115008 AK001827 Hs.87889 helicase-moi 5.7

119075 M10905 Hs.287620 fibronectin 1 1.3

119615 AL034423 Hs.75875 ubiquitin-conjugating enzyme E2 variant 2.9

125006 BE065136 Hs.145696 splicing factor (CC1.3) 1.7

127609 X80031 Hs.530 collagen, type IV, alpha 3 (Goodpasture 2.4 

129209 R62676 Hs.17820 Rho-associated, coiled-coil containing p 5.2

129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B), reg 4.5

130182 BE267033 Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo 11.0

130365 W56119 Hs.155103 eukaryotic translation initiation factor 3.3

131135 NM_016569 Hs.267182 TBX3-iso protein U

131853 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 3.2

131881 AW361018 Hs.3383 upstream regulatory element binding prot 14.3 

132726 N52298 Hs.55608 hypothetical protein MGC955 3.0 

135193 X95525 Hs.96103 TATA box binding protein (TBP)-associate 2.7

409487 H19886 gb:yn57a05.r1 Soares adult brain N2b5H 2.3

416040 AW819158 Hs.289044 Homo sapiens cDNA FLJ12048 fis, clone HE 7.4
TABLE 4A

Table 4A shows the accession numbers for those Pkeys lacking UnigeneIDs in Table 4. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers 
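
As an illustration only, the three-column layout described above (Pkey, CAT number, Accessions) could be read into a lookup table roughly as follows. This is a minimal sketch, not part of the disclosed method: the names `ClusterRow` and `parse_table_4a` are hypothetical, the sample rows are abbreviated for the example, and the sketch assumes that a row starts with a numeric Pkey and that any line not starting with a number continues the accession list of the preceding row.

```python
from typing import Dict, List, NamedTuple


class ClusterRow(NamedTuple):
    pkey: int              # unique Eos probeset identifier number
    cat_number: str        # gene cluster number, kept as an opaque string
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_table_4a(text: str) -> Dict[int, ClusterRow]:
    """Parse whitespace-separated 'Pkey CAT-number Accessions' rows."""
    rows: Dict[int, ClusterRow] = {}
    current = None
    for line in text.splitlines():
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines between rows
        if tokens[0].isdigit() and len(tokens) >= 3:
            # A numeric first token starts a new row (assumption).
            current = ClusterRow(int(tokens[0]), tokens[1], tokens[2:])
            rows[current.pkey] = current
        elif current is not None:
            # Otherwise the line continues the previous accession list.
            current.accessions.extend(tokens)
    return rows


# Illustrative sample; the line wrap in the second row is hypothetical.
sample = """\
123315 714071_1 AA496369 AA496646
409487 1134778_1 H19886 AW402806
T10231
"""
table = parse_table_4a(sample)
print(table[409487].cat_number, table[409487].accessions)
```

Keeping the CAT number as a string avoids guessing at its internal structure, which the table legend does not define.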



Pkey CAT number Accessions 



123615 30686,-15 AA609170
123619 371681_1 AA602964 AA609200
101445 165CL-5 M21259
124385 656394_1 AI267847 N27351
124417 1642364_1 N34059 N46979
124482 1657509.1 N53935 N53950
102481 3128L-28 U50360
103349 1105^-2 X89059
110856 19346.14 AA992380 N33063 N21418 H79958 R21911 H79957
103797 109699.1 AA080912 AA075318 AA083403 AA076594 AA078992 AA084926 AA081881 AA113913 AA113892
AA083821 AA134801 AA082953 AA070343 AA062835 AA075419 AA063293 AA071252 AA078900 AA062836 AW974305
120280 160212.1 AA190577 AA181657
113248 328626_1 T63857 AW971220 AA493469 T63699
120472 44573.2 AI950087 N70208 R97040 N35809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AI653832
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273
AA969759 N75528 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005
129019 44573_1 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI672193 AI763273 AW173586 AW150329 AI653832
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005
120695 9683.3 AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 AW052210
AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468
122188 275673_1 AA398838 AA435847
121581 283769_1 AA416568 AA442889 AA417233 AA442223
122618 305217_1 AA453641 AA454061
109026 150431.1 AA157811 AA836869
123658 genbank_AA609364 AA609364
123811 genbank_AA620586 AA620586
125115 genbank_T97341 T97341
125147 NOT.FOUND.entrez W38150
118737 382979_1 AA199686 N73861
120274 genbank_AA177051 AA177051
113196 genbank_T57317 T57317
120504 genbank_AA256837 AA256837
120639 genbank_AA286942 AA286942
120809 genbank_AA346495 AA346495
113702 genbank_T97307 T97307
129680 23162_1 U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AI148432
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 AA894441
AJ803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 AA976699 AI687001
AI621107 AI865540 AA772107 C06286 AA319661 AA405992
101045 entrez_J05614 J05614
117247 genbank N21032
110501 genbank H55748
103392 entrez_X94563 X94563
105032 genbank AA127818
119513 NOT.FOUND.entrez W37933
105445 genbank AA252395
121514 genbank AA412112
121558 genbank AA412497
121911 genbank AA427950
123315 714071_1 AA496369 AA496646
114911 genbank AA238672
409487 1134778_1 H19886 AW402806 T10231
TABLE 5: Figure 5 from BRCA 001 US



Table 5 shows genes upregulated in tumor tissue compared to normal breast tissue.



Pkey: Unique Eos probeset identifier number

ExAccn: Exemplar Accession number, Genbank accession number

UnigeneID: Unigene number

Unigene Title: Unigene gene title

R1: Ratio of tumor to normal breast tissue 
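
For illustration, the R1 column can be read as a simple tumor-to-normal quotient. The sketch below is an assumption-laden example rather than the procedure used to build Table 5: the function names, the probeset keys 1 and 2, the expression values, and the 2.0 cutoff are all hypothetical and chosen only to show how such a ratio might be computed and used to select upregulated probesets.

```python
from typing import Dict, Tuple


def tumor_to_normal_ratio(tumor: float, normal: float) -> float:
    """Return tumor expression divided by normal-tissue expression."""
    if normal <= 0:
        raise ValueError("normal-tissue expression must be positive")
    return tumor / normal


def upregulated(
    expression: Dict[int, Tuple[float, float]],
    min_ratio: float = 2.0,  # hypothetical cutoff, not taken from the text
) -> Dict[int, float]:
    """Keep probesets whose tumor/normal ratio is at least min_ratio."""
    ratios = {
        pkey: tumor_to_normal_ratio(tumor, normal)
        for pkey, (tumor, normal) in expression.items()
    }
    return {pkey: r for pkey, r in ratios.items() if r >= min_ratio}


# Hypothetical (tumor, normal) expression levels keyed by a made-up Pkey.
expression = {1: (290.0, 100.0), 2: (120.0, 100.0)}
selected = upregulated(expression)
print(selected)
```

A ratio above 1 indicates higher expression in tumor than in normal tissue; where the cutoff should sit is a design choice the table legend does not specify.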



Pkey ExAccn UnigeneID Unigene Title R1

100114 X02308 Hs.82962 thymidylate synthetase 2.9

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5

100154 H60720 Hs.81892 KIAA0101 gene product 9.2

100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola 2.7

100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 5.7 

100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 9 
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 7.6 
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 
100988 AK000405 Hs.76480 ubiquitin-like 4 11.4
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 82
101045 J05614 gb:Human proliferating cell nuclear anti 5
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 3.4
101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxid 6.3
101580 NM_012151 Hs.83363 coagulation factor VIII-associated (intr 5.7
101592 AF064853 Hs.91299 guanine nucleotide binding protein ( 5.6
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue) 14.4 
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9 
101810 NM_000318 Hs.180612 peroxisomal membrane protein 3 (35kD, Ze 32
101983 AJ904232 Hs.75323 prohibitin 8.4
102107 BE258602 Hs.182366 heat shock protein 75 1.4 
102165 BE313280 Hs.159627 death associated protein 3 4.6 
102198 AW950852 Hs.74598 polymerase (DNA directed), delta 2, regu 4.3 
102217 AA829978 Hs.301613 JTV1 gene 6.7
102220 U24389 Hs.65436 lysosomal 43
102302 AA306342 Hs.69171 protein kinase C-like 2 2.7
102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 2 
102374 U33635 Hs.90572 PTK7 protein tyrosine kinase 7 62 
102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 

102568 W81489 Hs.223025 RAB31, member RAS oncogene family 5.3

102618 AL037672 Hs.81071 extracellular matrix protein 1 5.8 

102687 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 4.3

102689 U96132 Hs.171280 hydroxyacyl-Coenzyme A dehydrogenase, ty 6 

102704 AU077058 Hs.54089 BRCA1 associated RING domain 1 1.9 

102705 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 2.3 
102801 BE252241 Hs.38041 pyridoxal (pyridoxine, vitamin B6) kinas 6.4
102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b 5.6
103060 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11;stro 4.5
103080 AU077231 Hs.82932 cyclin D1 (PRAD1: parathyroid adenomatos 3.1
103178 AA205475 Hs.275865 ribosomal protein S18 9.9 
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 
103238 AI369285 Hs.75189 death-associated protein 5.6 
103547 AI376722 Hs.180062 proteasome (prosome, macropain) subunit, 9.7
103549 BE270465 Hs.78793 protein kinase C, zeta 7.9 
103886 AK001278 Hs.105737 hypothetical protein FLJ10416 similar to 6.5 
104325 BE379766 Hs.150675 polymerase (RNA) II (DNA directed) polyp 6.3 
104827 AW052006 Hs.8551 PRP4/STK/WD splicing factor 105 
104846 AI250789 Hs.32478 ESTs 5.6 
104854 AA041276 Hs.154729 3-phosphoinositide dependent protein kin 12.3
104867 AA278898 Hs.225979 hypothetical protein similar to small G 2 
104896 AW015318 Hs.23165 ESTs 17.7
104909 AW408164 Hs.249184 transcription factor 19 (SC1) 5
104916 AW958157 Hs.155489 NS1-associated protein 1 1.7
104919 AA026880 Hs.25252 prolactin receptor 1.4
104974 Y12059 Hs.278675 bromodomain-containing 4 1.4
104978 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 72
105012 AF098158 Hs.9329 chromosome 20 open reading frame 1 3.3
105039 AA907305 Hs.36475 ESTs 2.5
105079 AA151342 Hs.12677 CGI-147 protein 9.5
105088 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL 2.2
105393 AF167570 Hs.256583 interleukin enhancer binding factor 3, 9 5.4
105547 AA262640 Hs.27445 unknown 9.3
105564 BE616694 Hs.288042 hypothetical protein FLJ14299 1.4
105658 AA985190 Hs.246875 hypothetical protein FLJ20059 9.4
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5
105858 AF151066 Hs.281428 hypothetical protein 2.9
105930 AF016371 Hs.9880 peptidyl prolyl isomerase H (cyclophilin 5.2
106094 AA533491 Hs.23317 hypothetical protein FLJ14681 6.8
106350 AK001404 Hs.194693 cyclin B2 5.7
106359 AW390282 Hs.31130 transmembrane 7 superfamily member 2 6.3
106610 AA458882 Hs.79732 fibulin 1 7.9
106624 NM_003595 Hs.26350 tyrosylprotein sulfotransferase 2 7.7
106713 BE614802 Hs.184352 hypothetical protein FLJ12549 4.5
106829 AW959893 Hs.27099 hypothetical protein FLJ23293 similar to 1 62
106846 AB037744 Hs.34892 KIAA1323 protein 2.2
106873 N49809 Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, 16.8
106973 BE156256 Hs.11923 hypothetical protein 6.6
107029 AF264750 Hs.288971 myeloid/lymphoid or mixed-lineage leukem 1.8
107197 W15477 Hs.64539 glioma pathogenesis-related protein 6.1
107859 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.4
107901 L42612 Hs.335952 keratin 6B 2.5
107922 BE153855 Hs.61460 Ig superfamily receptor LNIR 22
107974 AW956103 Hs.61712 pyruvate dehydrogenase kinase, isoenzyme 6.7
108274 AF129535 Hs.272027 F-box only protein 5 7.1
108647 BE546947 Hs.44276 homeo box C10 9.8
108695 AB029000 Hs.70823 KIAA1077 protein 7.2
108894 AK001431 Hs.5105 hypothetical protein FLJ10569 4
109011 AA156542 Hs.72127 ESTs 1.4
109068 AA164293 Hs.72545 ESTs 2.9
109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f 2.9
109468 NM_015310 Hs.6763 KIAA0942 protein 3.2
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC A2
110330 AI288666 Hs.16621 DKFZP434I116 protein 62
110501 H55748 gb:yq94a01.s1 Soares fetal liver spleen 6.1
110742 AW190338 Hs.28029 hypothetical protein MGC11256 7.6
110762 BE044245 Hs.30011 hypothetical protein MGC2963 9.3
110856 AA992380
110958 NM_005864 Hs.24587
111125 N63823 Hs.269115
111179 AK000136 Hs.10760
111239 N90956 Hs.17230
111285 AA778711 Hs.4310
111392 W46342 Hs.325081
111937 BE298665 Hs.14846
112244 AB029000 Hs.70823
112995 AA737033 Hs.7155
113777 BE265947 Hs.10590
113791 AI269095 Hs.135578
113811 BE207480 Hs.6994
113834 T26483 Hs.6059
113868 W57902 Hs.90744
113870 AL079314 Hs.16537
113923 AW953484 Hs.3849
114275 AW515443 Hs.306117
114895 AA236177 Hs.76591
114965 AI733881 Hs.72472
115061 AI751438 Hs.41271
gb:ot37g06.s1 SoaresJestisJIHT Homo sap 2.3 

signal transduction protein (SH3 contain 6.7 
ESTs,ModeratelysimiiartoZ195^HUMANZ 3.6 

asporin(LRR class 1) 7.1 

hypothetical protein FU22087 75 

eukaryotic translation Initiation factor 65 
Homo sapiens, done IMAGE:3659680, mRNA, 8.4 
Homo sapiens mRNA; cDNA DKFZp564D016 (fr 10.6 

KIAA1077 protein 14.6 

ESTs I ModeratelysImilarto2115357ATYK 5.6 

zinc finger protein 313 13.4 

chltoblase, di-N-acetyl- 1-3 

Homo sapiens cDNA: FU22044 fis. clone H 3,1 

EGF-containing fibulin-iike extracellula 1 15 
proteasome (prosome, macropain) 26S subu 2.7 

hypothetical protein, similar to (U06944 6.1 . 

hypothetical protein FU22041 similar to 1.9 

K1AA0306 protein 155 

K1AA0887 protein 7.1 

BMP-R1B 2.3 

Homo sapiens mRNA full length Insert cDN 115 
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115278 

115291 

115652 

115693 

115941 

115968 

116011 

116417 

116470 

116637 

117132 

117881 

118528 

119075 

119265 

119349 

119403 

119789 

120206 

120253 

120297 

120325 

120327 

120349 

120356 

120371 

120383 

120386 

120389 

120396 

120418 

120472 

120484 

120570 

120582 

120596 

120624 

120695 

120713 

120750 

120774 

120807 

120809 

120984 

121081 

121408 

121505 

121508 

121513 

121549 

121558 

121655 

121744 

121748 

121773 

121832 

121839 

121882 

121911 

121999 

122013 

122036 

122356 

122371 

122372 

122460 

122490 



AK002163 

BE545072 

BE093589 

AF231023 

AI867451 

AB037753 

AL359053 

AW499664 

AI272141 

AK001043 

AI393666 

AF161470 

AI949952 

M10905 

BE539706 

T65004 

AL1 17554 

BE393948 

H26735 

AA131376 

AA1913B4 

AA195651 

AK000292 

AW969481. 

AF000545 

AA219305 

AL109963 

AW969665 

AW967985 

AA134006 

AW966893 

AI950087 

AA253170 

AA280679 

BE244830 

AA282074 

AW407987 

AA976503 

AW449855 

AI191410 

AI608909 

AA346385 

AA346495 

BE262951 

AA398721 

AA406137 

M494172 

M402515 

AA416653 

M412477 

M412497 

M421537 

AA398784 

BE536911 

AB033022 

AW340797 

AA425691 

M426376 

M427950 

AA430211 

AA431085 

W92142 

AA443794 

AA868555 

AA446008 

AW41878B 

AA448349 



Hs.301724 hypothetical protein FU11301 1.5 

Hs.122579 hypothetical protein FLJ1Q461 6.2 

Hs.38178 hypothetical protein FLI23468 10.6 

Hs.55173 cadherin, EGF LAG seven-pass G-type rece 6.8 

Hs.46679 hypothetical protein FU20739 5.5 

Hs.62767 KIAA1332 protein 9.8 

Hs.57664 Homo sapiens mRNA full length Insert cDN 2,4 

Hs.12484 Human clone 23826 mRNA sequence 7.4 

Hs.83484 SRY (sex determining region Y)-box 4 2.1 

Hs.92033 Integrin-linked klnase-associated serine 2.7 

Hs.42315 p10-binding protein 5.2 

Hs.260622 butyrate-lnduced transcript 1 5.7 

Hs.49397 ESTs 7.4 

Hs.287820 fibronectlnl 5.7 

Hs.285363 ESTs 1.4 

Hs.163561 ESTs 8.4 

Hs.1 19908 nudeolarprotein NOP5/NOP58 6.7 

Hs.50915 kallikreln 5 (KLK5; KLK-L2; stratum com 9.2 

Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 
Hs.326401 ■ fibroblast growth factor 12B 38,9 

Hs.104072 ESTs, Weakly similar to Z195J1UMAN ZINC 15.2 

Hs.104106 ESTs 6.4 

Hs.278732 hypoftietical protein FU20285 16.1 

Hs.55189 hypofoetical protein 16.8 

Hs.296433 putative purinergic receptor ' 28.1 

Hs.104196 EST 12.4 

Hs.123122 FSH primary response (LRPR1, rat) homolo 9.7 

Hs.154848 hypothetical protein DKFZp434D0127 32.6 

Hs.325572 ESTs, Moderately similar to ALU7JWMAN A 21 .7 

Hs.79306 eukaryotic translation initiation factor 1 25 

Hs.26613 Homo sapiens mRNA; cDNA DKFZp586F1323 (f 11.4 

gbwq05c02j(1 NCI_CGAPJ<id12 Homo sapien 19.4 

Hs.96473 EST 10.4 

Hs.271445 ESTs, Weakly similar to ALUIJiUMAN ALU S 14.4 

Hs.284228 ZNF135-like protein 10.2 

Hs.237323 N-acetylglucosamine-phosphate mutase 7.5 

Hs.173518 M^hasephosphoprotelnhomolog 52 

gb:oq30a04.s1 NCLCGAPJ3C4 Homo sapiens 46.8 

Hs.96557 Homo sapiens cDNA FU12727 fis, clone NT 5.9 

Hs.96693 ESTs, Moderately similar to 2109260A Be 7 

Hs.193985 ESTs 7.8 

Hs.30002 SH3-contalning protein SH3GLB2; KIAA1848 6.8 

gb£ST52657 Fetal heart II Homo sapiens 4.4 ■ 

Hs.99052 ESTs 5.6 

Hs.186749 ESTs, Highly slmflar to I37550 mismatch 5.4 

Hs.98019 EST 6 

Hs.194417 ESTs 13.1 

Hs.97887 ESTs 28 

Hs.181510 ESTs 6.2 

Hs 98142 EST 7.4 

gb:zt95g12.s1 Soares_testis_NHT Homo sap 2.8 

Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (f 7.8 

Hs.97514 ESTs 7.1 

Hs.234545 hypothetical protein NUF2R 19.5 

Hs.158654 K1AA1196 protein 7.9 

Hs.96434 ESTs 6.8 

Hs!l91606 ESTs, Highly similar to K1AA1 048 protein 5 

Hs.98459 ESTs 5 

gb:zw50fOZs1 SoaresJotaLfetusJ^b2HF8_ 7.2 

Hs.98668 EST 6.4 

Hs.98706 ESTs 6.5 

Hs.271963 ESTs, Weakly similar to ALU5.HUMAN ALUS 13.1 

Hs.98390 ESTs 7.3 

Hs.178222 ESTs 5 

Hs.336677 EST 7.6 

Hs.99148 ESTs, Weakly similar to S43569 R01H10.6 9.7 

Hs.238151 EST . 6.1 
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122492 AA448417 Hs.104990 
122510 AA449232 Hs.99195 
122530 AW959741 Hs.40368 
122572 AA452601 Hs.99287 
5 122607 AA453518 Hs.98023 
122614 AA453630 Hs.99339 
122616 AA453638 Hs.161873 
122618 AA453641 
122622 M453987 Hs.144802 

10 122717 AA456859 Hs.178358 
122829 AW204530 Hs.99500 
122838 AA460584 Hs.334386 
122856 AI929374 Hs.75367 
122868 AF005216 Hs.115541 

15 122907 AM70074 Hs.169896 
123016 AW338067 Hs.323231 
123034 AL359571 Hs.44054 
123136 AW451999 Hs.194024 
123152 AW601773 Hs.270259 

20 123394 AA731404 Hs.105510 
123466 AA599042 Hs.112503 
123486 BE019Q72 Hs.334802 
123615 AA609170 
123735 NM_013241Hs.95231 

25 123753 AA609955 Hs.234961 
124006 Al 147155 Hs.270016 
124385 AI267847 
124440 AA532519 Hs.129043 
124656 AW297702 Hs.102915 

30 124683 AA381661 Hs.119878 
124735 R22952 Hs.268685 
124761 AA374756 Hs.93560 
124768 AW368528 Hs.100855 
124788 R43543 Hs.100912 

35 124811 R46068 Hs.288912 
124812 R47948 . Hs.188732 
124822 AA418160 Hs.86043 
124860 R65763 Hs.101477 
124903 AW296713 Hs.221441 

40 124930 AI076343 Hs.173939 
124942 R99978 Hs.268892 
125051 T79956 Hs.100588 
125056 T81310 Hs.100592 
125101 AI472068 Hs.286236 

45 125115 T97341 

125280 AI123705 Hs.106932 
.127274 AW966158 Hs.58582 
128528 R39234 Hs.251699 
128670 AA975486 Hs.103441 

50 128691 W27939 Hs.103834 
128772 BE302796 Hs.105097 
128781 N71B26 Hs.105465 
128797 NMJJ02975HS.105927 
128868 AA419008 Hs.106730 

55 128891 F34B56 Hs.292457 
128946 Y13153 Hs.107318 
128975 BE560779 Hs.284233 
128995 AI816224 Hs.107747 
129019 AI950087 

60 129076 AW296806 Hs.326234 
129088 AA744610 Hs.194431 
129096 AA463189 Hs.288906 
129198 N57532 Hs.109315 
129347 BE614192 Hs.279869 

65 129362 U30246 Hs.1 10736 
129372 NNL016039HS.1 10803 
129404 AI267700 Hs.317584 



ESTs 5.4 

ESTs 11.2 

adaptor-related protein complex 1, sigma 10.1 

EST 11 

ESTs 613 

EST ■ 10.7 

ESTs 107.3 
gb:zx48eQ6.s1 SoaresJestis_NHT Homo sap 31.1 

ESTs 5.6 

ESTs 6.5 

ESTs 81.8 

ESTs 753 

Src-iike-adapter 5.8 

Janus kinase 2 (a protein tyrosine kinas 5.3 

ESTs 113 

Homo sapiens cDNA FU1 1946 fis, clone HE 2.8 

ninein (GSK3B interacting protein) 8.7 

ESTs 5.1 

ESTs 5.2 

ESTs ! 3.6 

EST 7.4 

Homo sapiens cDNA FU14680 fis, clone NT 2.4 

gbaf12a12.s1 SoaresJestisJJHT Homo sap 7.8 

FHiyFH2domain-conta(nlngpn3te/n 10 

Huntingtin interacting protein E ' 30.6 

ESTs 8.1 

gbaq49a10.x1 Stanley Frontal NB pool 2 57.1 
Human DNA sequence from clone 989H1 1 on 7.8 

ESTs 8.3 
ESTs, Weakly similar to M3K9_HUMAN MITOG 7.9 

ESTs * 11.3 

Homo sapiens mRNA for KIAA1771 protein, 9 

ESTs 8.1 

Homo sapiens cDNA: FU22726 fis, done H 5.1 

hypothetical protein FU22604 14.2 

ESTs 7.9 

Homo sapiens cONAFLJI 3558 fis, clone PL 6.6 

EST 23.9 

ESTs 32.4 

ESTs, Weakly similar to ALUB_HUMAN till 22.8 

ESTs, Moderately similar to B34087 hypot 6.1 

EST 135.3 

ESTs 5.4 

K1AA1 856 protein 5.6 

gb:ye57e05.s1 Soares fetal liver spleen 9.6 

ESTs 8 

Homo sapiens cDNA FU12789 fis, done NT 12.8 

ESTs, Weakly similar to IDN4-GGTR14 [H.s 2.8 

Homo sapiens, Similar to RIKEN cDNA 1700 7.1 

hypothetical protein MGC5576 7.7 

thymidine kinase 1 , soluble 5.3 

small nuclear ribonucteoproteln polypept 53.9 

stem cell growth factor; lymphocyte seer 1 33 

chromosome 22 open reading frame 3 3 
Homo sapiens, clone MGC:16362, mRNA, com 13-3 

kynurenine 3-monooxygenase (kynurenlne 3 7.2 

NICE-5 protein 14 

DKFZP566C243 protein 1.9 
gbwq05c02.x1 NCI_CGAP_Kid12 Homo sapien 2.9 

ESTs, Highly similar to T46422 hypotheti 5 

palladia 17.1 

WW Domain-Containing Gene 20.9 

KIAA1415 protein 5.8 

melanoma-associated antigen recognised b 7.6 

solute carrier family 12 (sodium/potassl 6.7 

CGI-99 protein 2 

ESTs 5 
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129482 AA188185 Hs.289043 spinotn 6.7 

129559 W01296 Hs.11360 hypothetical protein FLJ14784 7.5 
129587 H14718 Hs.11506 Human clone 23589 mRNA sequence 6.8 
129629 AK000398 Hs.11747 hypothetical protein FLJ20391 3.8 
129649 AD000092 Hs.16488 calreticulin 3.3 
129680 U03749 gb:Human chromogranin A (CHGA) gene, pro 14.1 
129689 AW748482 Hs.77873 B7 homolog 3 ZS 
129702 AI304966 Hs.12035 ESTs, Weakly similar to I38022 hypotheti 7.4 
129720 AA156214 Hs.12152 APMCF1 protein 2 
130010 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 1.6 
130097 AL046962 Hs.14845 forkhead box O3A 2.8 
130135 AA311426 Hs.21635 tubulin, gamma 1 6.1 
130211 NM_003358 Hs.23703 ESTs, Moderately similar to CEGT_HUMAN C 1.6 
130242 X79201 Hs.153221 synovial sarcoma, translocated to X chro 5.4 
130359 NM_013449 Hs.277401 bromodomain adjacent to zinc finger doma 8.5 
130365 W56119 Hs.155103 eukaryotic translation initiation factor 11 
130448 BE513202 Hs.15589 PPAR binding protein 3.9 
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 33.6 
130471 AL121438 Hs.183706 adducin 1 (alpha) 2.7 
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1 
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1 
130542 U64675 Hs.179825 RAN binding protein 2-like 1 7.8 
130553 AF062649 Hs.252587 pituitary tumor-transforming 1 14.4 
130556 AI907018 Hs.15977 Empirically selected from AFFX single pr 4.7 
130567 AA383092 Hs.1608 replication protein A3 (14kD) 7.9 
130574 AF083208 Hs.1617fl apoptosis antagonizing transcription fac 1.2 
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 12.1 
130667 BE246961 Hs.17639 Homo sapiens ubiquitin protein ligase (U 13.9 
130693 R68537 Hs.17962 ESTs 2 
130744 H59696 Hs.18747 POP7 (processing of precursor, S. cerevi 3.1 
130757 AL036067 Hs.18925 protein x 0001 5.7 
130860 BE514434 Hs.20830 kinesin-like 2 2.1 
130944 BE382657 Hs.21486 signal transducer and activator of trans 5.4 
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 
131060 AA194422 Hs.22564 myosin VI 5.1 
131099 AL133353 Hs.226581 COX15 (yeast) homolog, cytochrome c oxid 7 
131135 NM_016569 Hs.267182 TBX3-iso protein 3.3 
131185 BE280074 Hs.23960 cyclin B1 5.8 
131225 H62087 Hs.31659 thyroid hormone receptor-associated prot 7.5 
131245 AL080080 Hs.24766 thioredoxin domain-containing 2.8 
131283 X80038 Hs.339713 Homo sapiens clone F19374 APOE-C2 gene 1.3 
131569 AL389951 Hs.271623 nucleoporin 50kD 5 
131643 AW410601 Hs.30026 HSPC162 protein ZB 
131714 AA642831 Hs.31016 putative DNA binding protein 2.9 
131722 D13757 Hs.311 phosphoribosyl pyrophosphate amidotransf 3.4 
131760 X76732 Hs.3164 nucleobindin 2 2.9 
131793 AW966127 Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT 7.9 
131885 BE502341 Hs.3402 ESTs 13.7 
131900 AA099014 Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com 8.7 
131905 AA179296 Hs.3439 stomatin-like 2 11.3 
131941 BE252983 Hs.35086 ubiquitin specific protease 1 2.3 
131971 BE567100 Hs.154938 hypothetical protein MDS025 3.5 
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7 
132203 NM_004782 Hs.194714 synaptosomal-associated protein, 29kD 7.8 
132273 AA227710 Hs.43658 DKFZP586L151 protein 10 
132268 N36110 Hs.305971 solute carrier family 2 (facilitated glu 9.2 
132294 AB023191 Hs.44131 KIAA0974 protein 2 
132348 AW067708 Hs.170311 heterogeneous nuclear ribonucleoprotein 12£ 
132370 AW572805 Hs.46645 ESTs 28.3 
132384 AA312135 Hs.46967 HSPC034 protein 6.1 
132450 AA100012 Hs.48827 hypothetical protein FLJ12085 8.6 
132465 AW169847 Hs.49169 KIAA1634 protein 6.1 
132532 AA454132 Hs.5080 mitochondrial ribosomal protein L16 7.1 
132574 AW631437 Hs.5184 TH1 drosophila homolog 14 
132638 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 12.4 
132718 NM_004600 Hs.554 Sjogren syndrome antigen A2 (60kD, ribon 3.7 
132726 N52298 Hs.55608 hypothetical protein MGC955 14^ 
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132731 AI189075 Hs.301872 hypothetical protein MGC4840 5.9 

132744 AA010233 Hs.55921 glutamyl-prolyl-tRNA synthetase 6.4 
132773 AA459713 Hs.295901 KIAA0493 protein 14.6 
132798 AI026701 Hs.5716 KIAA0310 gene product 2.5 
132810 AB007944 Hs.5737 KIAA0475 gene product 4.2 
132833 U78525 Hs.57783 eukaryotic translation initiation factor 6.1 
132842 NM_016154 Hs.279771 Homo sapiens clone PP1596 unknown mRNA 7.1 
132851 U09716 Hs.287912 lectin, mannose-binding, 1 6.1 
132891 BE267143 Hs.59271 U2(RNU2) small nuclear RNA auxiliary fac 2.7 
132941 AI817165 Hs.6120 hypothetical protein FLJ13222 2.1 
132972 AA034365 Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HE 3.5 
132980 AA040696 Hs.62016 ESTs 1.3 
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1 
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 4.4 
133177 X97795 Hs.66718 RAD54 (S.cerevisiae)-like 4.4 
133208 AI801777 Hs.6774 ESTs 5.5 
133254 AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA, 1.3 
133266 AI160873 Hs.69233 zinc finger protein 16.1 
133268 AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKH 122 
133285 M76477 Hs.289082 GM2 ganglioside activator protein 10.4 
133390 AI950382 Hs.72660 phosphatidylserine receptor 5.7 
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5 
133540 AL037159 Hs.74619 proteasome (prosome, macropain) 26S subu 1.7 
133594 AW160781 Hs.172589 nuclear phosphoprotein similar to S. cer 2.6 
133621 NM_004893 Hs.75258 H2A histone family, member Y 13.5 
133720 L27841 Hs.75737 pericentriolar material 1 6.7 
133760 BE271766 Hs.181357 laminin receptor 1 (67kD, ribosomal prot 5.4 
133784 BE622743 Hs.301064 arfaptin 1 12.1 
133791 M34338 Hs.76244 spermidine synthase 9.7 
133797 AL133921 Hs.76272 retinoblastoma-binding protein 2 1.3 
133822 D50525 Hs.699 peptidylprolyl isomerase B (cyclophilin 9.7 
133850 W29092 Hs.7678 cellular retinoic acid-binding protein 1 4.2 
133865 AB011155 Hs.170290 discs, large (Drosophila) homolog 5 5 
133881 U30872 Hs.77204 centromere protein F (350/400kD, mitosin 9.1 
133924 D86326 Hs.325948 vesicle docking protein p115 1.8 
133959 X81789 Hs.77897 splicing factor 3a, subunit 3, 60kD 10.4 
133989 AL040328 Hs.78202 SWI/SNF related, matrix associated, acti 2.6 
133997 AI824113 Hs.78281 regulator of G-protein signalling 12 13 
134234 BE300078 Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA, 103 
134348 AW291946 Hs.82065 interleukin 6 signal transducer (gp130, 6.7 
134376 X06560 Hs.82396 2',5'-oligoadenylate synthetase 1 (4046 5.5 
134379 AW362124 Hs.323193 hypothetical protein MGC3222 5.8 
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 
134421 AU077196 Hs.82985 collagen, type V, alpha 2 6.7 
134480 NM_005000 Hs.83916 Empirically selected from AFFX single pr 6.2 
134516 AK001571 Hs.273357 hypothetical protein FLJ10709 1.4 
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8 
134751 AW630803 Hs.89497 lamin B1 6.1 
134790 BE002798 Hs.287850 integral membrane protein 1 12 
134806 AD001528 Hs.89718 spermine synthase 2.6 
134850 AI701162 Hs.90207 hypothetical protein MGC11138 9.1 
134859 D26488 Hs.90315 KIAA0007 protein 13.3 
134971 AI097346 Hs.286049 phosphoserine aminotransferase 2 
135181 BE250865 Hs.279529 px19-like protein 14.9 
135207 N26427 Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI 1.7 
135245 AI028767 Hs.262603 ESTs 12.2 
135257 AW291023 Hs.97255 ESTs, Weakly similar to A46010 X-linked 7.6 
135307 AI743770 Hs.98368 ESTs, Weakly similar to KIAA0822 protein 5.8 
135321 AI652069 Hs.98614 ribosome binding protein 1 (dog 180kD ho 12.3 
135354 AA456454 Hs.183418 cell division cycle 2-like 1 (PITSLRE pr 5.7 
135400 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 13.9 
302276 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c 5.3 
317781 NM_007057 Hs.42650 ZW10 interactor 2.8 
321114 AA902256 Hs.78979 Golgi apparatus protein 1 5.5 
322556 BE041451 Hs.177507 hypothetical protein 2.9 
420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.3 
424001 W67883 Hs.137476 paternally expressed 10 (PEG10; KIAA105 7 
425182 AF041259 Hs.155040 zinc finger protein 217 2.3 
446999 AA151520 Hs.334822 hypothetical protein MGC4485 7.5 
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.6 
452461 N78223 Hs.108106 transcription factor 4.7 
453157 AF077036 Hs.31989 DKFZP586G1722 protein 12.1 
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TABLE 5A 



Table 5A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 5. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
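
For readers who want to handle these cluster listings programmatically, the following is a minimal sketch, assuming each row is whitespace-delimited with the Pkey first, the CAT number second, and the Genbank accessions after that. The names `ClusterRow` and `parse_cluster_row` and the row used in the example are hypothetical and are not taken from the tables.

```python
from typing import List, NamedTuple


class ClusterRow(NamedTuple):
    pkey: int              # Eos probeset identifier
    cat_number: str        # gene cluster number, kept as text
    accessions: List[str]  # Genbank accessions comprising the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split one 'Pkey CAT-number Accessions...' row into its fields."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected at least 3 fields, got {len(fields)}")
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


# Hypothetical row: placeholder values, not data from Table 5A.
row = parse_cluster_row("999999 CAT_EXAMPLE AA000001 AA000002")
```

The CAT number is kept as a string because the printed values are not plain integers.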



Pkey CAT number Accessions 

123615 30686.-15 AA609170 
124385 656394 J AI267847 N27351 

110856 19346 14 AA992380 N33063 N21418 H79958 R21911 H79957 

120472 44573^2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

129019 44573J AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87891 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874058 AL134043 AW235363 AA663345 AW008282 
AA488964 AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 
AW150329 AI653832 AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 
AI961530 AW629970 BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 
AA251669 AA251874 AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 
AW513996 AA293273 AA969759 N75628 N22388 H84729 H60052 T92467 AI022058 AA780419 AA551005 W80701 
AW613456 AI373032 AI564269 F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 
F04005 

120695 9683J AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 
AI796100 AI935603 AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 

122618 305217J AA453641 AA454061 

125115 genbank_T97341 T97341 

120809 genbank_AA346495 AA346495 

129680 23162J U03749 NM_001275 J03483 J03915 AI214509 AW245744 AL046455 AA318960 AI741505 AA843875 AI829382 AI560122 
AI858999 D55958 AI684005 D53170 AA854091 AI025609 D53119 D54729 D55504 D55377 D55313 AW512244 AA846441 
AW043898 AI969102 AA405741 AI091983 AA788784 AA706586 AA854361 AW470949 AA843095 AA772028 AJ148432 
AI038109 AA782478 AA910064 AI220384 AA781296 AA843881 AA854064 AA843125 AA843419 AA319036 AA319054 
AI273831 W32275 AI584185 C05724 AA789023 AI686818 D54392 AI022485 AA431410 AA854232 W39212 W15214 
AA894441 AI803081 AI167381 AW245389 AA319430 AA335156 AI042646 AA327030 AA725170 T27943 AA889304 
AA976699 AI687001 AI621107 AI865540 AA772107 C06286 AA319661 AA405992 

101045 entrez_J05614 J05614 

110501 genbank_H55748 H55748 

121558 genbank_AA412497 AA412497 

121911 genbank_AA427950 AA427950 
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TABLE 6: Figure 6 from BRCA 001 US 

Table 6 shows genes upregulated in tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigeneID: Unigene number 

Unigene Title: Unigene gene title 

R1: Ratio of tumor to normal breast tissue 
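
The R1 column can be read as a fold change of tumor over normal expression. The following is a minimal sketch of that arithmetic, assuming one expression intensity per tissue type for each probeset. The function name, the `floor` guard against division by zero, and the sample intensities are hypothetical and are not taken from the source.

```python
def tumor_to_normal_ratio(tumor: float, normal: float, floor: float = 1e-9) -> float:
    """Return tumor/normal expression; `floor` (an assumption) avoids division by zero."""
    return tumor / max(normal, floor)


# Hypothetical intensities (tumor, normal); not measurements from the tables.
samples = {"probeset_A": (150.0, 20.0), "probeset_B": (30.0, 30.0)}

ratios = {
    name: round(tumor_to_normal_ratio(tumor, normal), 1)
    for name, (tumor, normal) in samples.items()
}

# A probeset counts as upregulated in this sketch when its ratio exceeds 1.
upregulated = [name for name, ratio in ratios.items() if ratio > 1.0]
```

How intensities are normalized or averaged across samples before taking the ratio is not stated in this section, so the sketch leaves that step out.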



Pkey ExAccn UnigeneID UnigeneTitle R1 

100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 7.5 
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 53.2 
101806 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 8.9 
102455 U48705 Hs.75562 discoidin domain receptor family, member 6.9 
103206 X72755 Hs.77367 monokine induced by gamma interferon 8.8 
105743 BE246502 Hs.9598 sema domain, immunoglobulin domain (Ig), 2.6 
105746 AW151952 Hs.46679 hypothetical protein FLJ20739 1.5 
106373 AW503807 Hs.21907 histone acetyltransferase 1.8 
110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 4.2 
119260 AK001724 Hs.102950 coat protein gamma-cop 3.2 
120206 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 45.7 
120253 AA131376 Hs.326401 fibroblast growth factor 12B 38.9 
120297 AA191384 Hs.104072 ESTs, Weakly similar to Z195_HUMAN ZINC 15.2 
120624 AW407987 Hs.173518 M-phase phosphoprotein homolog 52. 
120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens 463 
120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 6.8 
121508 AA402515 Hs.97887 ESTs 28 
122607 AA453518 Hs.98023 ESTs 61.5 
122616 AA453638 Hs.161673 ESTs 107.3 
122618 AA453641 gb:zx48e06.s1 Soares_testis_NHT Homo sap 31.1 
122829 AW204530 Hs.99500 ESTs 81.8 
122838 AA460584 Hs.334386 ESTs 75.3 
123753 AA609955 Hs.234961 Huntingtin interacting protein E 30.6 
124385 AI267847 gb:aq49a10.x1 Stanley Frontal NB pool 2 57.1 
124860 R65763 Hs.101477 EST 23.9 
124930 AI076343 Hs.173939 ESTs, Weakly similar to ALUB_HUMAN !!!! 22.8 
125051 T79956 Hs.100588 EST 135.3 
128781 N71826 Hs.105465 small nuclear ribonucleoprotein polypept 53.9 
129096 AA463189 Hs.288906 WW Domain-Containing Gene 20.9 
129347 BE614192 Hs.279869 melanoma-associated antigen recognised b 7.6 
129689 AW748482 Hs.77873 B7 homolog 3 2.6 
130503 BE208491 Hs.295112 KIAA0618 gene product 16.1 
130511 L32137 Hs.1584 cartilage oligomeric matrix protein (pse 6.1 
131046 AA321649 Hs.2248 small inducible cytokine subfamily B (Cy 7.4 
131643 AW410601 Hs.30026 HSPC1 82 protein 2.9 
131925 AF151048 Hs.183180 anaphase promoting complex subunit 11 (y 2.7 
132180 NM_004460 Hs.418 fibroblast activation protein, alpha 14.7 
132370 AW572805 Hs.46645 ESTs 28.3 
132994 AA112748 Hs.279905 clone HQ0310 PRO0310p1 17.1 
133016 AI439688 Hs.6289 hypothetical protein FLJ20886 4.4 
133266 AI160873 Hs.69233 zinc finger protein 16.1 
133391 AW103364 Hs.727 inhibin, beta A (activin A, activin AB a 25.5 
134169 AI690916 Hs.178137 transducer of ERBB2, 1 1.2 
134219 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 1.9 
134405 AW067903 Hs.82772 collagen, type XI, alpha 1 72.9 
134529 AW411479 Hs.848 FK506-binding protein 4 (59kD) 2.8 
134975 R50333 Hs.92186 Leman coiled-coil protein 2.6 
135181 BE250865 Hs.279529 px19-like protein 14.9 
322556 BE041451 Hs.177507 hypothetical protein 2.9 
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TABLE 6A 



Table 6A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 6. For 
each probeset, we have listed the gene cluster number from which the oligonucleotides were 
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and 
mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for 
sequences comprising each cluster are listed in the "Accession" column. 



Pkey CAT number Accessions 
124385 656394J AI267847 N27351 

120695 9683 3- AA976503 AI917802 AA953664 AA404613 AA428771 BE280542 AW194691 AI927301 AI740458 AI796100 AI935603 
AW052210 AA970201 AI633384 AA425910 AI017004 AI241295 AA402816 AA291468 
122618 305217J AA453641 AA454061 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
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TABLE 7: Figure 7 from BRCA 001-1 US 


Table 7 shows genes upregulated in tumor tissue compared to normal breast tissue. Open 
reading frames in the sequences have been characterized as having a signal sequence (SS), a 
transmembrane domain (TM) or other. 
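
To illustrate how the ORF classification could be combined with the R1 ratio when screening rows of this table, the following is a minimal sketch. It assumes each row has been reduced to a (Pkey, R1, ORF class) tuple; the function name `select_rows`, the ratio threshold, and the example rows are hypothetical and are not taken from Table 7.

```python
from typing import Iterable, List, Set, Tuple

Row = Tuple[int, float, str]  # (pkey, R1 ratio, ORF class)

# The three classes named in the table description.
ORF_CLASSES = {"SS", "TM", "other"}


def select_rows(rows: Iterable[Row], orf_classes: Set[str], min_ratio: float) -> List[Row]:
    """Keep rows whose ORF class is in `orf_classes` and whose R1 is at least `min_ratio`."""
    unknown = orf_classes - ORF_CLASSES
    if unknown:
        raise ValueError(f"unknown ORF class: {sorted(unknown)}")
    return [row for row in rows if row[2] in orf_classes and row[1] >= min_ratio]


# Hypothetical rows for illustration only.
example_rows = [(1, 1.9, "TM"), (2, 13.5, "other"), (3, 2.0, "SS"), (4, 1.2, "TM")]
selected = select_rows(example_rows, {"SS", "TM"}, min_ratio=1.5)
```

The threshold of 1.5 is arbitrary; the source does not state a cutoff for this table.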



Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
R1: Ratio of tumor to normal breast tissue 
ORF struct info: Structural characterization of open reading frame for the sequence of the gene 



Pkey ExAccn UnigeneID UnigeneTitle 

100113 NM_001269 Hs.84746 chromosome condensation 1 
100114 X02308 Hs.82962 thymidylate synthetase 
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi 
100146 BE185499 Hs.2471 KIAA0020 gene product 
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 
100154 H60720 Hs.81892 KIAA0101 gene product 
100163 W44671 Hs.124 gene predicted from cDNA with a complete 
100220 AW015534 Hs.217493 annexin A2 
100265 D38521 Hs.112396 KIAA0077 protein 
100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgiz 
100275 BE242802 Hs.154797 KIAA0090 protein 
100323 D50920 Hs.23106 KIAA0130 gene product 
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola 
100364 NM_004341 Hs.154868 carbamoyl-phosphate synthetase 2, aspart 
100372 NM_014791 Hs.184339 KIAA0175 gene product 
100393 D84145 Hs.39913 novel RGD-containing protein 
100400 AW954324 Hs.75790 phosphatidylinositol glycan, class C 
100418 D86978 Hs.84790 KIAA0225 protein 
100482 M65028 Hs.81361 heterogeneous nuclear ribonucleoprotein 
100518 NM_004415 Hs.74316 desmoplakin (DPI, DPII) 
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian 
100667 L05424 Hs.169610 CD44 antigen (homing function and Indian 
100668 L05424 Hs.169610 CD44 antigen (homing function and Indian 
100678 AW502935 Hs.740 PTK2 protein tyrosine kinase 2 
100783 AF078847 Hs.191356 general transcription factor IIH, polype 
100892 BE245294 Hs.180789 S164 protein 
100945 AF002225 Hs.180686 ubiquitin protein ligase E3A (human papi 
100969 AA157634 HsJ9172 solute carrier family 25 (mitochondrial 
100988 AK000405 Hs.76480 ubiquitin-like 4 
100999 H38765 Hs.80706 diaphorase (NADH/NADPH) (cytochrome b-5 
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 
101045 J05614 gb:Human proliferating cell nuclear anti 
101077 N99692 Hs.75227 Empirically selected from AFFX single pr 
101093 L06419 Hs.75093 procollagen-lysine, 2-oxoglutarate 5-dio 
101186 AA020956 Hs.179881 core-binding factor, beta subunit 
101216 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 
101228 AA333387 Hs.82916 chaperonin containing TCP1, subunit 6A ( 
101247 AA132666 Hs.78802 glycogen synthase kinase 3 beta 
101249 L18964 Hs.1904 protein kinase C, iota 
101332 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 
101352 AI494299 Hs.16297 COX17 (yeast) homolog, cytochrome c oxid 
101396 BE267931 Hs.78996 proliferating cell nuclear antigen 
101445 M21259 gb:Human Alu repeats in the region 5' to 
101470 NM_000546 Hs.1846 tumor protein p53 (Li-Fraumeni syndrome) 
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activa 
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1.5 


other 


13.5 


other 


5.1 


other 


1.9 


TM 


2.7 


other 


2 


other 


2.6 


other 


3.2 


other 


1.5 


other 


2 


other 
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other 
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other 
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other 
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other 
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other 
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1.5 


other 


6.3 


other 


11.4 
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1.6 


other 


8.4 


other 
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2.6 


other 


1.4 
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2 


TM 


1.8 


other 


1.7 


TM 


1.9 


other 
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other 


5.3 


other 


4.2 


other 


1.9 


TM 


1.6 


TM 


2.5 


other 


5.5 


other 
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01483 M24486 


Hs.76768 


procollagen-proline, 2-oxoglutarate 4-di


2.1 


other 


01540 J04977 


Hs.84981


X-ray repair complementing defective rep 


1.6 


other 


01573 AW248421 


Hs.250758 


proteasome (prosome, macropain) 26S subu 


5.7 
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AB037742 

Y10043 

AL043114 

AK001404 

AW390282 

AB040916 

AW748420 

AF119256

D63078 

AA243837 

AK000933 

AA458882 

NM_003595
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D60341 
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N31256 

AA083069 
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Hs.24336 KIAA1321 protein 

Hs.19114 high-mobility group (nonhistone chromoso 

Hs.22410 ESTs, Weakly similar to A54849 collagen 

Hs.194698 cyclin B2

Hs.31130 transmembrane 7 superfamily member 2

Hs.24106 KIAA1483 protein

Hs.6236 Homo sapiens cDNA: FLJ21487 fis, clone C

Hs.27801 zinc finger protein 278

Hs.186180 Homo sapiens cDNA: FLJ23038 fis, clone L

Hs.57787 ESTs

Hs.28561 Homo sapiens cDNA FLJ10071 fis, clone HE

Hs.79732 fibulin 1

Hs.26350 tyrosylprotein sulfotransferase 2

Hs.22370 Homo sapiens mRNA; cDNA DKFZp564O0122 (f

Hs.184164 ESTs, Moderately similar to S65657 alpha

Hs.184352 hypothetical protein FLJ12549

Hs.239489 TIA1 cytotoxic granule-associated RNA-bi

Hs.21857 ESTs

Hs.293753 Bcl-2-related ovarian killer protein-lik

Hs.27099 hypothetical protein FLJ23293 similar to

Hs.29463 centrin, EF-hand protein, 3 (CDC31 yeast 

Hs.34892 KIAA1323 protein 

Hs.300531 hypothetical protein 

Hs.11197 Homo sapiens, clone IMAGE:3343149, mRNA, 

Hs.9567 GL002 protein 

Hs.222024 transcription factor BMAL2 

Hs.296323 serum/glucocorticoid regulated kinase 

Hs.6294 hypothetical protein DKFZp434L1435 simil 

Hs.11923 hypothetical protein 

Hs.8688 ESTs 

Hs.300700 hypothetical protein FLJ20727

Hs.288971 myeloid/lymphoid or mixed-lineage leukem

Hs.35198 ectonucleotide pyrophosphatase/phosphodi

Hs.23900 GTPase activating protein

Hs.69388 hypothetical protein FLJ20505

Hs.8207 GK001 protein 

Hs.5198 Down syndrome critical region gene 2 

Hs.8687 ESTs 

Hs.7946 KIAA1288 protein 

Hs.25338 ESTs 

Hs.64639 glioma pathogenesis-related protein

Hs.81915 leukemia-associated phosphoprotein p18 (

Hs.34727 ESTs, Moderately similar to I38759 zinc

Hs.315111 nuclear receptor co-repressor/HDAC3 comp

Hs.21198 translocase of outer mitochondrial membr

Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A

Hs.30661 hypothetical protein MGC4606

Hs.193700 Homo sapiens mRNA; cDNA DKFZp586I0324 (f

Hs.96448 zinc finger protein 193

Hs.267632 TATA element modulatory factor 1

Hs.279766 kinesin family member 4A

Hs.296585 nucleolar protein (KKE/D repeat)

Hs.59844 ESTs

Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A

Hs.303055 ESTs, Weakly similar to ALU1_HUMAN ALU S

Hs.47584 potassium voltage-gated channel, delayed

Hs.335952 keratin 6B

Hs.61460 Ig superfamily receptor LNIR

Hs.61712 pyruvate dehydrogenase kinase, isoenzyme

Hs.159971 SWI/SNF related, matrix associated, acti

Hs.59847 ESTs 

Hs.272027 F-box only protein 5 

Hs.161623 ESTs 

Hs.339659 ESTs 

Hs.69476 Homo sapiens cDNA FLJ12758 fis, clone NT

Hs.182685 ESTs

Hs.69507 ESTs 
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108647 BE546947 Hs.44276 homeoboxCIO — - 

108695 AB029000 Hs.70823 KIAA1077 protein

108740 AI089575 Hs.9071 progesterone membrane binding protein

108828 AK001693 Hs.273344 DKFZP564O0463 protein

108859 AL121500 Hs.178904 ESTs

108872 H06720 Hs.111680 endosulfine alpha

108891 AI801235 Hs.48480 ESTs

108894 AK001431 Hs.5105 hypothetical protein FLJ10569

108955 AA149754 Hs.195155 Homo sapiens amino acid transport system

108982 AA151708 Hs.171980 homeo box (expressed in ES cells) 1

108987 AA152178 Hs.23467 hypothetical protein FLJ10633

109002 AB028987 Hs.72134 KIAA1064 protein

109011 AA156542 Hs.72127 ESTs

109026 AA157811 gb:zo35d07.s1 Stratagene colon (937204)

109068 AA164293 Hs.72545 ESTs

109101 AW608930 Hs.52184 hypothetical protein FLJ20618

109112 AW419196 Hs.257924 hypothetical protein FLJ13782

109124 AK000684 Hs.183887 hypothetical protein FUZH04

109139 AJ132592 Hs.59757 zinc finger protein 281 

109166 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines

109198 BE566742 Hs.58169 highly expressed in cancer, rich in leuc

109213 NM_016603 Hs.82035 potential nuclear protein C5ORF5; GAP-li

109220 AW958181 Hs.189998 ESTs

109233 AU077281 Hs.170285 nucleoporin 214kD (CAIN)

109270 N99673 Hs.3585 ESTs, Weakly similar to AF126743 1 DNAJ

109273 AA375752 Hs.82719 Homo sapiens mRNA; cDNA DKFZp586F1822 (f

109313 AF153201 Hs.86276 C2H2 (Kruppel-type) zinc finger protein

109341 AA213506 Hs.115099 EST

109391 AL096858 Hs.184245 KIAA0929 protein Msx2 interacting nuclea

109420 H83603 Hs.40408 homeo box C9

109426 N30531 Hs.42215 protein phosphatase 1, regulatory subuni

109429 AI160029 Hs.61438 ESTs

109445 AA232103 Hs.189915 ESTs

109450 AB032969 Hs.173042 KIAA1143 protein

109468 NM_015310 Hs.6763 KIAA0942 protein

109478 AW074143 Hs.87134 ESTs 

109570 L40027 Hs.118890 glycogen synthase kinase 3 alpha 

109662 F02614 Hs.27319 ESTs

109825 R71264 Hs.16798 ESTs

110039 H11938 Hs.21907 histone acetyltransferase

110056 AA503041 Hs.279009 matrix Gla protein

110085 AA603840 Hs.29956 KIAA0460 protein 

110110 T07353 Hs.37948 ESTs

110129 R51853 Hs.226429 ESTs, Weakly similar to ALU1_HUMAN ALU S 

110154 NM_014521 Hs.17667 SH3-domain binding protein 4

110240 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC

110242 N41744 Hs.19978 CGI-30 protein

110259 H28428 Hs.32406 ESTs, Weakly similar to I38022 hypotheti

110312 BE256986 Hs.11896 hypothetical protein FLJ12089

110501 H55748 gb:yq94a01.s1 Soares fetal liver spleen

110504 H55915 Hs.210859 hypothetical protein FLJ11016

110525 H57330 Hs.37430 EST

110568 AK001160 Hs.5999 hypothetical protein FLJ10298

110699 T97586 Hs.18090 ESTs

110705 AB007902 Hs.32166 KIAA0442 protein

110742 AW190338 Hs.28029 hypothetical protein MGC11256

110761 AL138077 Hs.16157 hypothetical protein FLJ12707

110762 BE044245 Hs.30011 hypothetical protein MGC2963
110765 AK000322 Hs.18457 hypothetical protein FLJ20315

110769 BE000831 Hs.23837 Homo sapiens cDNA FLJ11812 fis, clone HE

110799 AI089660 Hs.323401 dpy-30-like protein

110805 T25829 Hs.24048 FK506 binding protein precursor

110813 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A

110820 R33261 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p

110840 N31598 Hs.12727 hypothetical protein FLJ21610

110844 AI740792 Hs.167531 methylcrotonoyl-Coenzyme A carboxylase 2

110854 BE612992 Hs.27931 hypothetical protein FLJ10607 similar to




0 o 
9.0 


nthor 


71 
/ .0 


ouicr 


AO 


o 
r 


1 Q 


omer 


1.0 


TM 




OUlBl . 


C A 
OA 


omer 


A i 
1.1 


T1LJ 

l WI 


f\ 7 


9 


1 7 


ouier 


D.O 


omer 


4 7 


omer 


1.0 


omer 


0.4 


oiner 


o 
O 


oiner 


1.0 


cc 


3.3 


TM 


1.7


other


2.7


other


3


TM


2.1


other


5.4


other


5.8


other


5.3


other


1.4


other


3


other


1.3


other


3


A

7


1.5


other


2.2


SS


3.1


TM


2


A

7


1.8


other


3.8


other


3.3


other


2


TM


2.1


other


1.4


other


1.3


other


2


other


2.5


other


1.7


other


2.9


other


1.7


SS


4.3


other


4.3


A

7


1.3


other


2.2


other


2.1


other


6.1


A

7


6.1


TM


6.4


other


1.3


?


1.8


other


1.6


TM


7.8


other


2JS


other


9.3


?


5.5


SS


2.1


TM


IjS


TM


6.7


TM


5.7


other


3.4


other


1.7


TM


1.7


other


4.7


other






110856 AA992380 gb:ot37g06.s1 Soares_testis_NHT Homo sap

110885 BE384447 Hs.16034 hypothetical protein MGC13186

110897 AL117430 Hs.6880 DKFZP434D156 protein

110915 BE092285 Hs.29724 hypothetical protein FLJ13187

110918 H04360 Hs.24283 ESTs, Moderately similar to reduced expr

110958 NM_005864 Hs.24587 signal transduction protein (SH3 contain

110963 AK002180 Hs.11449 DKFZP564O123 protein

110981 AK001980 Hs.24284 ADP-ribosyltransferase (NAD+; poly(ADP-r

110984 AW613287 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:polyp

111125 N63623 Hs.269115 ESTs, Moderately similar to Z195_HUMAN Z

111132 AB037807 Hs.83293 hypothetical protein

111164 N46160 Hs.122489 Homo sapiens cDNA FLJ13289 fis, clone OV

111172 R67419 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT

111174 AL050166 Hs.26295 Homo sapiens mRNA; cDNA DKFZp586D1122 (f

111179 AK000136 Hs.10760 asporin (LRR class 1)

111184 AI815486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 

111189 N67603 Hs.272130 ESTs, Weakly similar to S65824 reverse t 

111216 AW139408 Hs.152940 ESTs 

111221 AB037782 Hs.15119 KIAA1361 protein 

111223 AA852773 Hs.334838 KIAA1866 protein

111239 N90956 Hs.17230 hypothetical protein FLJ22087

111285 AA778711 Hs.4310 eukaryotic translation initiation factor

111299 AB033091 Hs.74313 KIAA1265 protein

111312 AI523913 Hs.34504 ESTs

111318 T99755 Hs.334728 ESTs

111337 AA837396 Hs.263925 LIS1-interacting protein NUDE1, rat homo

111352 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL

111370 AI478658 Hs.94631 brefeldin A-inhibited guanine nucleotide 

111384 N94606 Hs.288969 HSCARG protein 

111389 AK000987 Hs.169111 oxidation resistance 1

111452 R02354 Hs.15999 ESTs

111486 AI051194 Hs.227978 EST

111549 W90638 Hs.20321 ESTs, Moderately similar to ZRF1_HUMAN Z

111585 R10720 Hs.20670 EST

111627 R52656 Hs.21691 ESTs

111870 AB037834 Hs.18685 Homo sapiens mRNA for KIAA1413 protein,

111937 BE298665 Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (fr

111944 AW083791 Hs.21263 suppressor of potassium transport defect

111987 NM_015310 Hs.6763 KIAA0942 protein

112134 R41823 Hs.7413 ESTs; calsyntenin-2

112244 AB029000 Hs.70823 KIAA1077 protein 

112388 R46071 Hs.301693 Homo sapiens, clone IMAGE:3638994, mRNA, 

112456 NM_016248 Hs.232076 A kinase (PRKA) anchor protein 11

112464 AW007287 Hs.28538 Homo sapiens cDNA: FLJ21086 fis, clone C

112506 AI742756 Hs.26079 ESTs

112513 R68425 Hs.13809 hypothetical protein FLJ10648

112752 AK001635 Hs.14838 hypothetical protein FLJ10773

112884 AK000004 Hs.5013 Homo sapiens mRNA for FLJ00004 protein,

112923 T10258 Hs.5037 EST

112936 AW970828 Hs.6185 KIAA1557 protein

112958 R61388 Hs.6724 ESTs

112966 Z44718 Hs.102548 glucocorticoid receptor DNA binding fact

112978 AK000272 Hs.7099 hypothetical protein FLJ20265

112995 AA737033 Hs.7155 ESTs, Moderately similar to 2115357A TYK

112996 BE276112 Hs.7165 zinc finger protein 259

113047 AI571940 Hs.7549 ESTs 

113049 AW965190 Hs.7560 Homo sapiens mRNA for KIAA1729 protein,

113089 T40707 Hs.270862 ESTs

113196 T57317 gb:yb51a03.s1 Stratagene fetal spleen (9

113248 T63857 gb:yc16e01.s1 Stratagene lung (937210) H

113254 AK002160 Hs.11449 DKFZP564O123 protein

113277 AW971049 Hs.11774 protein (peptidyl-prolyl cis/trans isome

113429 AA688021 Hs.179808 ESTs

113499 AI467908 Hs.8882 ESTs

113547 H59588 Hs.15233 ESTs

113647 AA813887 Hs.188173 Homo sapiens cDNA FLJ12187 fis, clone MA

113702 T97307 gb:ye53h05.s1 Soares fetal liver spleen
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gb:zt2yi02^1 Soares ovary tumor nohui n 
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hypothetical protein FU11301 
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Hs.122579 


hypothetical protein FU10461 
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15400 AI215069 


Hs.89113 


ESTs 
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Hs.48499 


tumor antigen SLP<8p 
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15471 AK001376 


Hs.59346 


hypothetical protein FU10514 
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15479 AW301608 
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ESTs, Moderately similar to 154374 gene 
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eukaryotic translation initiation factor 
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ESTs 
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115655 AL048269 Hs.288544 Homo sapiens, clone MGC:16063, mRNA, com 

115663 AI138785 Hs.40507 ESTs 

115676 AA953006 Hs.88143 ESTs 

115690 AA625132 Hs.44159 hypothetical protein FLJ21615

115693 AF231023 Hs.55173 cadherin, EGF LAG seven-pass G-type rece

115715 BE395161 Hs.1390 proteasome (prosome, macropain) subunit,

115734 AI950339 Hs.40782 ESTs

115811 NM_015434 Hs.48604 DKFZP434B168 protein

115823 AI732742 Hs.87440 ESTs

115837 AI675217 Hs.42761 ESTs

115844 AI373052 Hs.332938 hypothetical protein MGC5370

115866 AW062629 Hs.52081 KIAA0867 protein

115875 N55669 Hs.333823 mitochondrial ribosomal protein L13

115941 AI867451 Hs.46679 hypothetical protein FLJ20739

115968 AB037753 Hs.62767 KIAA1332 protein

116003 BE275469 Hs.66493 Down syndrome critical region gene 5

116011 AL359053 Hs.57664 Homo sapiens mRNA full length insert cDN

116108 AA770688 Hs.28777 H2A histone family, member L

116134 BE243834 Hs.50441 CGI-04 protein

116189 N35719 Hs.44749 ESTs, Moderately similar to T00358 hypot

116195 AW821113 Hs.72402 ESTs

116238 AV660717 Hs.47144 DKFZP586N0819 protein

116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6

116262 AI936442 Hs.59838 hypothetical protein FLJ10808

116298 AI955411 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PL

116318 AF097645 Hs.58570 deleted in cancer 1; RNA helicase HDB/DI

116325 AI472106 Hs.49303 Homo sapiens cDNA FLJ11663 fis, clone HE

116336 AL133033 Hs.4084 KIAA1025 protein

116339 AK000290 Hs.44033 dipeptidyl peptidase 8

116350 AA497129 Hs.184771 nuclear factor I/C (CCAAT-binding transc

116358 AI149586 Hs.38125 interferon-induced protein 75, 52kD

116365 N50174 Hs.46765 ESTs

116368 N90466 Hs.71109 KIAA1229 protein

116417 AW499664 Hs.12484 Human clone 23826 mRNA sequence

116436 AA161411 Hs.58668 chromosome 21 open reading frame 57

116462 AF218313 Hs.236828 putative helicase RUVBL

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4

116575 AA312572 Hs.6241 phosphoinositide-3-kinase, regulatory su

116637 AK001043 Hs.92033 integrin-linked kinase-associated serine

116640 X89984 Hs.211563 B-cell CLL/lymphoma 7A

116700 AI800202 Hs.317589 hypothetical protein MGC10765

116705 AW074819 Hs.12313 hypothetical protein FLJ14566

116732 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti

116926 H73608 Hs.290830 ESTs

117034 U72209 Hs.180324 YY1-associated factor 2

117132 AI393666 Hs.42315 p10-binding protein

117247 N21032 gb^x46f06^1 Soares melanocyte 2NbHM Ho 

117276 N71183 Hs.121806 Homo sapiens cDNA FLJ11971 fis, clone HE

117284 AK001701 Hs.183779 Homo sapiens cDNA FLJ10590 fis, clone NT

117367 AI041793 Hs.42502 ESTs

117368 AI878942 Hs.90336 ATPase, H+ transporting, lysosomal (vacu

117382 AF150275 Hs.40173 ESTs

117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic

117557 AF123050 Hs.44532 diubiquitin

117588 N34895 Hs.44648 ESTs

117745 BE294925 Hs.46680 CGI-12 protein

117754 AA121673 Hs.59757 zinc finger protein 281

117879 N54706 Hs.303025 chromosome 11 open reading frame 24

117904 BE540675 Hs.332938 hypothetical protein MGC5370

117911 AL137379 Hs.47125 hypothetical protein FLJ13912

117933 Y10518 Hs.116470 hypothetical protein FLJ20048

117983 AL110246 Hs.47367 KIAA1785 protein

118078 N54321 Hs.47790 EST

118301 AA453902 Hs.293264 ESTs

118429 AA243332 Hs.74649 cytochrome c oxidase subunit VIc

118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3

118488 AJ277275 Hs.50102 rapa-2 (rapa gene)
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118656 
118670 
118698 
118737 
118925 
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118986 
119206 
119235 
119265 
119279 
119298 
119338 
119403 
119478 
119486 
119513 
119601 
119602 
119676 
119682 
119774 
119780 
119789 
119805 
119818 
119863 
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120132 
120206 
120248 
120269 
120274 
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120296 
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N22617 

AI949952 

AI458020 

AA332845 

AB033113 

AA199686 

N92293 

AI668709

AF148713 

W24781 

AW453069 

BE539706 

N57568

NM_001241

AI417240 

AL117554

AI624342 

AI796730 

W37933 

AK000155 

AW675298 

AA243837 

W61019 

AB032977 

NM_016625

BE393948 

AJ223810 

AA130970 

AA081218 

AW449064 

AA703129 

W57554 

H26735 

AI924294

AW131940 

AA177051 

AA190577 

AW995911 

AA191384 

AA195517

AA195651

AK000292 

N85785 

AW450669 

AA210722 

AW969481 

R06859 

AF000545 

AA219305 

AA228026 

AL109963 

AW969665 

AA232874 

AW967985 

AA134006 
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AA236453 

AI950087 

AA251973 

AA253170 

AA256837 

BE047718 

AA258601 

BE350244 

AA279160 

AA280679 



Hs.43228 

Hs.49397 

Hs.293287 

Hs.152618 

Hs.50187 

Hs.206832 

Hs.240722 

Hs.125830 

Hs.293798 

Hs.3657 

Hs.285363 

Hs.48028 

Hs.155478 

Hs.320836 

Hs.119908 

Hs.170042 

Hs.55513 

Hs.91684 

Hs.233694 

Hs.57787 

Hs.57811 

Hs.6298 

Hs.191381 

Hs.50915 

Hs.43213 

Hs.58382 

Hs.58608 

Hs.119571 

Hs.58963 

Hs.125019 

Hs.91668 

Hs.173259 

Hs.104030 



Hs.299883 

Hs.104072 

Hs.191643 

Hs.104106 

Hs.278732 

Hs.181165 

Hs.45068 

Hs.104158 

Hs.55189

Hs.193172 

Hs.296433 

Hs.104196 

Hs.38774 

Hs.123122 

Hs.154848 

Hs.104245 

Hs.325572 

Hs.79306

Hs.96427 

Hs.26613 

Hs.18978 

Hs.269988 
Hs.96473 

Hs.96545 

Hs.161731 

Hs.96547 

Hs.111407 

Hs.271445 



Homo sapiens cDNA FLJ11835 fis, clone HE

ESTs

ESTs

ESTs, Moderately similar to ZN91_HUMAN Z
KIAA1287 protein

gb:zq75g09.r1 Stratagene hNT neuron (937
ESTs, Moderately similar to ALU8_HUMAN A 
ESTs, Moderately similar to ALU8_HUMAN A 



KIAA1710 protein 

activity-dependent neuroprotective prote

ESTs

EST

cyclin T2

ESTs, Weakly similar to A47582 B-cell gr 
nucleolar protein NOP5/NOP58 
ESTs 
ESTs 

Empirically selected from AFFX single pr 

Homo sapiens mRNA; cDNA DKFZp667I103 (fr

hypothetical protein FLJ11350

ESTs 

ESTs 

KIAA1151 protein 
hypothetical protein 

kallikrein 5 (KLK5; KLK-L2; stratum corn

ESTs, Weakly similar to IEFS_HUMAN TRANS

hypothetical protein FLJ11101

Homo sapiens cDNA FLJ14206 fis, clone NT

collagen, type III, alpha 1 (Ehlers-Danl 

ESTs 

lymphoid nuclear protein (LAF-4) mRNA
Homo sapiens clone PP1498 unknown mRNA
uncharacterized bone marrow protein BM03 
ESTs 

gb:hc02a02.s1 NCI_CGAP_Pr3 Homo sapiens 

gb:zp52g02.s1 Stratagene HeLa cell s3 93

hypothetical protein FLJ23399

ESTs, Weakly similar to Z195_HUMAN ZINC

ESTs 

ESTs 

hypothetical protein FLJ20285
eukaryotic translation elongation factor
hypothetical protein DKFZp434I143
ESTs

hypothetical protein

ESTs, Weakly similar to I38022 hypotheti

putative purinergic receptor 

EST 

ESTs 

FSH primary response (LRPR1, rat) homolo
hypothetical protein DKFZp434D0127
ESTs

ESTs, Moderately similar to ALU7_HUMAN A
eukaryotic translation initiation factor
KIAA1013 protein

Homo sapiens mRNA; cDNA DKFZp586F1323 (f
Homo sapiens cDNA: FLJ22822 fis, clone K
gb:wq05c02.x1 NCI_CGAP_Kid12 Homo sapien
ESTs
EST

gb:zr84d10.s1 Soares_NhHMPu_S1 Homo sapi

ESTs 

EST 

ESTs 

Homo sapiens, clone IMAGE:3613029, mRNA,
ESTs, Weakly similar to ALU1_HUMAN ALU S
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120582 BE244830 Hs.284228 2NF135-Bke protein 

120590 AW372799 Hs.125790 leucine-rich repeat-containing 2

120598 AA282074 Hs.237323 N-acetylglucosamine-phosphate mutase

120619 AW965339 Hs.111471 ESTs

120624 AW407987 Hs.173518 M-phase phosphoprotein homolog

120639 AA286942 gb:zs56f05.s1 NCI_CGAP_GCB1 Homo sapiens

120648 AA287095 Hs.140309 Homo sapiens, clone IMAGE:3677194, mRNA,

120653 AW063659 Hs.191649 ESTs 

120668 AW969638 Hs.112318 6.2 kd protein 

120669 BE536739 Hs.109909 ESTs 

120695 AA976503 gb:oq30a04.s1 NCI_CGAP_GC4 Homo sapiens

120696 AI821539 Hs.97249 ESTs

120713 AW449855 Hs.96557 Homo sapiens cDNA FLJ12727 fis, clone NT

120718 AA292747 Hs.97296 ESTs

120750 AM91410 Hs.96693 ESTs, Moderately similar to 2109260A Be

120774 AI608909 Hs.193985 ESTs 

120807 AA346385 Hs.30002 SH3-containing protein SH3GLB2; KIAA1848 

120809 AA346495 gb:EST52657 Fetal heart II Homo sapiens 

120938 AA386260 Hs.104632 EST 

120977 AA398155 Hs.97600 ESTs 

120984 BE262951 Hs.9$052 ESTs 

120985 AI219896 Hs.97592 ESTs 
121011 AA398360 Hs.97608 EST 
121C^6 AI439713 Hs.165295 ESTs 

121081 AA398721 Hs.186749 ESTs, Highly similar to I37550 mismatch

121133 AA363307 Hs.97032 ESTs

121176 AL121523 Hs.97774 ESTs

121223 AI002110 Hs.97169 ESTs, Weakly similar to dJ667H12.2.1 [H.

121320 AA403008 Hs.301927 c6.1A

121340 AW956981 Hs.97910 Homo sapiens cDNA FLJ13383 fis, clone PL

121408 AA406137 Hs.98019 EST

121439 AA410190 Hs.98076 ESTs, Weakly similar to A47582 B-cell gr

121450 AA406430 Hs.105362 Homo sapiens, clone MGC:18257, mRNA, com

121452 AW971063 Hs.292882 ESTs

121455 H58306 Hs.15165 retinoic acid induced 14

121457 W07404 Hs.144502 hypothetical protein FLJ22055

121496 AA442224 Hs.97900 ESTs 

121505 AA494172 Hs.194417 ESTs 

121508 AA402515 Hs.97887 ESTs 

121513 AA416653 Hs.181510 ESTs 

121514 AA412112 gb:zt69b02.s1 Soares_testis_NHT Homo sap
121549 AA412477 Hs.98142 EST

121558 AA412497 gb:zt95g12.s1 Soares_testis_NHT Homo sap

121577 AA411970 Hs.98096 EST

121581 AA416568 gb:zu05c10.s1 Soares_testis_NHT Homo sap

121589 AD001528 Hs.89718 spermine synthase

121594 AA626010 Hs.98247 ESTs

121622 AA416931 Hs.126065 ESTs

121655 AA421537 Hs.178072 Homo sapiens mRNA; cDNA DKFZp434B1023 (f

121682 AA418160 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PL

121690 AV660305 Hs.110286 ESTs

121706 U55184 Hs.154145 hypothetical protein FLJ11585

121714 AA419225 Hs.98269 Homo sapiens cDNA FLJ11953 fis, clone HE

121729 AI949597 Hs.98325 ESTs 

121731 AA421041 Hs.180744 ESTs 

121744 AA398784 Hs.97514 ESTs 

121748 BE536911 Hs.234545 hypothetical protein NUF2R 

121773 AB033022 Hs.158654 KIAA1196 protein

121775 AA421773 Hs.161008 ESTs

121776 AA292579 Hs.125133 hypothetical protein FLJ22501
121786 AI810774 Hs.98376 ESTs

121832 AW340797 Hs.98434 ESTs

121836 AA328348 Hs.218289 ESTs

121839 AA425691 Hs.191606 ESTs, Highly similar to KIAA1048 protein

121842 AF027406 Hs.104865 serine/threonine kinase 23 

121847 AA446628 Hs.2799 cartilage linking protein 1 

121871 AW972668 Hs.293044 ESTs 
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131971 
131977 
131985 
131991 
132019 
132062 
132084 
132103 
132105 
132116 
132176 
132180 
132194 
132207 
132235 
132252 
132266 
132273 
132276 
132288 
132294 
132298 
132299 
132325 
132370 
132374 



AA732153 

AU076408 

BE268278 

AW966881 

AL355715 

NM_003512

T93500 

AL389951 

BE393822 

R78195 

AB037791 

AW410601 

AW960597 

AI218918 

X52486 

BE559681 

AA642831 

D13757 

AK001641 

AI878932 

AA382590 

D87077 

AW966127 

BE501849 

X86098 

U20536

U28838 

AI251317 

AA083764 

BE502341 

AA099014 

AF078866 

AA179298 

AW207440 

AA025976 

AF151048 

BE541211 

BE252983 

AA355113 

AK000046 

W79283 

BE567100 

U90441 

AA503020 

AF053306 

H56995 

BE266155 

NM_002267

BE171921 

AV646076 

AW960474 

AA857025 

NM_004460

R42432 

BE206939 

AV658411 

AI566004

AA301228 

AA227710 

AA653507 

N36110 

AB023191 

NM_015986

AW405882 

N37065 

AW572805 

AF155582 



Hs.27865 Homo sapiens cDNA: FLJ21333 fis, clone C

Hs.28309 UDP-glucose dehydrogenase

Hs.28393 hypothetical protein MGC2592

Hs.41639 programmed cell death 2

Hs.28555 programmed cell death 9 (PDCD9)

Hs.28777 H2A histone family, member L

Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL

Hs.271623 nucleoporin 50kD

Hs.29645 Homo sapiens mRNA; cDNA DKFZp761C029 (fr

Hs.29692 Homo sapiens cDNA FLJ11436 fis, clone HE

Hs.29716 hypothetical protein FLJ10980

Hs.30026 HSPC182 protein

Hs.30164 ESTs

Hs.30209 KIAA0854 protein

Hs.3041 uracil-DNA glycosylase 2

Hs.30736 KIAA0124 protein

Hs.31016 putative DNA binding protein

Hs.311 phosphoribosyl pyrophosphate amidotransf

Hs.31323 inhibitor of kappa light polypeptide gen

Hs.317 topoisomerase (DNA) I

Hs.170980 KIAA0948 protein

Hs.196275 KIAA0240 protein

Hs.32246 Homo sapiens cDNA FLJ14656 fis, clone NT

Hs.32317 high-mobility group 20B

Hs.301449 adenovirus 5 E1A binding protein 

Hs.3280 caspase 6, apoptosis-related cysteine pr 

Hs.32935 TATA box binding protein (TBP)-associate

Hs.33184 ESTs

Hs.6101 hypothetical protein MGC3178

Hs.3402 ESTs

Hs.231029 Homo sapiens, clone MGC:15961, mRNA, com

Hs.284296 Homo sapiens cDNA: FLJ22993 fis, clone K

Hs.3439 stomatin-like 2

Hs.185973 degenerative spermatocyte (homolog Droso

Hs.34569 ESTs

Hs.183180 anaphase promoting complex subunit 11 (y

Hs.34804 Homo sapiens cDNA FLJ11472 fis, clone HE

Hs.35086 ubiquitin specific protease 1

Hs.35380 x 001 protein

Hs.267448 hypothetical protein FLJ20039

Hs.35962 ESTs

Hs.154938 hypothetical protein MDS025

Hs.3622 procollagen-proline, 2-oxoglutarate 4-di

Hs.36563 hypothetical protein FLJ22416

Hs.36708 budding uninhibited by benzimidazoles 1

Hs.37372 Homo sapiens DNA binding peptide mRNA, p

Hs.3832 clathrin-associated protein AP47

Hs.3886 karyopherin alpha 3 (importin alpha 4)

Hs.3991 ESTs 

Hs.39959 ESTs 

Hs.40289 ESTs 

Hs.8878 kinesin-like 1

Hs.418 fibroblast activation protein, alpha

Hs.4212 ESTs

Hs.42287 E2F transcription factor 6

Hs.42656 KIAA1681 protein

Hs.141269 Homo sapiens cDNA: FLJ21550 fis, clone C

Hs.43299 hypothetical protein FLJ12890

Hs.43658 DKFZP586L151 protein

Hs.285711 hypothetical protein FLJ13089

Hs.305971 solute carrier family 2 (facilitated glu

Hs.44131 KIAA0974 protein

Hs.7120 cytokine receptor-like molecule 9

Hs.44205 cortistatin

Hs.44856 hypothetical protein FLJ12116

Hs.46645 ESTs

Hs.46744 core1 UDP-galactose:N-acetylgalactosamin
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132376 

132384 

132393 

132450 

132452 

132456 

132470 

132484 

132518 

132530 

132532 

132534 

132543 

132574 

132596 

132611 

132612 

132616 

132638 

132668 

132692 

132715 

132718 

132724 

132731 

132744 

132760 

132771 

132773 

132784 

132798 

132807 

132810 

132813 

132815 

132817 

132821 

132833 

132842 

132844 

132851 

132869 

132873 

132875 

132891 

132897 

132902 

132912 

132913 

132940 

132942 

132952 

132962 

132972 

132973 

132977 

132980 

132994 

133012 

133015 

133062 

133069 

133091 

133110 

133134 

133145 

133152 



AI279892 

AA312135 

AL135094 

AA100012

AW973521

AB011084 

AI224456 

X16660 

AW885606 

AA306105 

AA454132 

BE388673 

BE568452 

AW631437 

AK001484 

AA345547 

H12751 

BE262677

AI796870 

AB018319 

AW191962 

F11875

NM_004600

AI142265 

AI189075 

AA010233 

AA125985 

Y10275 

AA459713 

AI142133 

AI026701 

U07418 

AB007944 

BE313625 

AI815189 

N27852

AJ251595 

U78525 

NM_016154

F12200 

U09716 

AW963217 

AW007683

NM_004850

BE267143 

AW503667 

AI936442 

AW732760 

W78714 

T79136 

AA554458 

AI658580 

AA576635 

AA034365 

AA035446 

AA093322 

AA040696 

AA112748

AA847843 

AJ002744 

AW500374 

BE247441 

AK001628 

AA808177 

AF198620 

H94227 

Z11695



Hs.46801 sorting nexin 14

Hs.46967 HSPC034 protein

Hs.47334 hypothetical protein FLJ14495

Hs.48827 hypothetical protein FLJ12085

Hs.247324 mitochondrial ribosomal protein S14

Hs.48924 KIAA0512 gene product; ALEX2

Hs.4934 H.sapiens polyA site DNA

Hs.119007 RAB4, member RAS oncogene family

Hs.5064 ESTs

Hs.50785 SEC22, vesicle trafficking protein (S. c 

Hs.5080 mitochondrial ribosomal protein L16 

Hs.5086 hypothetical protein MGC10433 

Hs.5101 protein regulator of cytokinesis 1 

Hs.5184 TH1 drosophila homolog 

Hs.5298 CGI-45 protein 

Hs.53263 hypothetical protein FLJ13287

Hs.5327 PRO1914 protein

Hs.283558 hypothetical protein PRO1855

Hs.54277 DNA segment on chromosome X (unique) 992 

Hs.5460 KIAA0776 protein 

Hs.249239 collagen, type VIII, alpha 2 

Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT

Hs.554 Sjogren syndrome antigen A2 (60kD, ribon

Hs.55498 geranylgeranyl diphosphate synthase 1

Hs.301872 hypothetical protein MGC4840

Hs.55921 glutamyl-prolyl-tRNA synthetase

Hs.56145 thymosin, beta, identified in neuroblast 

Hs.56407 phosphoserine phosphatase 

Hs.295901 KIAA0493 protein

Hs.56845 GDP dissociation inhibitor 2

Hs.5716 KIAA0310 gene product

Hs.57301 mutL (E. coli) homolog 1 (colon cancer,

Hs.5737 KIAA0475 gene product

Hs.57435 solute carrier family 11 (proton-coupled

Hs.57475 sex comb on midleg homolog 1

Hs.57553 tousled-like kinase 2

Hs.169610 CD44 antigen (homing function and Indian

Hs.57783 eukaryotic translation initiation factor

Hs.279771 Homo sapiens clone PP1596 unknown mRNA

Hs.5811 chromosome 21 open reading frame 59 

Hs.287912 lectin, mannose-binding, 1 

Hs.203961 ESTs, Moderately similar to AF116721 89

Hs.58598 KIAA1266 protein

Hs.58617 Rho-associated, coiled-coil containing p

Hs.59271 U2(RNU2) small nuclear RNA auxiliary fac

Hs.59545 ring finger protein 15

Hs.59838 hypothetical protein FLJ10808

Hs.167578 Homo sapiens cDNA FLJ11095 fis, clone PL

Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL

Hs.127243 Homo sapiens mRNA for KIAA1724 protein,

Hs.197751 KIAA0666 protein

Hs.61426 Homo sapiens mesenchymal stem cell prote

Hs.6153 CGI-48 protein

Hs.288924 Homo sapiens cDNA FLJ11392 fis, clone HE

Hs.323277 ESTs 

Hs.301404 RNA binding motif protein 3 

Hs.62016 ESTs 

Hs.279905 clone HQ0310 PRO0310p1 

Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA

Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp

Hs.64056 PRO0149 protein 

Hs.6430 protein with polyglutamine repeat; calci 

Hs.64691 KIAA0483 protein 

Hs.65228 ESTs 

Hs.65648 RNA binding motif protein 8A 

Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA,

Hs.324473 mitogen-activated protein kinase 1
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133174 
133175 
133177 
133208 
133226 
133228 
133254 
133268 
133291 
133314 
133321 
133327 
133347 
133360 
133366 
133367 
133370 
133390 
133391 
133394 
133437 
133452 
133453 
133500 
133529 
133543 
133578 
133579 
133582 
133594 
133595 
133599 
133621 
133627 
133631 
133649 
133690 
133720 
133722 
133751 
133757 
133760 
133765 
133780 
133797 
133822 
133842 
133845 
133865 
133867 
133868 
133922 
133924 
133929 
133936 
133941 
133959 
133976 
133989 
133997 
134010 
134015 
134070 
134110 
134129 
134134 
134200 



AA431620 Hs.324178 hypothetical protein MGC2745 

AW955632 Hs.66666 ESTs, Weakly similar to S19560 proline-r

X97795 Hs.66718 RAD54 (S.cerevisiae)-like

AI801777 Hs.6774 ESTs

AW954569 Hs.296287 Homo sapiens, Similar to bromodomain-con

AI492924 Hs.6831 golgi phosphoprotein 1

AI567421 Hs.273330 Homo sapiens, clone IMAGE:3544662, mRNA,

AW956781 Hs.293937 ESTs, Weakly similar to FXD2_HUMAN FORKH 

BE297855 Hs.69855 NRAS-related gene 

AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A recepto 

T79526 Hs.179516 integral type I protein

AL390127 Hs.7104 Kruppel-like factor 13

BE257758 Hs.71475 acid cluster protein 33

AI016521 Hs.71816 v-akt murine thymoma viral oncogene homo

AA292811 Hs.72050 non-metastatic cells 5, protein expresse

AF231919 Hs.16759 KIAA0539 gene product

AF245505 Hs.72157 DKFZP564I1922 protein

AI950382 Hs.72660 phosphatidylserine receptor

AW103364 Hs.727 inhibin, beta A (activin A, activin AB a

AA305127 Hs.237225 hypothetical protein HT023 

AL031591 Hs7370 phosphotidylinositol transfer protein, b • 

NM.002759 Hs.274382 protein kinase, Interferon-inducible dou 

AI659306 Hs73826 protein tyrosine phosphatase, non-recept 

AW964804 Hs74280 hypothetical protein FU22237 

W45623 Hs74571 ADP-ribosylation factor 1 

AU077073 Hs.108327 damage-specific DNA binding protein 1 (1 

AU077050 Hs75066 transfin 

X75346 Hs.75074 rnrtogen-activated protein kinase-adivat 

BE391579 Hs75087 Fas-activated serine/lhreonlne kinase 

AW160781 Hs.172589 nudear phosphoproteln similar to S. car 

AA393273 Hs75133 transcription factor6-like 1 (mitochond 

NNL002885 Hs751 51 RAP1 , GTPase activating protein 1 

NM.004893 Hs75258 H2A histone family, member Y 

NM.002047 Hs75280 glycyMRNA synthetase 

NM-000401 Hs75334 exostoses (multiple) 2 

U25849 Hs75393 acid phosphatase 1 , soluble 

AV661 185 Hs75574 mitochondrial ribosomal protein L19 

L27841 Hs75737 pericenWolar material 1 

AW969976 Hs.279009 matrix Gla protein 

Hs.334787

Hs.196209 RAE1 (RNA export 1, S.pombe) homolog

Hs.181357 laminin receptor 1 (67kD, ribosomal prot

Hs.75929 cadherin 11, type 2, OB-cadherin (osteob

Hs.76152 decorin

Hs.76272 retinoblastoma-binding protein 2

Hs.699 peptidylprolyl isomerase B (cyclophilin

Hs.285013 putative human HLA class II associated p

Hs.76704 ESTs

Hs.170290 discs, large (Drosophila) homolog 5

Hs.76989 KIAA0097 gene product

Hs.183874 cullin 4A

Hs.77608 splicing factor, arginine/serine-rich 9

Hs.325948 vesicle docking protein p115

Hs.211602 SMC1 (structural maintenance of chromoso

Hs.77719 gamma-glutamyl carboxylase

Hs.77770 adaptor-related protein complex 3, mu 2

Hs.77897 splicing factor 3a, subunit 3, 60kD

Hs.169946 GATA-binding protein 3 (T-cell receptor

Hs.78202 SWI/SNF related, matrix associated, acti

Hs.78281 regulator of G-protein signalling 12

Hs.197114 RNA binding protein; AT-rich element bin

Hs.278569 sorting nexin 17

Hs.78946 cullin 3

Hs.79136 LIV-1 protein, estrogen regulated

Hs.79305 KIAA0255 gene product

Hs.173328 protein phosphatase 2, regulatory subuni

Hs.197803 KIAA0160 protein
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T52946 

BE271766 
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AL133921 

D50525 

AW797468 

AA147026 

AB011155 

AW340125 

AB012193 

U30825 

D86326 

NM_006306

L17128 

BE244332 

X81789 

AI908165 

AL040328 

AI824113 

AB016092 

D31764 

NM_003590

U41060 

NM_014742

H86504 

BE559598 
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AF107463 

NM_000402

BE300078 

AI878910

AI906291 

AW502505 

U61397 

AB029023 

AW903838 

N92036 

NM_004922

AW291946 

AA339449 

X06560 

AW362124 

AI589941 

AA417383 

AA456539 

AW067903 

BE272095 

AI750762 

AU077196 

Z44190 

AA112036 

M58603 

X54942 

NM_005000

X82153 

AW246273 

AA425473 

AK001571 

BE091005 

AW411479 

BE244323 

AA927177 

AW068223 

AF035119 

X78520 

AK001741 

BE391929 

U62317 

NM_003474 

BE161887 

Y14768 

AA852985 

AF129536 

X07871 

AW630803

BE002798 

AW451370 

AI701162 

BE268328 

AI879195 

AW885909 

AW401361 

AI097346 

R50333 

AB037835 

NM_000408

AW301984 

AW503733 

AB036063 

AF027219 

AA081258 

AI093155 

BE250865 

AA477514 



Hs.79968 splicing factor 30, survival of motor ne

Hs.80206 glucose-6-phosphate dehydrogenase

Hs.80449 Homo sapiens, clone IMAGE:3535294, mRNA,

Hs.3688 cisplatin resistance-associated overexpr

Hs.81234 immunoglobulin superfamily, member 3

Hs.81360 Homo sapiens cDNA: FLJ21927 fis, clone H

Hs.81424 ubiquitin-like 1 (sentrin)

Hs.179946 KIAA1100 protein

Hs.81800 chondroitin sulfate proteoglycan 2 (vers

Hs.81848 RAD21 (S. pombe) homolog

Hs.81964 SEC24 (S. cerevisiae) related gene famil

Hs.82065 interleukin 6 signal transducer (gp130,

Hs.82285 phosphoribosylglycinamide formyltransfer

Hs.82396 2',5'-oligoadenylate synthetase 1 (40-46

Hs.323193 hypothetical protein MGC3222

Hs.8254 Homo sapiens, Similar to tumor different

Hs.82582 integrin, beta-like 1 (with EGF-like rep

Hs.8262 lysosomal

Hs.82772 collagen, type XI, alpha 1

Hs.167791 reticulocalbin 1, EF-hand calcium bindin

Hs.82911 protein tyrosine phosphatase type IVA, m

Hs.82985 collagen, type V, alpha 2

Hs.83023 peroxisomal biogenesis factor 11B

Hs.83419 KIAA0252 protein

Hs.83428 nuclear factor of kappa light polypeptid

Hs.83758 CDC28 protein kinase 2

Hs.83916 Empirically selected from AFFX single pr

Hs.83942 cathepsin K (pycnodysostosis)

Hs.84131 threonyl-tRNA synthetase

Hs.84429 KIAA0971 protein

Hs.273357 hypothetical protein FLJ10709

Hs.74861 activated RNA polymerase II transcriptio

Hs.848 FK506-binding protein 4 (59kD)

Hs.85951 exportin, tRNA (nuclear export receptor

Hs.86041 CGG triplet repeat binding protein 1

Hs.171581 ubiquitin C-terminal hydrolase UCH37

Hs.8700 deleted in liver cancer 1

Hs.174139 chloride channel 3

Hs.8739 hypothetical protein FLJ10879

Hs.8752 transmembrane protein 4

Hs.88251 arylsulfatase A

Hs.8850 a disintegrin and metalloproteinase doma

Hs.88799 anaphase-promoting complex subunit 10

Hs.890 lysosomal

Hs.89232 chromobox homolog 5 (Drosophila HP1 alph

Hs.284226 F-box only protein 6

Hs.89476 CD2 antigen (p50), sheep red blood cell

Hs.89497 lamin B1

Hs.287850 integral membrane protein 1

Hs.8991 adaptor-related protein complex 1, gamma

Hs.90207 hypothetical protein MGC11138

Hs.90280 5-aminoimidazole-4-carboxamide ribonucle

Hs.90606 15 kDa selenoprotein

Hs.6975 PRO1073 protein

Hs.91773 protein phosphatase 2 (formerly 2A), cat

Hs.286049 phosphoserine aminotransferase

Hs.92166 Leman coiled-coil protein

Hs.92991 KIAA1414 protein

Hs.93201 glycerol-3-phosphate dehydrogenase 2 (mi

Hs.173685 hypothetical protein FLJ12619

Hs.9414 KIAA1488 protein

Hs.94262 p53-inducible ribonucleotide reductase s

Hs.9443 zinc finger protein 202

Hs.132390 zinc finger protein 36 (KOX 18)

Hs.95420 JM27 protein

Hs.279529 px19-like protein

Hs.96247 translin-associated factor X
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135207 

135214 

135243 

135245 

135257 

135263 

135274 

135294 

135295 

135307 

135321 

135354 

135361 

135389 

135400 

302256 

302276 

303135 

303686 

310085 

315518 

317781 

320836 

321114 

322221 

322474 

322556 

323541 

407827 

408196 

408813 

409176 

409487 

413670 

414108 

414846 

416040 

416980 

417378 

418283 

418467 

420269 

420802 

421225 

421642 

421828 

421983 

422052 

422055 

423750 

424001 

425182 

425284 

426372 

428049 

428477 

437562 

438449 

441560 

445580 

446999 

447111 

447778 

448873 

449687 

450701 

450703 



N26427 

T78802 

BE463721 

AI028767 

AW291023 

AI088775

AA448460

AA150320

AI090838

AT743770

AI652069

AA456454

AA373452

U05237

X78592 

AA857131 

AW057736 

AW592789 

AK000714 

R43191 

AA808229 

NM_007057

AI268997 

AA902256 

N24236 

AF118083 

BE041451 

AF292100 

BE278431 

AL034548 

AI580090 

R73727 

H19886 

AB000115 

AI267592 

AW304454 

AW819158 

AA381133 

R57256 

S79B95 

NM_006910

U72937 

U22376 

AA463798 

AF172066

AW891965 

AI252640 

AA302744 

NM_014320

AF165883 

W67883 

AF041259 

AF155568 

BE304680 

AW183765 

AW500533 

AB001636 

AK001333 

F13386 

AF167572 

AA151520 

AI017574 

BE620592 

NM_003677

W68520 

H39960 

M0112Q2 



Hs.9634 ESTs, Highly similar to C10_HUMAN PUTATI
Hs.96560 hypothetical protein FLJ11656

Hs.97101 putative G protein-coupled receptor

Hs.262603 ESTs

Hs.97255 ESTs, Weakly similar to A46010 X-linked

Hs.55498 geranylgeranyl diphosphate synthase 1

Hs.112017 GE36 gene

Hs.9800 protein kinase Njmu-R1

Hs.98006 ESTs

Hs.98368 ESTs, Weakly similar to KIAA0822 protein

Hs.98614 ribosome binding protein 1 (dog 180kD ho

Hs.183418 cell division cycle 2-like 1 (PITSLRE pr

Hs.167700 Homo sapiens cDNA FLJ10174 fis, clone HE

Hs.99872 fetal Alzheimer antigen

Hs.99915 androgen receptor (dihydrotestosterone r

Hs.171595 HIV TAT specific factor 1

Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2,

Hs.279474 HSPC070 protein

Hs.109441 MSTP033 protein

Hs.101248 Homo sapiens clone IMAGE:32553, mRNA seq

Hs.167771 ESTs

Hs.42650 ZW10 interactor

Hs.197289 rab3 GTPase-activating protein, non-cata

Hs.78979 Golgi apparatus protein 1

Hs.179662 nucleosome assembly protein 1-like 1

Hs.29494 PRO1912 protein

Hs.177507 hypothetical protein

Hs.104613 RP42 homolog

Hs.40323 BUB3 (budding uninhibited by benzimidazo

Hs.43627 SRY (sex determining region Y)-box 22

Hs.48295 RNA helicase family

Hs.101617 ESTs, Weakly similar to T32527 hypotheti

gb:yn57a05.h Soares adult brain N2b5HB5

Hs.75470 hypothetical protein, expressed in osteo

Hs.75761 SFRS protein kinase 1

Hs.77495 UBX domain-containing 1

Hs.289044 Homo sapiens cDNA FLJ12048 fis, clone

Hs.80684 high-mobility group (nonhistone chromoso

Hs.82037 TATA box binding protein (TBP)-associate

Hs.83942 cathepsin K (pycnodysostosis)

Hs.85273 retinoblastoma-binding protein 6

Hs.96264 alpha thalassemia/mental retardation syn

Hs.1334 v-myb avian myeloblastosis viral oncogen

Hs.102696 MCT-1 protein

Hs.106346 retinoic acid repressible protein

Hs.279789 histone deacetylase 3

Hs.110364 peptidylprolyl isomerase C (cyclophilin

Hs.104518 ESTs

Hs.111029 putative heme-binding protein

Hs.298229 prefoldin 2

Hs.137476 paternally expressed 10 (PEG10; KIAA105

Hs.155040 zinc finger protein 217

Hs.155489 NS1-associated protein 1

Hs.169531 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep

Hs.182238 GW128 protein

Hs.11482 splicing factor, arginine/serine-rich 11

Hs.5683 DEAD/H (Asp-Glu-Ala-Asp/His) box polypep

Hs.6216 Homo sapiens hepatocellular carcinomas

Hs.7888 Homo sapiens clone 23736 mRNA sequence

Hs.12912 skb1 (S. pombe) homolog

Hs.334822 hypothetical protein MGC4485

Hs.17409 cysteine-rich protein 1 (intestinal)

Hs.71190 ESTs, Weakly similar to S16506 hypotheti

Hs.22393 density-regulated protein

Hs.331328 intermediate filament protein syncoilin

Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA

Hs.184771 nuclear factor I/C (CCAAT-binding transc
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452461 

452511 

453157 

453658 

100685 

100690 

100833 

100850 

101161 

102481 

102831 

103549 

103749 

104331 

104532 

104563 

105032 

105039 

106531 

106977 

107298 

108717 

110018 

110330 

111391 

111392 

113554 

113722 

115008 

115062 

115121 

117881 

119075 

119615 

120253 

125006 

127609 

128868 

128891 

128959 

129209 

129449 

129453 

129629 

129917 

129922 

129989 

130182 

130365 

130471 

130542 

130586 

130768 

130992 

131047 

131135 

131339 

131760 

131774 

131853 

131881 

131887 

132031 

132192 

132203 

132240 

132348 



N78223 

BE408178 

AF077035 

BE541906 

AA328229 

AA383256 

AF135168 

AA836472 

NM_006262

U50360 

AA262170 

BE270465 

AL135301 

AB040450 

AI498763 

AL117403

AA127818 

AA907305 

AA454036 

AL043152 

N95657 

AA122393 

AW579842 

AI288666

NM_003896

W46342 

AW503990 

AV653556 

AK001827 

AA253314 

AI634549 

AF161470 

M10905 

AL034423 

AA131376 

BE065136 

X80031 

AA419008 

F34856 

AI580127 

R62676 

AI096988 

AW974265 

AK000398 

M30773 

AF042379 

AB015856 

BE267033 

W56119

AL121438

U64675 

AB007891 

AF25B627 

BE398091 

H23230 

NM_016569

AF058696 

X76732 

BE267158 

AI681917 

AW361018 

W17064 

AF193844 

AA206153 

NM_004782

AB018324 

AW067708 



Hs.108106 transcription factor

Hs.285165 Homo sapiens cDNA FLJ20845 fis, clone AD

Hs.31989 DKFZP586G1722 protein

Hs.87819 Homo sapiens, clone MGC:2492, mRNA, comp

Hs.184582 ribosomal protein L24

Hs.1657 estrogen receptor 1

Hs.108802 N-ethylmaleimide-sensitive factor

Hs.297939 cathepsin B

Hs.37044 peripherin

gb:Human calcium, calmodulin-dependent p

Hs.80917 adaptor-related protein complex 3, sigma

Hs.78793 protein kinase C, zeta

Hs.8768 hypothetical protein FLJ10849

Hs.279862 cdk inhibitor p21 binding protein

Hs.203013 hypothetical protein FLJ12748

Hs.306189 DKFZP434F1735 protein

gb:zl12a02.s1 Soares_pregnant_uterus_NbH

Hs.36475 ESTs

Hs.8832 ESTs

Hs.50421 KIAA0203 gene product

Hs.6820 ESTs, Moderately similar to YQJ1_CAEEL H

Hs.70811 hypothetical protein FLJ20516

Hs.104557 hypothetical protein FLJ10697

Hs.16621 DKFZP434I116 protein

Hs.225939 sialyltransferase 9 (CMP-NeuAc:lactosylc

Hs.325081 Homo sapiens, clone IMAGE:3659680, mRNA,

Hs.142442 HP1-BP74

Hs.184411 albumin

Hs.87889 helicase-moi

Hs.154103 LIM protein (similar to rat protein kina

Hs.88155 ESTs

Hs.260622 butyrate-induced transcript 1

Hs.287820 fibronectin 1

Hs.75875 ubiquitin-conjugating enzyme E2 variant

Hs.326401 fibroblast growth factor 12B

Hs.145696 splicing factor (CC1.3)

Hs.530 collagen, type IV, alpha 3 (Goodpasture

Hs.106730 chromosome 22 open reading frame 3

Hs.292457 Homo sapiens, clone MGC:16362, mRNA, com

Hs.107381 hypothetical protein FLJ11200

Hs.17820 Rho-associated, coiled-coil containing p

Hs.111554 ADP-ribosylation factor-like 7

Hs.111632 Lsm3 protein

Hs.11747 hypothetical protein FLJ20391

Hs.278540 protein phosphatase 3 (formerly 2B), reg

Hs.13386 gamma-tubulin complex protein 2

Hs.247433 activating transcription factor 6

Hs.192853 ubiquitin-conjugating enzyme E2G 2 (homo

Hs.155103 eukaryotic translation initiation factor

Hs.183706 adducin 1 (alpha)

Hs.179825 RAN binding protein 2-like 1

Hs.16349 KIAA0431 protein

Hs.211562 ATP-binding cassette, sub-family A (ABC1

Hs.74316 desmoplakin (DPI, DPII)

Hs.22481 ESTs, Moderately similar to A46010 X-lin

Hs.267182 TBX3-iso protein

Hs.25812 Nijmegen breakage syndrome 1 (nibrin)

Hs.3164 nucleobindin 2

Hs.169474 DKFZP586J0119 protein

Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU

Hs.3383 upstream regulatory element binding prot

Hs.332848 SWI/SNF related, matrix associated, acti

Hs.3758 COP9 complex subunit 7a

Hs.4209 mitochondrial ribosomal protein L37

Hs.194714 synaptosomal-associated protein, 29kD

Hs.42676 KIAA0781 protein

Hs.170311 heterogeneous nuclear ribonucleoprotein
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132526 
132571 
132726 
132863 
133016 
133053 
133197 
133240 
133266 
133285 
133383 
133540 
133784 
133791 
133650 
133859 
133881 
134208 
134403 
134724 
134806 
134859 
135193 



778736 

AW674699 

N52298 

BE268048 

AI439688 

AI065016 

AI275243 

AK001489 

AI160873 

M76477 . 

BE313555 

AL037159 

BE622743 . 

M34338 , 

W29092 

U86782 

U30872 

NMJ00288 

AA334551 

AF045239 

AD001528 

D26488 

X95525 

AA243007 

T70541 

X57766 

S66431 

AA453483 

R63925 

AA173417 

AA280588 

AA504223 

AA609996 

F02907 

AA480103 

AA024664 

AA251776 

AA399047 

N34059 

U95367 

AA490899 

T54762 

Z41963 

AA521186 

AA400195 

AA045083 

AA099589 

W85712 

W45728 

U61232 . 

AA425154 

T39176 

AA496000 

W38150 

T96595 

AA227463 

R46025 

AA233177 

AA338760 

AA412106 

L47276 

D82307 

AA293568 

R37778 

AA250843 

W49521 

D80000 



Hs.50758 

Hs.5169 

Hs.55608 

Hs.236494 

Hs.6289 

Hs.6390 

Hs.180201 

Hs.242894 

Hs.69233 

Hs.289082 

Hs.7252 

Hs.74619 

Hs.301064 

Hs.76244 

Hs.7678 

Hs.178761 

Hs.77204 

Hs.79993 

Hs.82767 

Hs.321576 

Hs.89718 

Hs.90315 

Hs.96103 



SMC4 (structural maintenance of chromoso 
suppressor of G2 allele of SKP1, S. cere
hypothetical protein MGC955
RAB10, member RAS oncogene family
hypothetical protein FLJ20886
Homo sapiens clone FLB3344 PRO0845 mRNA,
hypothetical protein FLJ20671
ADP-ribosylation factor-like 1
zinc finger protein



KIAA1224 protein
proteasome (prosome, macropain) 26S subu
arfaptin 1

spermidine synthase

cellular retinoic acid-binding protein 1

26S proteasome-associated pad1 homolog

centromere protein F (350/400kD, mitosin

peroxisomal biogenesis factor 7

sperm specific antigen 2

ring finger protein 22

spermine synthase

KIAA0007 protein

TATA box binding protein (TBP)-associate

ESTs

ESTs

Human stromelysin-3 mRNA

Homo sapiens clone 23592 mRNA sequence

ESTs 



7.4 

6.9 

14.3 

10.3 

4.4 

1.8 

1.8 

1.8 

16.1 

10.4 

15 

1.7 

12.1 

9.7 

4.2 

12 

9.1 

3.2 

1.4 

1.4 

2.6 

13.3 

3.1 

1.6 

2.5 

4.5 

3.1 

4.6 



ESTs 1.4 
ESTs 1.9 
ESTs 2.2 
ESTs Highly similar to CHROMOSOME 2.4 
ESTs Highly similar to Surf4 protein [M.musculus] 5.5
ESTs 2.3
ESTs Weakly similar to !!!! ALU SUBFAMILY J 2.8
Human NADH:ubiquinone oxidoreductase subunit 6.2
ESTs 2.3
ESTs 2.4
EST-RC_N34059 3.3
Human GABA-A receptor pi subunit mRNA complete cds
ESTs 3.3
ESTs 2.9
Homo sapiens HP protein (HP) mRNA complete cds 1.3
ESTs 1.6 
ESTs 1.3 
VITAMIN K-DEPENDENT GAMMA-CARBOXYLASE 
Homo sapiens mRNA for GDP dissociation inhibitor beta 
ESTs Weakly similar to PROCOLLAGEN ALPHA 2(W 
ESTs Highly similar to HETEROGENEOUS 3.7 
Human tubulin-folding cofactor E mRNA complete cds 



ESTs 



5.3 
2.6 
1.9 
1.7 
1.8 



ESTs Weakly similar to ZK1056.4 [C.elegans]
ESTs

EST-RC_W38150
EST-RC_T96595
ESTs Weakly similar to No definition line found [C.elegans]
ESTs 2.8
ESTs 2
ESTs 1.3

ESTs 6.2
EST-L47276 3.4
ESTs Weakly similar to TH1 protein [D.melanogaster]



ESTs 
ESTs 

Interferon regulatory factor 5 

Human prolyl 4-hydroxylase alpha (II) subunit

Human mRNA for KIAA0178 gene partial cds 
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R99978 

AA195036

Z38501 

U37547 

AA479961 

X57579 

AA449071 

N51855 

AA421213 

AA355201 

N78717 

N73808 

U86782 

AA234817 

013666 

AA236177 

U50648 

M28211 

AA446949 

W03007 

W61011 

W87544 

X02751 

Z14077 

Z38839 

AA410894 

AA504499 



ESTs Weakly similar to line-1 protein ORF2 [H.sapiens] 6/L

Human Ro/SSA ribonucleoprotein homolog (RoRet 5.3 ?

ESTs Weakly similar to PROBABLE E5 14 other

Human IAP homolog B (MIHB) mRNA complete cds 3.2 other

ESTs 1.7 other

Inhibin beta A (activin A activin AB alpha polypeptide) 15.8

ESTs 1.3 TM

ESTs Moderately similar to NAD(+) ADP- 1.3 other

ESTs Weakly similar to F28F8.3 [C.elegans] 22 other

ESTs 1.2 SS,TM

H.sapiens mRNA for translin 1.5 ?

ESTs 5 ?

Human 26S proteasome-associated pad1 22 other

ESTs 1.3 other

Homo sapiens mRNA for osteoblast specific 7.5 SS,
ESTs 7.1 ?
Protein kinase interferon-inducible double 4.1 ?

Homo sapiens GTP-binding protein (RAB4) 2.9 other

ESTs 2.2 other

ESTs 12 other

ESTs ^2 other

ESTs 1.2 other

Neuroblastoma RAS viral (v-ras) oncogene homolog 1.2 ?

YY1 transcription factor 12 other

ESTs 1.2 ?

ESTs 1.7 other

ESTs Highly similar to probable chloride channel 3 [H.sap 1.3



other 
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TABLE 7A 

Table 7A shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 7.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accession

102481 31281.-28 U50360
105032 genbank_AA127818 AA127818
409487 1134778_1 H19886 AW402806 T10231
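The row layout of Table 7A (a Pkey, a CAT number, then one or more Genbank accession numbers) can be read into a lookup keyed by Pkey. The following is a minimal sketch only, not part of the disclosed method: the names `ClusterRow`, `parse_cluster_row` and `index_by_pkey` are hypothetical, and the two sample rows use cleaned-up readings of the table entries, which are themselves an assumption.

```python
from typing import Dict, Iterable, NamedTuple, Tuple


class ClusterRow(NamedTuple):
    """One Table 7A row (hypothetical container, for illustration only)."""
    pkey: int                    # unique Eos probeset identifier number
    cat_number: str              # gene cluster number
    accessions: Tuple[str, ...]  # Genbank accession numbers in the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into Pkey, CAT number and accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(
            f"expected Pkey, CAT number and at least one accession: {line!r}"
        )
    return ClusterRow(int(fields[0]), fields[1], tuple(fields[2:]))


def index_by_pkey(lines: Iterable[str]) -> Dict[int, ClusterRow]:
    """Build a Pkey -> row lookup, skipping blank lines."""
    index: Dict[int, ClusterRow] = {}
    for line in lines:
        if line.strip():
            row = parse_cluster_row(line)
            index[row.pkey] = row
    return index


# Sample rows; the CAT numbers and accessions are assumed readings.
rows = [
    "105032 genbank_AA127818 AA127818",
    "409487 1134778_1 H19886 AW402806 T10231",
]
clusters = index_by_pkey(rows)
print(clusters[409487].accessions)
```

A lookup of this kind would let a Pkey that has no UnigeneID in Table 7 be resolved to the accession numbers of its cluster.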
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TABLE 8: Figure 8 from BRCA 001-1 US 

Table 8 shows genes upregulated in tumor tissue compared to normal breast tissue. 
Specifically, one column shows the ratio of expression of the indicated gene in breast tumor 
tissue compared to other body tissues, and another column shows the ratio of expression of 
the indicated gene in breast tumor tissue compared to normal breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of tumor to normal breast tissue
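The two ratio columns defined above can be illustrated with a short calculation. This is a sketch under stated assumptions rather than the procedure used to generate Table 8: the text does not say how expression levels were summarized for each tissue group, so the single summary value per group, the names `ExpressionRatios`, `expression_ratios` and `is_upregulated`, and the `min_ratio` cutoff are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExpressionRatios:
    r1: float  # tumor expression / normal body tissue expression
    r2: float  # tumor expression / normal breast tissue expression


def expression_ratios(tumor: float, normal_body: float,
                      normal_breast: float) -> ExpressionRatios:
    """Compute R1 and R2 from one summary expression value per tissue group.

    How each summary value is obtained (mean, median, percentile) is an
    assumption left to the caller; the source does not specify it.
    """
    if normal_body <= 0 or normal_breast <= 0:
        raise ValueError("reference expression values must be positive")
    return ExpressionRatios(tumor / normal_body, tumor / normal_breast)


def is_upregulated(ratios: ExpressionRatios, min_ratio: float = 3.0) -> bool:
    """Flag a gene when either ratio reaches a cutoff (hypothetical value)."""
    return ratios.r1 >= min_ratio or ratios.r2 >= min_ratio


example = expression_ratios(tumor=300.0, normal_body=100.0, normal_breast=50.0)
print(example.r1, example.r2, is_upregulated(example))
```

Keeping R1 and R2 separate, as the table does, allows a gene to be judged against normal breast tissue and against other body tissues independently.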



Pkey ExAccn UnigeneID Unigene Title

100075 AF152333 Hs.284160 protocadherin gamma subfamily B, 4

100229 AV652249 Hs.180107 polymerase (DNA directed), beta

100282 D38500 Hs.278468 postmeiotic segregation increased 2-like

100271 BE160081 Hs.256290 S100 calcium-binding protein A11 (calgiz

100355 AI907114 Hs.71465 squalene epoxidase

100522 X51501 Hs.99949 prolactin-induced protein

100552 AA019521 Hs.301946 lysosomal

100599 X77343 Hs.334334 transcription factor AP-2 alpha (activat

100676 X02761 Hs.287820 fibronectin 1

100690 AA383256 Hs.1657 estrogen receptor 1

100895 U01351 Hs.75772 nuclear receptor subfamily 3, group C, m

101046 K01160 NM_002122:Homo sapiens major histocompati

101086 AA382524 Hs.250959 histatin 1

101148 NM_002923 Hs.78944 regulator of G-protein signalling 2, 24k

101161 NM_006262 Hs.37044 peripherin

101201 L22524 Hs.2256 matrix metalloproteinase 7 (MMP7; uterin

101212 AI186220 Hs.83164 collagen, type XV, alpha 1

101441 AW468397 Hs.100000 S100 calcium-binding protein A8 (calgran

101447 M21305 gb:Human alpha satellite and satellite 3

101469 AA310162 Hs.169248 cytochrome c

101567 M33552 Hs.56729 lysosomal

101600 BE561617 Hs.119192 H2A histone family, member Z

101624 M55998 gb:Human alpha-1 collagen type I gene, 3

101674 NM_002291 Hs.82124 laminin, beta 1

101861 AA350659 Hs.83347 angio-associated, migratory cell protein

101977 AF112213 Hs.184062 putative Rab5-interacting protein

102193 AL036335 Hs.313 secreted phosphoprotein 1 (osteopontin,

102199 AA334592 Hs.79914 lumican

102304 AF015224 Hs.46452 mammaglobin 1

102345 NM_003480 Hs.300946 Microfibril-associated glycoprotein-2

102457 NM_001394 Hs.2359 dual specificity phosphatase 4

102534 U96759 Hs.198307 von Hippel-Lindau binding protein 1

102541 AI379954 Hs.79025 KIAA0096 protein

102827 BE244588 Hs.6456 chaperonin containing TCP1, subunit 2 (b

102962 R50032 Hs.159263 collagen, type VI, alpha 2

102991 AW293542 Hs.75309 eukaryotic translation elongation factor

103119 X63629 Hs.2877 cadherin 3, type 1, P-cadherin (placenta

103175 X69089 Hs.79227 myomesin (M-protein) 2 (165kD)

103286 D38616 Hs.54941 phosphorylase kinase, alpha 2 (liver)

103319 X83492 Hs.82359 tumor necrosis factor receptor superfami

103372 BE536700 Hs.4888 seryl-tRNA synthetase

103419 T34708 Hs.272927 Sec23 (S. cerevisiae) homolog A

103471 Y00815 Hs.75216 protein tyrosine phosphatase, receptor t

103546 Z14244 Hs.75752 cytochrome c oxidase subunit VIIb
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R1 


R2 


1 


3.8 


1.7 


5.3 


0.8


4.8 


3.2 


2.3 


3.3 


1.4 


11.9 


0.4 


3.8 


\2 


9.4 


9.4 


3 


7.8 


4.4 


4.4 


1 


3.9 


41.7 


4 


0.8 


4.1 


M 


12 


3.1 


1.1 


4.4 


0.6 


3.1 


3.4 


0.9 


4.2 


29.9 


0.3 


0.8 


4.9 


1 


5.9 


2.8 


4 


3.1 


1.7 


1.5 


4.1 


3.1 


1.4 


1.3 


6.9 


1.9 


4.9 


2.2 


3.8 


4.2 


0.7 


1.1 


42 


4.5 


0.5 


1.4 


4.2 


0.9 


3.9 


1.5 


10.9 


2.2 


62 


5.6 


5.7 


3.7 


0.5 


1.3 


4 


1.3 


3.8 


0.8


4.6 


0.9 


8 


1.1 


5.1 


3.7 


12 


0.9 


4.4 
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
103558 
103758 
103774 
103821 
103869 
103980 
104054 
104115 
104189 
104230 
104278 
104295 
104319 
104425 
104432 
104464 
104479 
104505 
104592 
104613 
104636 
104782 
104792 
104848 
104849 
104850 
104852 
104861 
104873 
104891 
104920 
104926 
104952 
104963 
104977 
105030 
105035 
105068 
105159 
105178 
105182 
105274 
105303 
105413 
105426 
105432 
105443 
105483 
105492 
105495 
105539 
105594 
105623 
105807 
105812 
105823 
105831 
105851 
105879 
105918 
105939 
105941 
105969 
105990 
106012 
106060 
105070 



NM_000088 

AA084874 

H24185 

AA095971 

BE439604 

AW130242 

AK001913 

AF183810 

AB040927 

AB002347 

AW583693 

AW365522 

AW804296 

AF283775 

X51501 

AW966728 

AK001731 

W94824

AW630488 

AF123303 

R82252 

AW270555 

AA960961 

AA305351 

AI279065 

AL133035 

W70164 

AA058630 

W03831 

W44626 

AW955089 

BE298808 

AW076098 

AB029020 

AI392640 

BE613061 

N39760 

BE410438 

AF146277 

AA313825 

BE407961 

AI554929 

BE243327 

AI015709 

W20027 

W03516 

M252372 

AL137257 

AI605717

AL037715 

AB040884 

AB024334 

BE504200 

AA788946 

BE614149 

AI559444 

AA329449 

AI827976 

BE392914 

AW028485 

AL137728 

AB033075 

AB030656 

AI690586 

AI240665 

NMJXM329 

T74445 



Hs.172928 collagen, type I, alpha 1 3.2

gb:zn13e04.r1 Stratagene hNT neuron (937 0.9

hypothetical protein 1.9

Homo sapiens cDNA: FLJ22463 fis, clone H \2



Hs.92918 
Hs.198793 
Hs.24322 
Hs.293476 
Hs.7100 
Hs.26102 
Hs.301804 
Hs.15303 
Hs.109253 
Hs.103657 
Hs.9950 
Hs.35380 
Hs.99949 
Hs.54642 
Hs.106390 
Hs.11565 
Hs.325820 
Hs.24713 
Hs.106106 
Hs.171774 
Hs.305953 
Hs.274369 
Hs.241507 
Hs.8728 
Hs.20107 
Hs.29759 
Hs.20597 
Hs.30627 
Hs.306083 
Hs.33363 
Hs.74316 
Hs.173694 
Hs.18272 
Hs.337772 
Hs.8859 
Hs.9006 
Hs.265561 
Hs.21941 
Hs.18271 
Hs.281866 
Hs.182626 
Hs.172089 
Hs.23439 
Hs.76698 
Hs.12144 
Hs.23458 
Hs.289112 
Hs.28785 
Hs.109694 
Hs.250O1 
Hs.30127 
Hs.16869 
Hs.20814 
Hs.293960 
Hs.247302 
Hs.24391 
Hs.30503 
Hs.26136 
Hs.12258 
Hs.10669 
Hs.17377 
Hs.29403 
Hs.8895 
Hs.171391 
Hs.5957 



ATPase, H+ transporting, lysosomal (vacu 1.4 

hypothetical protein FKSG44 1.6 

hypothetical protein 1.5

opposite strand to trichorhinophalangeal 7

KIAA1494 protein 2

KIAA0349 protein 0.7

N-terminal acetyltransferase complex ard 3.3

hypothetical protein PRO2219 2.3

Sec61 gamma 3.1

x 001 protein 4

prolactin-induced protein 3.8

methionine adenosyltransferase II, beta 0.8
Homo sapiens mRNA; cDNA DKFZp586H0924 (f

RIKEN cDNA 2010100012 gene 2

protease, serine, 23 1.9

hypothetical protein 1.1

protein kinase (cAMP-dependent, catalyti 1.2

hypothetical protein 1.4

zinc finger protein 83 (HPF1) 1.5

uncharacterized hypothalamus protein HAR 1.1

ribosomal protein S6 1.3

hypothetical protein DKFZp434G171 1.2

ESTs 0.8
RNA POLYMERASE I AND TRANSCRIPT RELEASE

host cell factor homolog 0.8

ESTs 0.7
Novel human gene mapping to chomosome 22

DKFZP434N093 protein 3.3

desmoplakin (DPI, DPII) 1.2

KIAA1097 protein 1.1

amino acid transporter system A1 3.2

Homo sapiens, Similar to RIKEN cDNA 0610 1.6

Homo sapiens, Similar to RIKEN cDNA 5830 1.5

VAMP (vesicle-associated membrane protei 1.1

CD2-associated protein 1.2

AD036 protein 3.6

golgi phosphoprotein 3 1.7

ATPase, H+ transporting, lysosomal (vacu 1.1

chromosome 22 open reading frame 5 1.5
Homo sapiens mRNA; cDNA DKFZp586I2022 (f

ESTs 4.3

stress-associated endoplasmic reticulum 1.5

KIAA1033 protein 1.2

Homo sapiens cDNA: FLJ23015 fis, clone L 1.7

COM3 protein 2

microfibrillar-associated protein 3 1.3

KIAA1451 protein 2.7

tyrosine 3-monooxygenase/tryptophan 5-mo 1.3

hypothetical protein 1.7

ESTs, Moderately similar to CA1C RAT COL 3.9

CGI-27 protein 1.8

ESTs 1.9

twisted gastrulation 13

hypothetical protein FLJ13612 3.8

Homo sapiens cDNA FLJ11344 fis, clone PL 1.7

hypothetical protein MGC14156 1.7 
Homo sapiens mRNA; cDNA DKFZp434B0920 (f 



1.3 
1.1 
2 

4.1 
2.6 

Homo sapiens clone 24416 mRNA sequence 1.4 



development and differentiation enhanci
coronin, actin-binding protein, 10
hypothetical protein FLJ22060
ESTs 



3 

10 

15.9 

3.9 

3.9 

4.1 

4.3 

7 

4.6 
4.5 
3.3 
42 
7 

1.3 
0.6 
6.7 
1.7 
7.5 
7.4 
6.3 
4 

3.9 

42 

4.1 

4.6 

3.6 

42 

1.7 

5.4 

6.6 

1 3.9 

3.3 

3.7 

5.5 

I. 4 . 

II. 4 
7.2 
3.5 
10 
8.3 
6.8 
3.7 
4 

I. 5 
2.9 
5 

3.6 
15.8 
4.8 
3.9 

II. 4 
6.1 
4.5 
24.6 
3.6 
6.6 
4.3 
1.9 
4 

7.4 
1.2 
4.6 
5.9 
4.6 
U 
7 

10.7 



4.8 



5.1 



14 



3.8 
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106083 
106155 
106255 
106414 
106538 
106566 
106574 
106613 
106617 
106619 
106701 
106721 
106776 
106866 
106868 
106887 
106940 
106968 
107052 
107061 
107149 
107222 
107233 
107295 
107679 
107914 
107965 
108033 
108060 
106081 
108137 
108186 
108215 
108297 
108339 
108371 
108399 
108469 
108470 
108564 
108641 
108668 
108694 
108824 
108863 
108893 
108992 
109072 
109097 
109160 
109244 
109481 
109484 
109795 
.110009 
110107 
110411 
110731 
110756 
110930 
110935 
111051 
111110 
111356 
111357 
111770 
111900 



H62087 

AA425414 

BE613206 

BE568205 

AK000274 

AW051564 

BE044325 

N88604 

H09548 

AA459480 

BE387614 

AA741038 

AA206079 

AA487416

BE185536 

BE503373 

T85594 

AF216751 

BE391904 

BE147611 

AI289507 

BE172058 

BE267795 

AA186629 

AA011510 

AA027229 

AF109219 

AW368993 

AA291440 

AA093668 

AI283611 

AW068579 

AI879238 

AA333660 

AW151340 

AA074374 

AF086070 

AA079487 

AA079500 

M23114 

AA112059

AA058522 

AA036725 

AK001332 

AA133456 

BE276891 

AA152312 

AI732585 

AA167512 

BE220601 

BE179030 

AA878923 

AA366263 

AA173942 

BE075297 

AW151660 

AW001579 

NM_014899

N21207 

BE242691 

AI753230

AI681293

AK001566 

BE301871 

BE314949 

R27975 

AF131784 



Hs.31 659 thyroid hormone receptor-associated prot 1 3 

Hs.33287 nuclear factor l/B 5.4 

Hs.279607 calpastatin 1.8 

Hs.28827 mitogen^acflvated protein kinase kinase 5.1 

Hs.278635 HDCMA18P protein 1.2 

Hs.28285 patched related protein translocated in 1.8 

Hs.227280 U6 snRNA-associated Sm-Oke protein 2.3 

Hs.30212 thyroid receptor interacting protein 15 . 1.2 

Hs.5367 ESTs, Weakly similar to I38022 hypothetl 0.9 

Hs.23956 hypothetical protein FU2Q502 1.3 

Hs.25797 spiia'ngfactor3b,subunit4 ) 49kD 1.6 

Hs.6670 ESTs , 1.7 

Hs.6693 hypothetical protein RJ20420 1 

Hs.268231 Homo sapiens cDNA: FU231 1 1 fe, clone L 1.6 

Hs.301183 molecule possessing ankyrin repeats Indu 3.3 

Hs.334335 hypothetical protein FU13576 1.4 

Hs.339808 hypothetical protein FU10120 3.3 

Hs.26813 CDA14 3 

Hs.12482 glyceronephosphate O-acyltransferase 1.7 

Hs.6354 ' stromal cell derived factor receptor 1 1.2 

Hs.299883 hypothetical protein FU23399 1.8 

Hs.82689 tumor rejection antigen (gp96) 1 \2 

Hs.22595 hypothetical protein FU10637 1.4 

Hs.80120 UDP^^atykilpha-0-galactosamlne:poIyp 2.6 

Hs.60512 ESTs 1.8 

Hs.61329 ESTs, Weakly similar to T16370 hypothetl .1.3 

Hs.108787 phosphatidyiinositol glycan, class N 1.6 
Hs.323748 Homo sapiens clone CDABP0086 mRNA sequen 

Hs.731 49 paired box gene 8 1 .1 

Hs.28578 musdeblind(Drosophi(aHike 0.7 
Hs.263479 ESTs, Weakly similar to HMG1_HUMAN HIGH 
Hs.7780 Homo sapiens mRNA; cDNA DKFZp564A072 (fr 

Hs.299315 coliapsin response mediator protein-5; C 1.5 

Hs.71331 hypothetical protein MGC5350 1.5 
Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S 

Hs.67639 ESTs U 

Hs.237519 EST 1 

gb:zm97fl)8.s1 Stratagene colon HT29 (937 1.5 

gb:zm96h10.s1 Stratagene colon HT29 (937 1.1 

Hs.1 526 ATPase, Ca++ transporting, cardiac musd 2 

Hs.429 ATP synthase, H+ transporting, mltochond 1.1 

Hs.185751 ESTs 1.2 

Hs.61847 ESTs . 1.4 

Hs.44672 hypothetical protein FU10470 1.4 

Hs.102548 glucocorticoid receptor DNA binding fact 1.2 

Hs.194691 retinoic acid induced 3 1.3 

Hs.72047 ESTs 1.1 

Hs.22394 hypothetical protein FU 10893 1.2 

gb:zp10f12.s1 Stratagene fetal retina 93 1.3 

Hs.301997 hypothetical protein FU13033 . 4 
Hs.64239 Human DNA sequence from clone RP5-1 174N9 

Hs.289069 hypothetical protein FU21 01 6 3.8 

Hs.72531 hypothetical protein FU11838 1.9 
Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (f 

Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p 4.6

Hs.31444 ESTs 1.2

Hs.9645 Homo sapiens mRNA for KIAA1741 protein, 3.7

Hs.188006 KIAA0878 protein 2.8

Hs.182999 ESTs 1.6 

Hs.14947 ESTs 3.1 

Hs.323562 hypothetical protein DKFZp564K142 1.9 

Hs.12186 hypothetical protein FU22558 2 

Hs.23618 hypothetical protein FU10704 1.1 

Hs.4867 mannosyl(aipha-1,3-)-glycoprotBinbeta- 1 

Hs.87128 hypothetical protein FU23309 3.3 

Hs.269401 ESTs, Moderately similar to S65657 alpha 1.2

Hs.25318 Homo sapiens clone 25194 mRNA sequence 3.2 



3.6 
\2 
4 

6.1 

5.9 

5.4 

11.2 

3.6 

4.4 

3.6 

7.3 

6.1 

5.4 

5.4 

1.2 

6.3 

1.8 

3 

7.6 

4.3 " 

6.5 

6.9 

3.5 

4.3 

4 

3.5 
3.5 
1.8 
3.5 
5.6' 
\2 
3.1 
4.6 
4 

6.3 
3.8 
3.6 
3.6 
4.3 
4.9 
3.5 
3.6 
3.6 
3.5. 
4 

3.6 
4.1 
3.5 
5 

6.1 
1.7 
7.7 
4 

3.7 
7.4 
3.5 
3.3 
3.7 
3.5 
1.2 
7.5 
4 

3.8 
8.2 
6.1 
5.4 
0.8 



8.1 



5.6 
6.9 



4.7 



7.4 



1.3 
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111903 NNL014906 Hs.166351 KIAA1072 protein 1 . 

111951 NM_014927 Hs.100527 KIAA0902 protein 1

112141 AW137198 Hs.278682 Phosphatidylglycerophosphate Synthase 1.4

112193 R49499 Hs.138238 ESTs 1.5 

112197 NM_003655 Hs.5637 ESTs 4.6

112610 AW500106 Hs.23643 serine/threonine protein kinase MASK 3.3

112971 Z42387 Hs.83883 transmembrane, prostate androgen induced 3.2

112984 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.7 

113056 AF019226 Hs.8036 glioblastoma overexpressed 4.5 

113449 AW160683 Hs.158006 hypothetical protein 1.2 

113497 AF143321 Hs.15572 hypothetical protein IMAGE 109914 0.9

113508 AL042936 Hs.211571 holocytochrome c synthase (cytochrome c 1.1

113531 AK001898 Hs.16740 hypothetical protein FLJ11036 1.2

113604 AI075407 Hs.296083 ESTs, Moderately similar to I54374 gene 1.7

113674 NM_014214 Hs.5753 inositol(myo)-1(or 4)-monophosphatase 2 0.8

113841 W30681 Hs.146233 Homo sapiens cDNA: FLJ22130 fis, clone H 1.7

113857 AW243158 Hs.5297 DKFZP564A2416 protein . 1.2 

113931 BE255499. Hs.3496 hypothetical protein MGC15749 1.5 

113936 W17056 Hs.83623 nuclear receptor subfamily 1, group I, m 3.8 

113987 AA345519 Hs.9641 complement component 1,q subcomponent, \2 

1 14132 AI342493 Hs.24192 Homo sapiens cONA FU20767 fls t done CO 0.3 

114156 BE179882 Hs.336920 glutathione peroxidase 3 (plasma) 1.1 

114213 N58309 Hs.19575 CGt-11 protein 1.6 

114636 AA075488 gb:zm88d01.s1 Stratagene ovarian cancer " 1.6 

114760 AI929382 Hs.252692 hypothetical protein FU20343 1.4 

114781 T10446 Hs.95388 ESTs 1 

114795 AB037858 Hs.173484 hypothetical protein FU 10337 1.6 

114901 AV660012 Hs.196437 hypothetical protein FU 10788 1.4 

115096 AI683069 Hs.175319 ESTs 3.7 

115518 BE541042 Hs.23240 Homo sapiens cDNA: FU21848 fis, done H 3.2 

115646 N36110 Hs.305971 solute earner family 2 (facilitated glu 1.5 

115764 AW582256 Hs.91011 anterior gradient 2 (Xenepus laevis) horn 1.3 

115802 AW410233 Hs.206521 YME1 (S.cerevisiae)-like 1 1.7

115994 AB037836 Hs.10931 5 KIAA1415 protein 1.5 

116032 BE383668 Hs.42484 hypothetical protein FU 1061 8 0.9 

116046 BE395293 , Hs.94491 hypothetical protein FU20297 1.6 

116274 AI129767 Hs.182874 guanine nudeotide binding protein (Gpr 3.2 

116310 Z24854 Hs.42299 ESTs 0.8 

1 16356 AI371223 . Hs.288671 Homo sapiens cDNA FU1 1997 fis, done HE 2.4 

116429 AF191018 Hs.279923 putative nudeotide binding protein, est 5.5 

116461 AA313607 Hs.58633 Homo sapiens cDNA: FU22145 fe, clone H 5 

116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 8.7 

116578 D21262 Hs.75337 nucleolar and coiled-body phosphoprotein 3.2

116579 AW888411 Hs.81915 leukemia-associated phosphoprotein p18 ( 3.2 
116589 AI557212 Hs.17132 ESTs, Moderately similar to I54374 gene 3.1 
116786 H25836 Hs.301527 ESTs, Moderately similar to unknown [H.s 3.2 
117170 N25929 Hs.42500 ADP-ribosylation factor-like 5 7 
117215 N20066 Hs.133207 PTPRF Interacting protein, binding prote 1.2 
117280 M18217 Hs.172129 Homo sapiens cDNA: FU21409 fis, done C 4.5 
117576 A1383467 Hs.44597 ESTs 1.4 
117667 U59305 Hs.44708 Ser-Thr protein kinase related to the my 4.3 
117881 AF161470 Hs.260622 butyrate-induced transcript 1 2.1 
118336 BE327311 Hs.47166 HT021 3.6 
118475 N66845 gb2a46c1U1 Soares fetal Ever spteen 4.2 
118493 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761J1112 (f 
118505 N67343 gb:yz50b07.s1 Morton Fetal Cochlea Homo 2.1 
119159 AF142419 Hs.15020 homoiog of mouse quaking OKI (KH domain 3.7 
119307 BE048061 Hs.37054 ephrin-A3 3 
119355 BE218319 Hs.5807 GTPaseRab14 .1.1 
119462 BE041667 Hs.314544 Homo sapiens cervical cancer suppressor- 1.4 
119771 AI905687 Hs.2533 EST 3.2
119940 AL050097 Hs.272531 DKFZP586B0319 protein 4.3
119943 BE565849 Hs.14158 copine III 3.5
120407 AA235207 Hs.250456 hypothetical protein DKFZp762F2011 1.5
120493 AW968080 Hs.152939 Homo sapiens clone 24630 mRNA sequence 4
120677 AF150208 Hs.108327 damage-specific DNA binding protein 1 (1 1.6
120867 AA350781 Hs.96967 ESTs 1.1 
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21368 BE262956 Hs.178292 KIAA01 80 protein 15 

21603 AA416765 Hs.249495 heterogeneous nuclear ribonucleoprotein 2.2 

21723 AA243499 Hs,104800 hypothetical protein FU10134 3.4 

22223 AF169797 Hs.27413 adaptor protein containing pH domain, PT 3.9 

22378 AB032948 Hs.21356 hypothetical protein DKFZp762K2015 . 1.4 

22946 AI718702 Hs.308026 major histocompatibility complex, class 1.4 

23155 AF121856 Hs.284291 sorting nexln 6 ' 1.2 

23158 AF161426 Hs.218329 hypothetical protein 2.4 

23327 AA421581 Hs.178443 ESTs 0.9 

23495 W28673 Hs.106747 serine carboxypeptidase 1 precursor prot 1.3 

23526 AA608657 gbae55d04.s1 Stratagene lung carcinoma 2.1 

23533 AA608751 gb:ae56h07.s1 Stratagene lung carcinoma 2.1 

23768 AI932318 Hs.188762 ESTs, Moderately similar to H2BL_HUMAN H 1.1 



23961 AL050184 Hs.21610 DKFZP434B203 protein 1.1 

23999 AF084555 Hs.7351 cyclic AMP phosphoproteln, 19 kD 1.4 

24000 BE563957 Hs.74861 activated RNA polymerase II transcrfptio 1.9 
24038 AB037860 Hs.173933 nudear factor l/A 1.5 
24059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothetl 14.8 
24083 AW195237 Hs.7734 hypothetical protein FU22174 1.2 
24148 BE300094 Hs.227751 lectin, galactoslde-binding, soluble, 1 2.5 
24153 AU077333 Hs.160483 erythrocyte membrane protein band 7.2 (s 1 
24252 BE613340 Hs.334725 Homo sapiens, Similar to RIKEN cONA 9430 1.5 
24314 AK001552 Hs.215766 GTP-binding protein 1-8 
24375 D87454 Hs.192966 KIAA0265 protein 1.1 
24432 N39016 Hs.268869 ESTs, Weakly similar to ALUCJJUMAN HI! 13 
24447 N48000 gb:yy98e12.s1 Soares_multiple..sderosis_ 2.7 
24539 D54120 Hs.146409 cell division cyde 42 (GTP-binding prot 2,1 
124543 AI393320 Hs.104573 ESTs 1
124564 H66409 Hs.108275 ESTs 1.4
124574 AL036596 Hs.42322 A kinase (PRKA) anchor protein 2 0.7
124605 AA749315 Hs.77171 minichromosome maintenance deficient (S. 1.1
124639 H60193 Hs.21143 DKFZP586C1324 protein 1.4
124659 AI680737 Hs.289068 Homo sapiens cDNA FLJ11918 fis, clone HE 1.5
124737 BE270465 Hs.78793 protein kinase C, zeta 0.7

124760 AW408586 Hs.91052 ESTs, Moderately similar to ALU5_HUMAN A 0.9

124763 BE410405 Hs.76288 calpain 2, (m/II) large subunit 1.3

124792 R44357 Hs.48712 hypothetical protein FLJ20736 1.8

124842 R56485 gb:yg93h09.s1 Soares infant brain 1NIB H 1

124940 AF068846 Hs.103804 heterogeneous nuclear ribonucleoprotein 3.2

124949 AI903210 Hs.336780 tubulin, beta polypeptide 1

124960 AL023513 Hs.194766 seizure related gene 6 (mouse)-like 0.9

124995 T52700 Hs.110044 ESTs 0.9

125030 AA610577 Hs.187775 ESTs 1.2

125034 BE548446 Hs.5167 Homo sapiens mRNA; cDNA DKFZp434F152 (fr

125058 T83731 Hs.3343 phosphoglycerate dehydrogenase 0.9



25076 AA973971 gb.-oq02h08.s1 NCI JX3APJ.U5 Homo sapfens 



25090 T91518 gb:ye20fO5.s1 Stratagene lung (937210) H 3.2 

25103 AA570056 Hs.122730 ESTs, Moderately similar to KIAA121 5 pro 5.3 

25154 W38419 gb:zc78aQ7.s1 Pancreatic Islet Homo sapi 0.9 

25155 AA837043 Hs.143669 ESTs 1.1 
25159 AK000669 Hs.274428 TRF2-interacting telomeric RAP1 protein 1.1 
25170 AL020998 Hs.8518 selenoprotein N 1.1 
25181 R40815 Hs.12398 ESTs, Weakly similar to 2004399A chromos 1 
25193 W67577 Hs.84298 CD74 antigen (Invariant polypeptide of m \2 
25260 H05635 Hs.294030 topolsomerase-related funcfon protein 4 1 
25262 AW884980 Hs.171957 triple functional domain (PTPRF Interact 1.3 
25272 BE612888 Hs.180224 myosin regulatory light chain 1.1 
25388 W27235 Hs.64311 a disintegrin and metalloproteinase doma 1.4 
25824 245258 Hs.286013 short coiled-coil protein 2.4 
25852 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 
25970 AW504721 Hs.177516 high density lipoprotein binding protein 1.9 
26192 AW160399 Hs.30376 hypothetical protein 1.4 
26469 BE384361 Hs.182885 ESTs, Weakly similar to JC5024 UDP-galac 2 
126510 AA057593 Hs.334762 hypothetical protein FU14735 1.3 
127095 AA340277 Hs.10248 Homo sapiens cDNA FLJ20167 fis, clone CO 1.3
127524 AI243596 Hs.94830 ESTs, Moderately similar to T03094 A-Wn 4.3
128312 J04182 Hs.150101 lysosomal 15
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X02761 
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H08379 

NWL005904 

NM_003478 
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AW517695 



Hs.287820 
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Hs.165563 

Hs.100602 

Hs.101299 

Hs.38260 
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Hs.103147 
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fibronectlnl 1.2 
ESTs, Highly similar to LDHH_HUMAN L-LAC 3.1 

hypothetical protein DKFZp434N1429 0.6 

MAD (mothers against decapentapiegic Dr 1.3 

cuDIn 5 1 

ubiquttin specific protease 18 0.8 

K1AA0680 gene product 1.3 

hypothetical protein FU21347 1.4 

solute carrier family 7 (catfonte amino 1.2 

WW domain-containing protein 1 0.8 

Hypothetical protein FU20396 3 

hypothetical protein RJ10702 2.8 

CD81 antigen (target of antiproOferaSv 1.1 

secreted frizzled-reiated protein 4 1 

ATPase, H+ transporting, lysosomal (vacu 5.3 

Homo sapiens clone 23836 mRNA sequence 2.2 
ESTs .1 

hypothetical protein 0.6 

DKFZP566D193 protein 4.6 

CocoaCrisp 3 

hypothetical protein 0.2 

programmed eel) death 5 2.5 

hypothetical protein FU20585 4 

MDS023 protein 1.2 

hypothetical protein 1.4 

RAB31, member RAS oncogene family 2.3 

KIAA0909 protein 0.9 

Homo sapiens cDNA FU14028 fis» clone HE 1.5 
NM_002495*:Homo sapiens NADH dehydrogena 

hypothetical protein FU22059 1 .2 

Homo sapiens cDNA FU14368 fis, clone HE 2.1 

DKFZP564M1 12 protein 2.4 

KJAA1 268 protein 1.2 

sterol regulatory element binding transc 1 .2 

DKFZP586A0522 protein 0.5 

CGI-109 protein 1 
Homo sapiens mRNA; cDNA DKFZp566M0947 (f 
C-terminal binding protein 2 
Empirically selected from AFFX single pr 
glucosidase, beta; acid (Includes glucos 
ESTs 

ubiquitin-iike 5 

hematological and neurological expressed 
TAK1 -binding protein 2; KIAA0733 protein 
CGI-131 protein 

Empirically selected from multiple AFFX 
poly(A)~btnding protein, cytoplasmic 1-1 
hypothetical protein 
ras homolog gene family, member H 
putative L-type neutral amino add trans 
ESTs 

ribosomal protein L26 homolog 
tumor metastasis-suppressor 
Meisl (mouse) homolog 
solute carrier family 12 (sodlum/potassi 
CGM 11 protein 
hypothetical protein 
hypothetical protein . 
ferritin, light polypeptide 
heat shock factor binding protein 1 
DKF2P547E1010 protein 
FSHD region gene 1 



STRIN protein 
detta-tubulin 

cisplatin resistance related protein CRR 
junctional adhesion molecule 1 
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129606 AW968941 Hs.166254 hypothetical protein DKF2p566l133 2.4 

129619 AA209534 Hs.284243 tetraspan NET-6 protein 3.2 

129620 D79338 Hs.239720 CCR4-NOT transcription complex, subunlt 1.6 

129621 AL1 10212 Hs.301005 purine-rich element binding protein B 1.1 
129634 AB020335 Hs. 181 300 seM (suppressor of lin-1 2, C.elegans)- 0.9 
129663 AI207406 Hs.11866 translocase of Inner mitochondrial membr 1.9 
129679 AW889132 Hs. 11916 ribokinase 0.9 
129688 U53209 Hs.24937 transformer-2 alpha (htra-2 alpha) 1.3 
129691 M26939 Hs.1 19571 collagen, type III, alpha 1 (Ehlers-Oanl 4.7 
129712 U46386 Hs.12102 sorting nexto 3 U 
129747 AL050272 Hs.12305 DKF2P566B1 83 protein 1 
129788 BE397454 Hs.124969 Homo sapiens clone 24707 mRNA sequence 1.4 

129796 BE218319 - Hs.5807 GTPaseRab14 2.9 

129797 M62839 Hs.1252 apollpoprotein H (beta-2-glycoprotein I) 0.3 
129800 AF052112 Hs.12540 lysosomal 1.6 
129834 AL080084 Hs.296155 CGM0O protein 05 
129836 AW410233 Hs.206521 YME1 (S.cerevislaeHike 1 1.8 
129843 NM.014840 Hs.200598 KIAA0537 gene product 0.9 
129874 AA626937 Hs.181551 hypothetical protein MGC2594 1.4 
129878 243161 Hs.283714 30 kDa protein 1.1 
129904 AL1 19499 Hs. 13285 neuronal potassium channel alpha subunlt 1 
129917 M30773 Hs.278540 protein phosphatase 3 (formerly 2B),reg 2 
129976 X14008 Hs.234734 lysosomal 0.9 
129982 Z14221 gb:H.saplens germGne transcript of Ig h 1.2 
130007 R15917 Hs.142570 Homo sapiens clone 24629 mRNA sequence 4.3 
130060 BE277024 Hs.146381 RNA binding motif protein, X chromosome 1.6 
130064 X57815.comp Empirically selected from AFFX single pr 1.2 
130068 M93143 Hs.262869 plasminogen-like 1.4 
130090 H97878 Hs.132390 zinc finger protein 36 (KOX 18) 1.4 
130095 AK001635 Hs.14838 hypothetical protein FU10773 0.2 
130102 W61005 Hs.14896 DHHC1 protein 1 - 
130112 AA916785 Hs.180610 splicing factor proline/glutamine rich ( U
130115 T47294 Hs.149923 X-box binding protein 1 3.8
130123 NM_005095 Hs.150390 zinc finger protein 262 1
130150 BE094846 Hs.15113 homogentisate 1,2-dioxygenase (homogenti 0.5
130161 R42678 Hs.151385 KIAA0564 protein 1
130210 M23115 Hs.1526 ATPase, Ca++ transporting, cardiac muscl 0.4
130213 BE278370 Hs.15265 heterogeneous nuclear ribonucleoprotein 1.7 
130215 BE301883 Hs.152707 glioblastoma amplified sequence 1 
130232 U29463 gb:Human cytochrome b561 gen , 1.2 
130252 U92014 Hs.153527 Homo sapiens pTM5 mariner-like transpose 1.3 
130281 W78907 Hs.15395 similar to arginyWRNA synthetase (aigi 1.5 
130343 AB040914 Hs.278628 KIAA1481 protein 2.9 
130385 AW067800 Hs.155223 stannlocalcin 2 3.2 
130414 AW842182 Hs.241392 small Inducible cytokine A5 (RANTES) 1.4 
130417 AW163518 Hs.155485 huntingtin interacting protein 2 1.7 
130440 AA852868 Hs.132853 K1AA0171 gene product 1.1 
130442 NMJ06245 Hs.1 18244 protein phosphatase 2, regulatory subuni 1.4 
130465 AW362955 Hs.15641 Homo sapiens cDNA RJ14415 fis, ctone HE 1.6 
130479 R44163 Hs.12457 hypothetical protein FU10814 0.9 
130499 AB007915 Hs.158286 KIAA0446 gene product 1 
130546 AI598022 Hs.193989 TAR DNA binding protein • 1.3 
130568 AA232119 Hs.16085 putative G-proteln coupled receptor 1.2 
130606 AI652143 Hs.288382 hypothetical protein FU131 11 1 
130612 BE242873 Hs.16677 WD repeat domain 15 1.1 
130616 AL049963 Hs.284205 up-regulated by BCG-CWS 0.6 
130623 AL045128 Hs.1691 glucan (1,4-alpha-), branching enzyme 1 0.9 
130629 AL042896 Hs.1697 ATPase, H+ transporting .lysosomal (vacu 0.9 
130632 AW073971 Hs.238954 ESTs, Weakly similar to KIAA1 204 protein 0.9 
130639 AI557212 Hs.17132 ESTs, Moderately similar to I54374 gene 2.6
130641 AF158555 Hs.239189 glutaminase 1.2
130653 AI861791 Hs.278479 TSPY-like 1.3
130655 AI831962 Hs.17409 cysteine-rich protein 1 (intestinal) 2.5
130666 AL117508 Hs.194035 KIAA0737 gene product 1.3
130669 AI928985 Hs.17680 hypothetical protein MGC1314 similar to 1.4 

130693 R68537 Hs.17962 ESTs 3.2 

130694 NNL014827 Hs.17969 KIAA0663 gene product 11 
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AA325308 


Hs.18016 Homo sapiens mRNA; cDNA DKFZp586H0324 (f 


18 


30701 


298883 


Hs.16079 phosphatidyllnosito! glycan, class Q 


1.1 


6.7 


30707 


AW190925 


Hs.203559 hypothetical protein FU12701 
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4.1 


30731 


AI932971 


Hs.18593 Homo sapiens cDNA: FU21449 fis, clone C 
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AF072813 


Hs.252831 reticu!on3 
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AA088809 


Hs.19525 hypothetical protein FU22794 
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6.8 
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NM.001761 


Hs.1973 cydinF 
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4.1 


30863 


Y10805 


Hs.20521 HMT1 (hnRNP methyltransferase, S. cemvl 


3.2 


5.9 
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AB037750 


Hs.21061 KIAA1329 protein 


1 


. 3.8 


30908 


AW195747 


Hs.21122 hypothetical protein FU1 1830 similar to 


1.3 


7.9 


30911 


BE409769 


Hs.2 1 169 OnaJ (Hsp40) homolog, subfamily A r membe 27 


3.7 


30913 


BE390905 


Hs.21198 transtocase of outer mitochondrial membr 


1.9 
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30923 


H96115 • 


Hs.21293 UDP-N-actey!glucosamine pyrophosphorylas 1.9 


10.3 


30959 


AB023182 . 


Hs.184523 KIAA0965 protein 


1.5 


6.8 


30967 


AA393071 


Hs.1 82579 leucine aminopeptidase 


1.4 


5.5 


30975 


AA099923 


Hs.283728 PEST-containlng nuclear protein 


1.3 


3.8 


31037 


BE243101 


Hs.22391 chromosome 20open reading frame 3 


1.9 


4.1 


31039 


D87436 


Hs.166318 Gpin2 


16 


3.5 


31060 


AA194422 


Hs.22564 myosin VI 


4.5 
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31097 


AL137682 


Hs.22937 l-kappa-B-interacting Ras-like protein 2 


2 


3.7 


31101 


BE387561 


Hs.22981 DKFZP586M1523 protein 


16 


• 4.5 


31104 


W27770 


Hs.301756 ESTs, Weakly similar to T31475 hypothetl 


0.9 


3.5 


31107 


BE620886 


Hs.75354 GCN1 (general control of amino-acid synt 


2.1 


4.5 


31109 


BE564123 


Hs.23060 DKFZP564F0522 protein 


11 
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31136 


AB033099 


Hs.23413 KIAA1273 protein 


U 


4.2 


31148 


AW953575 


Hs.303125 p53-induced protein PIGPC1 
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X77753 


Hs.23582 tumor-associated calcium signal transduc 


3.4 
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31156 


AI472209 


Hs.323117 ESTs 
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31164 


AW013807 


Hs.182265 keratin 19 


3.3 


2.4 


31181 


H25094 


Hs.293663 ESTs, Moderately similar to 138022 hypot 
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31194 


AW864222 


Hs.24083 KIAA0997 protein 


14 


3.8 


31199 


AW979155 


Hs.298275 amino add transporter 2 


U 


8.5 


31215 


AL050107 


Hs.2434 1 transcriptional co-activator with PDZ-bl 


0.7 


4.7 


31216 


AI815486 


Hs.243901 Homo sapiens cDNA FU20738 fis, done HE 


2.1 


8.2 
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D89053 


Hs,268012 fatty-acid-Coenzyme A iigase, long-chain 


1.7 


3.5 


31237 


AW956868 


Hs.24608 DKFZP564D177 protein 


13 
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AU077158 


Hs.24930 tubulin-specrficchaperonea 


1.6 
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AU077002 


Hs.24950 regulator of G-protein signalling 5 


14 
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AI750575 


Hs.173933 nuclear factor l/A 


3.3 
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AW293399 


Hs. 1 44904 nuclear' receptor co-repressor 1 


16 
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NM_006052 


Hs.26146 Down syndrome critical region gene 3 
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111 
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NM_014810 


Hs.92200 KIAA0480 gene product 


5 


2 


31492 


AI452601 


Hs.288869 nuclear receptor subfamily 2, group F, m 


0.9 
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AW960146 


Hs.284137 hypothetical protein FU12888 
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BE270734 


Hs.2795 lactate dehydrogenase A 
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AB040927 


Hs.301804 KIAA1494 protein 
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AU076408 


Hs.28309 UDP-glucose dehydrogenase 


13 
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31534 


AF157326 


Hs.184786 TBP-interacting protein 


13 
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T47364 


Hs.278613 interferon, atpha-inducible protein 27 


15 
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AA936296 


Hs.234265 DKFZP586G011 protein 
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C18825 


Hs.29191 epithelial membrane protein 2 


13 


6.2 


31609 


D83032 


Hs. 169984 nuclear protein 
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BE514605 


Hs.289092 Homo sapiens cDNA: FU22380 fis, clone H 
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H03514 


Hs.10130 ESTs 
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C19034 


Hs.288613 Homo sapiens cDNA FU14175 fis, clone NT 3.2 
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AF103798 


Hs.30819 hypothetical protein 


13 
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AW160865 


Hs.30886 cytochrome c oxidase subunit Vila polype 


13 
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AF017986 


Hs.31386 secreted frizzfed-related protein 2 


10.6 
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AI805664 


Hs.31731 peroxlredoxln 5 


11 
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AF077036 


Hs.31989 DKFZP586G1722 protein 


16 
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BE407127 
AB001636 
BE616412 
R60366 
BE513940 
AL042986 
BE281316 
H51066 
AW001741 
N73222 
A1634651 ' 
RCLH15847_s 
RCJV84712 
X14008_ma1J 
RC.H86543J 
H07011 

R<LAA164586_s 

RC.AA070485 

RC H98714.S 

RC.jAA406145J 

AA458584 

AA031548 

X02761 

RC^AA487193 

R25326 

R<LAA393805 

RC.AA449333 

RC.jAA287681_s 

RC.M490864 

RC_C14243J 

R21443 

RCJW251902 

M21121_s 

CO0O38_s 

Y00503 

RC R27006J 

RC_AA416886 

RCJW460450 

RCJ^A488433 

RC.AA278400J 

U28831 

RC.M199588 



Hs.337531 
Hs.168005 
Hs.177656 
Hs.7678 
Hs.178761 
Hs.279789 
Hs.324278 
Hs.235887 
Hs.241469 
Hs.279609 
Hs.8997 
Hs.5683 
Hs.286218 
Hs.5822 
Hs.6101 
Hs.7857 
Hs.47334 
Hs.23581 
Hs.273193 
Hs.279009 
Hs.30250 



Homo sapienscDNA FU 12843 fis, clone NT 3.6 

calclneurin-blnding protein calsarcln-1 3.3 

mitochondrial ribosomat protein L43 0.9 

DC2 protein 3 

stroma) antigen 2 6.2 

mitochondrial Import receptor Tom22 \2 

protein kinase, cAMP-dependent, regulate 1 

junctional adhesion molecule 1 1 .5 

kary opherin alpha 1 (importin alpha 5) \2 

CGI-39 protein; ceO death-regulatory pr 1 Z 

reticuton 4 \2 

hypothetical protein FU21776 2 

ESTs, Weakly similar to AF220049 1 uncha 1.3 

liver-specific bHLH-Zip transcription fa 1 .5 

putative heme-binding protein 2 

ESTs, Moderately similar to 138022 hypot 1 .7 

Homo sapiens cDNA RJ 1 3372 Ms, done PL 33 

calmodulin 1 (phosphorylase kinase, delt 1 .3 

cellular retmoic acid-binding protein 1 1.1 

26S proteasome-associated pad1 homoiog 3.2 

historte deacetylase 3 2.8 
Homo sapiens mRNA; cDNA DKFZp566M063 (fr 

HMT1 (hnRNP methyltransferase, S. cerevi 1 & 

hypothetical protein 1.1 

mitochondrial carrier homoiog 2 1 .5 

heat shock 70kD protein 1 A 1 .3 

DEAD/H (Asp-Glu-AlaAsp/His) box polypep 1.6 

junctional adhesion molecule 1 1 2 

Homo sapiens cDNA: RJ22120 lis, clone H 2 

hypothetical protein MGC3178 1.1 

erythrocyte membrane protein band 0.5 

hypothetical protein FU14495 2.5 

leptin receptor gene-related protein 1 .1 

hypothetical protein HJ 10706 1 .4 

matrix Gla protein 4 

v-maf musculoaponeurotic fibrosarcoma (a 0.8 

peptidylprory! isomerase B (cydophiiin B) 1 .8 

calumenln 3.5 

lysozyme (renal amyloidosis) 0.9 
ESTs ,1.8 

ESTs; Weakly similar to SAS [H.sapiens] 1.8 

ESTs 

Homo sapiens clone 23967 3.4 

ESTs 1.6 

ESTs 

SRY (sex determining region Y)-box 4 3.4 
cell division cycle 42 (GTP-binding protein; 25kD) 



fibronecfln 1 
secreted frizzled-related protein 4 
Homo sapiens mRNA for putative vacuolar 
ESTs; Weakly similar to (defline not 
ESTs 

ESTs; Highly similar to heat shock factor 
ESTs; Highly similar to heat shock factor 
ESTs 



3.6 
47 
0.9 
1.1 
2.9 
ESTs 
1.4 
1.7 
1.6 
2.2 
0.9 
2.8 
3.1 
1.6 
3.1 
1.5 



small inducible cytokine A5 (RANTES) 
ESTs 
keratin 19 
ESTs 

ESTs; Weakly similar to predicted using 
fibroblast growth factor receptor 2 (bacteria- 
ESTs; Weakly similar to deduced amino acid 1.1 

Homo 

Human protein Immune-reactive with antl-PTH4.4 
Homo sapiens adin-related protein Arp3 (ARP3) 
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3.6 

I. 6 
4.2 
173 
4 

6.6 
3.5 
4.7 
* 5.6 
3.5 
13.9 
5.1 
3.9 
6.2 . 

II. 3 
3.6 
2.8 
4.7 
3.7 
2.5 
22 

1.1 5.2 
8.8 

5.6 

6.1 

7.6 

6.5 

3.5 

5.7 

6.2 

3.7 

4.9 

3.6 

3.5 

11.2 

5.6 

4.8 

4.6 

4.5 

6.6 

3.9 

6.2 0.8 
£6 

3.5 

4.6 ' 3 
0.4 

3.1 3.9 

15.2 

4 

5 

8.4 
4.6 

1.3 4 
5 

5 

3.7 
3.8 
9.9 
4.8 
1.1 
3.7 
3.1 
3.7 
4 

sapiens HRIHFB2115 mRNA; partial cds 1.5 
0.6 

1.8 47 
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AF006082 


Homo sapiens actin-related protein Arp2 (ARP2) 


1.6 


10.9 


RO.H9Q899 


desmoplakin (DPI; DPI!) 5.4 


5.5 




RCJV95070 


desmoplakin (DPI; DPII) 5 


2.6 




RCJ90946J 


Human mRNA for KIAA263 gene; complete cds 


1.1 


3.9 


D87258 


protease; serine; 1 1 (IGF binding) 2.4 


35 




AA313414_s 


ESTs; Weakly similar to cDNA ESTEMBLT1157 


1.5 


■ 5.3 


RC_H73484_s 


ESTs; Weakly similar to similar to Yeast 1.3 


6.3 




AFFX-Hl) M I SGF3A/M97935_3 


2.3 


13.5 


AFFX-HUMRGE/M10098_5 1.1 


7.9 




AFFX-M27830J 




0.5 


7.4 


AFFX-M27830.5 




0.6 


' 5.4. 


RC_AA083431J . 


ESTs 


0.8 


4.1 


RC_T63769J 


ferritin; light polypeptide 1 .1 


3.7 
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TABLE 8A 



Table 8A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 8. For
each probeset, we have listed the gene cluster number from which the oligonucleotides were
designed. Gene clusters were compiled using sequences derived from Genbank ESTs and
mRNAs. These sequences were clustered based on sequence similarity using Clustering and
Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers for
sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accessions 

108469 116761_1 AA079487 AA128547 AA128291 AA079587 AA079600
125076 190299_1 AA973971 T88817 AA253263

114636 109698J AA075488 AA1 29081 AA074851 AA082852AA074732 M084908 AA084751 AA076042 AA131172 AA085374 AA079519 

AA074510 AA1 1 3824 AA1 02437 AA070833 M070143 M084693 AA084389 AA076373 AA075492 AA062834 AA084335 
AA078829 AA079344 AA069916 AA079275 AA070914 AA654069 AA081976 AA080957 AA0831 15 AA070942 AA085296 

123526 genbank_AA608657 AA608657

123533 genbank_AA608751 AA608751

125090 genbank_T91518 T91518

125154 genbank_W38419 W38419

118475 genbank_N66845 N66845

118505 genbank_N67343 N67343

101046 entrez_K01160 K01160

129982 221J67 . Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 M327072 Z14166 Z14167 Z14165 

AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF062115 Z47240 AF062263 AF062261 AF062223 AF062211 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z11946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L1.1699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087L12098 U68231 L121 84 AF062242AF022000U64499 U00570 AI268604Y1 5773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF1 15130 AF1 151 17 Z92896 U21254 Z92895 AF1 151 12 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF1 15109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF022011 L43085U00536L19915L43083AF090420AB021530AB021534Z96955Z96954 Z33903 X62964AF103282U00509 

AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
AF017458 AJ008207 AJ0081 83 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NfvL002179 AW405146 AA301 091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW405586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA603112 
AW514864 AA485775 H64492 AW404789 AM87630 M715498 AA295885 T27613 T98113 

108470 genbank_AA079500 AA079500
101447 entrez_M21305 M21305
124447 genbank_N48000 N48000
101624 entrez_M55998 M55998

131791 221 JB0 X62111 S67984 AJ131056 Z47243 Z47235 AF062268 Z47237 AJ131058 AF062130 AF062282 X62108 AA385989 AA464794 

X69861 AW402964 M90808 Z98735 Z98734 Z98736 AF035018 X79161 U00545 AF174046 AF174071 U00552 U96288 
AF021989 AF062255 AF174061 S73953 AF062135 AF062155 X64147 U00555 AF174070 AF129754 U86789 Z98714 Z98738 
S75886 AJ008175 AJ008186 AJ008188 Y08403 H15753 H22208 AA327241 AW405737 H42300 H39677 H26239 H26903 
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H45128 R66072 AA327565 AI660584 AW361537 AW383759 S71043 H15014 H45570 H42819 H45523 H45134 R72043 
H24543 H27636 H27610 T28147 H25496 AW364071 AW3B4072 H45561 H45556 H42605 AI735017 T47421 R48719 H27570 
H44599 AI459598 H42347 H41938 H24993 AA345888 H22339 AI538691 AJ012264 AA664201 AI880450 AA327310 AI991250 
AI833028 AW001210 AI956075 H30467 AA326915 H41943 AI749266 AI744441 AA327377 AW512326 AI735170 H01634 
AI587047 AI571623 AA327486 AA327103 AA327195 AA326973 T28143 

124842 217726 1 R56485 R37248 R59992 

103758 AA084874_f_at AA084874J 

130064 221 J64 X57815 L29157 AA367448 AW328098 AW404536 L29156 L29163 H27683 R83195 AA295096 AW327822 H25458 AW404692 

X5781 9 X57823 AW405604 AW404447 Z34914 AW406542 AA427726 AA604389 AW405606 AW405918 AW4051 17 
AW407182 L03632 AW405058 L03627 AW407470 R72738 L21959 AW375738 X87888 AF124166 AF054638 AJ241406 
U09902 AVV405284 M12025 AA360219 L03631 L03629 AW327727 AF194810 AF058076 AF194686 AF063774 AFQ63737 
AF063755 AF063740 AF063739 AF063738 S73129 L43088 AF063775 L33026 AF060138 AF194813 AF194812 AF194811 
AF194809 AF194814 AF194808 AF054505 AF0645O3 AF063765 AF063757 AF063752 AF063716 AF063736 AF194806 
AF058077 AF063747 AF063772 AF063781 AF060137 AF194805 AF060134 AF060132 AF058074 AF063754 AF063704 
U38589 Z18332 AF050122 AF194807 AF060135 AF064506 AF064504 AF063773 L26540 AF058072 AF060131 AF064500 
AF060136 AF064513 AF194683 AF194711 AF058075 AF063717 219546 AF194581 X72746 U96393 U09890 X98897 
AF194592 M80916 AJ241405 AF194632 AJ388559 AF194625 274694 AF194588 AF194601 U09901 U09911 U09892 
AJ010336 AJ006162 AJ249377 AJ241414 AA327392 U97248 X72747 Z46850 X95739 AW406701 AJ243109 AF194609 
S80758 AF194595 AF194596 L22483 Z70262 D84141 AW405758 AF001788 AF194580 D84143 U76684 AF194593 L03630 
X87892 X91134 U21249 246346 AJ132426 AF103659 AJ233718 AF021038 AJ233727 H24657 U09882 S75627 AA573599
AF047231 AF047232 AF103616 AW404484 Z46848 S76132 AF103663 AF103713 Y17940 AF047216 AF103595 Y17956 
AF052799 AF052797 AF052802 AF052798 AF052801 AF052794 AF052796 AF047218 AF052800 AF047217 AF052795 
AF032351 AF103701 AF103708 AF103710 AF103706 S62232 AF103645 AF103632 AF103647 AF103644 AF103640 
AF103692 AF093581 AF103620 AW405934 AI445389 AW383753 AA360256 AF099676 H21654 H39501 AI620828 H53689 
W26785 AW384496 AW407708 AA541663 AA911602 AI821461 AA588300 AA327050 H42717 AI951280 AA421322 AI923193
AA864302 H25133 D87023 J03011 M61771 D87017 AA526865 AA253450 

130232 18831.2 U29463 U06715 W24970 AA584303 T39581 AA155603 AA305043 AA429426 W05664 AA102382 AA482044 W24487 AA319060

T88946 F10106 AA232161 AA243117 AA158937 M100864

109097 genbank_AA167512 AA167512
TABLE 9: Figure 9 from BRCA 001-2 US

Table 9 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title

Pkey ExAccn UnigeneID Unigene Title

100690 AA383256 Hs.1657 estrogen receptor 1
102211 BE314524 Hs.78776 putative transmembrane protein
103587 BE270266 Hs.82128 5T4 oncofetal trophoblast glycoprotein
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal syndrome 1
105038 AW503733 Hs.9414 KIAA1488 protein
105500 AW602166 Hs.222399 CEGP1 protein
105990 AI690586 Hs.29403 hypothetical protein FLJ22060
106155 AA425414 Hs.33287 nuclear factor I/B
106373 AW503807 Hs.21907
106414 BE568205 Hs.28827 mitogen-activated protein kinase kinase kinase 2
110009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal
111900 AF131784 Hs.25318 Homo sapiens clone 25194 mRNA sequence
114540 AI904232 Hs.75323 prohibitin
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4
117280 M18217 Hs.172129 Homo sapiens cDNA: FLJ21409 fis, clone COL03924
119771 AI905687 Hs.2533 EST
121723 AA243499 Hs.104800 hypothetical protein FLJ10134
124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypothetical protein YGL050W
131148 AW953575 Hs.303125 p53-induced protein PIGPC1
132371 AA235448 Hs.46677 PRO2000 protein
134169 AI690916 Hs.178137 transducer of ERBB2, 1
302235 AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112
452410 AL133619 Hs.29383 Homo sapiens mRNA; cDNA DKFZp434
TABLE 10: Figure 10 from BRCA 001-3 PCT

Table 10 depicts a preferred group of genes upregulated in tumor tissue compared to normal 
breast tissue. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to body
R3: Ratio of 75th percentile body to tumor
R4: Ratio of tumor to normal breast tissue

Pkey ExAccn UnigeneID Unigene Title



100082 AA130080 Hs.4295 proteasome (prosome, macropain) 26S subu
100103 AA380887 Hs.5085 dolichyl-phosphate mannosyltransferase p
100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin
100154 H60720 Hs.81892 KIAA0101 gene product
100157 D14661 Hs.119 Wilms' tumour 1-associating protein
100169 AL037228 Hs.82043 D123 gene product
100203 BE242284 Hs.172199 adenylate cyclase 7
100210 D26361 Hs.3104 KIAA0042 gene product
100219 AW972300 Hs.118110 bone marrow stromal cell antigen 2
100234 D29677 Hs.3085 KIAA0054 gene product; Helicase
100248 NM_015156 Hs.78398 KIAA0071 protein
100252 NM_006207 Hs.170040 platelet-derived growth factor receptor-
100260 D38491 Hs.322478 KIAA0117 protein
100279 D42084 Hs.82007 KIAA0094 protein
100286 BE247550 Hs.86859 growth factor receptor-bound protein 7
100294 AA331881 Hs.75454 peroxiredoxin 3
100335 AW247529 Hs.6793 platelet-activating factor acetylhydrola
100365 AI878927 Hs.79284 mesoderm specific transcript (mouse) hom
100375 D80004 Hs.75909 KIAA0182 protein
100409 D86957 Hs.80712 KIAA0202 protein
100410 D86961 Hs.79299 lipoma HMGIC fusion partner-like 2
100414 NM_014735 Hs.82292 KIAA0215 gene product
100418 D86978 Hs.84790 KIAA0225 protein
100438 AA013051 Hs.91417 topoisomerase (DNA) II binding protein
100439 AA347720 Hs.122669 KIAA0264 protein
100448 AF234887 Hs.57652 cadherin, EGF LAG seven-pass G-type rece
100449 D87470 Hs.75400 KIAA0280 protein
100522 X51501 Hs.99949 prolactin-induced protein
100552 AA019521 Hs.301946 lysosomal
100643 NM_005032 Hs.4114 plastin 3 (T isoform)
100661 BE623001 Hs.132748 Homo sapiens ribosomal protein L39 mRNA,
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian
100667 L05424 Hs.169610 CD44 antigen (homing function and Indian
100745 BE207168 Hs.144630 nuclear receptor subfamily 2, group F, m
100774 J05581 Hs.89603 mucin 1, transmembrane
100783 AF078847 Hs.191356 general transcription factor IIH, polype
100821 M26460 gb:Homo sapiens (clone 104) retinoblasto
100864 BE563957 Hs.74861 activated RNA polymerase II tran
100877 X80821 Hs.27973 KIAA0874 protein
100892 BE245294 Hs.180789 S164 protein
101038 BE297139 Hs.79411 replication protein A2 (32kD)
101046 K01160
101079 BE264901 Hs.250502 carbonic anhydrase VIII



R1 


R2 


R3 


R4 


42 


152 


36 


12.2 




123 


13 


5 




244 


19 


99 




1030 




5 


A 1 


320 


78 


106 


47 


119 


26 


3 


5.1 


106 


21 


9.2 


4.7 


47 


1 


4.3 


4.7 


47 


4 


0.7 


3.8 


350 


93 


1.9 


4.1 


64. 


16 


3 


3.4 


77 


23 


5.9 


4.5 


45 


4 


4 


5.9 


59 


1 


2.6 


3.5 


96 


28 


1.3 


3.1 


306 


98 


1.5 


12.8 


128 


1 


11.7 


U 


1B7 


44 


5.4 


4.5 


129 


29 


3.1 


3.5 


78 


23 


4.8 


10.2 


102 


1 


4.8 


4 


40 


1 


3.8 


3.2 


32 


2 


29 


3.6 


36 


7 


3.2 


5.6 


76 


14 


2 


3.5 


35 


9 


3.1 


5.5 


145 


27 


2.2 


3.4 


34 


1 


1.2 


22.7 


760 


34 


1.4 


14.4 


144 


9 


4.7 


4.1 


259 


63 


1.9 


3.3 


116 


36 


2.2 


8.5 


85 


1 


3.2 


3 


594 


201 


2.3 


5 


82 


17 


0.9 


3.5 


37 


11 


2.8 


9.7 


97 


10 


7.2 


3.3 


33 


1 


0.8 


3.7 


477 


130 


3.1 


6.3 


63 


4 


5.7 


4.7 


47 


1 


4.2 


3.8 


115 


30 


7.1 


t 3.9 


390 


100 


11.1 


3.9 


39 


8 


3.6 
101084 
101104 
101185 
101188 
101201 
101232 
101275 
101300 
101396 
101447 
101448 
101470 
101478 
101484 
101507 
101621 
101624 
101664 
101684 
101724 
101754 
101767 
101791 
101794 
101803 
101809 
101839 
101888 
101960 
102009 
102095 
102123 
102125 
133765 M62194 Hs.75929
133780 AA557660 Hs.76152
133784 BE622743 Hs.301064
133814 NM_002462 Hs.76391
133829 AW630088 Hs.76550
133845 AA147026 Hs.76704
133913 AU076964 Hs.7753
133968 AA355986 Hs.232068
133990 R48316 Hs.7822
133999 AA535244 Hs.78305
134032 NM_005025 Hs.78589
134064 AF091622 Hs.78893
134087 U51166 Hs.173824
134089 R51273 Hs.79029
134095 NM_004354 Hs.79069
134098 BE513171 Hs.79086
134110 U41060 Hs.79136
134125 NM_014781 Hs.50421
134246 D28459 Hs.80612
134257 C05768 Hs.8078
134272 X76040 Hs.278614
134282 R45621 Hs.81057
134288 AI022650 Hs.8117
134321 BE538082 Hs.8172
134326 AW903838 Hs.81800
134328 AW959281 Hs.8184
134348 AW291946 Hs.82065
134359 NM_001982 Hs.199067
134367 AA339449 Hs.82285
134374 N22687 Hs.8236
134380 AU077143 Hs.179565
134395 AA456539 Hs.8262
134401 AI916662 Hs.211577
134405 AW067903 Hs.82772
134415 AI750762 Hs.82911
134417 NM_006416 Hs.82921
134419 W95642 Hs.82961
134421 AU077196 Hs.82985
134436 U29344 Hs.83190
134485 X82153 Hs.83942
134487 AF061739 Hs.83954
134495 D63477 Hs.84087
134520 BE091005 Hs.74861
134542 M14156 Hs.85112
134570 U66615 Hs.172280
134590 AW903849 Hs.173840
134604 NM_002884 Hs.865
134612 AW068223 Hs.171581
134643 AW299723 Hs.87223
134654 AK001741 Hs.8739
134656 AI750878 Hs.87409
134672 AF271212 Hs.322901
134700 AK000606 Hs.8868
134711 X04011 Hs.88974
134722 AF129536 Hs.284226
134856 BE281128 Hs.9030
134880 AI879195 Hs.90608
134917 X87241 Hs.166994
134921 AL137491 Hs.125511
134982 AK002085 Hs.92308
134989 AW968058 Hs.92381
135029 H58818 Hs.187579
135035 AL034344 Hs.284186
135051 AI272141 Hs.83484
135062 AK000967 Hs.93872
135073 W55956 Hs.94030
135098 AW274526 Hs.277721
TABLE 10A

Table 10A shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 10.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

123619 371681_1 AA602964 AA609200
104602 524482_1 H47610 R86920
121581 283769_1 AA416568 AA442889 AA417233 AA442223
123523 genbank_AA608588 AA608588
100821 tigr_HT4306 M26460 1)091 16
125091 genbank_T91518 T91518
125150 NOT_FOUND_entrez_W38240 W38240
118475 genbank_N66845 N66845
104787 genbank_AA027317 AA027317
106055 genbank_AA417034 AA417034
113702 genbank_T97307 T97307
101046 entrez_K01160 K01160
101447 entrez_M21305 M21305
101624 entrez_M55998 M55998
124677 genbank_R01073 R01073
110581 genbank_H61560 H61560
119023 genbank_N98488 N98488
110775 genbank_N22414 N22414
112092 genbank_R44538 R44538
112253 genbank_R51818 R51818
107014 genbank_AA598820 AA598820
114988 genbank_AA251089 AA251089
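
The row layout described for Table 10A (a Pkey, a CAT number, then one or more Genbank accession numbers) can be read mechanically. The following Python sketch is illustrative only: the names `ClusterRow`, `parse_cluster_row` and `index_by_pkey` are hypothetical helpers, not part of the disclosure, and the sketch assumes that fields are separated by whitespace and that every field after the second is an accession number.

```python
from typing import Dict, List, NamedTuple


class ClusterRow(NamedTuple):
    pkey: str              # unique Eos probeset identifier
    cat_number: str        # gene cluster ("CAT") number
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split one whitespace-delimited row into Pkey, CAT number, accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accession(s): {line!r}")
    return ClusterRow(fields[0], fields[1], fields[2:])


def index_by_pkey(lines: List[str]) -> Dict[str, ClusterRow]:
    """Build a Pkey -> row lookup, skipping blank lines."""
    rows = (parse_cluster_row(line) for line in lines if line.strip())
    return {row.pkey: row for row in rows}


# Two rows taken from Table 10A.
sample = [
    "113702 genbank_T97307 T97307",
    "110775 genbank_N22414 N22414",
]
table = index_by_pkey(sample)
print(table["113702"].accessions)  # ['T97307']
```

Keying the lookup on the Pkey mirrors how the table is used: a probeset without a UnigeneID in Table 10 is resolved to its cluster and accession numbers here.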
TABLE 11: Figure 11 from BRCA 001-3 PCT

Table 11 depicts a preferred group of genes upregulated in tumor tissue compared to normal
breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to normal body
R3: Ratio of 75th percentile normal body to tumor
R4: Ratio of tumor to normal breast tissue

Pkey ExAccn UnigeneID Unigene Title

100131 D12485 Hs.11951 ectonucleotide pyrophosphatase/phosphodi
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin
100522 X51501 Hs.99949 prolactin-induced protein
100666 L05424 Hs.169610 CD44 antigen (homing function and Indian
101104 AW862258 Hs.169266 neuropeptide Y receptor Y1
101478 NM_002890 Hs.758 RAS p21 protein activator (GTPase activa
101724 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD)
101754 S70114 Hs.239489 TIA1 cytotoxic granule-associated RNA-bi
101888 AL049610 Hs.95243 transcription elongation factor A (SII)-
102165 BE313280 Hs.159627 death associated protein 3
102304 AF015224 Hs.46452 mammaglobin 1
102348 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member
102457 NM_001394 Hs.2359 dual specificity phosphatase 4
102567 U63830 Hs.146847 TRAF family member-associated NFKB activ
102823 D85390 Hs.5057 carboxypeptidase D
103557 AL133415 Hs.297753 vimentin
103613 NM_000346 Hs.2316 SRY (sex determining region Y)-box 9 (ca
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal
104667 AI239923 Hs.30098 ESTs
104804 AI858702 Hs.31803 ESTs, Weakly similar to N-WASP [H.sapien
104807 AI139058 Hs.125790 leucine-rich repeat-containing 2
104896 AW015318 Hs.23165 ESTs
104943 AF072873 Hs.114218 frizzled (Drosophila) homolog 6
105038 AW503733 Hs.9414 KIAA1488 protein
105329 AA234561 Hs.22862 ESTs
105500 AW602166 Hs.222399 CEGP1 protein
105516 AK001269 Hs.30738 hypothetical protein FLJ10407
105730 AW377314 Hs.5364 DKFZP564I052 protein
106012 AI240665 Hs.8895 ESTs
106095 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript
106155 AA425414 Hs.33287 nuclear factor I/B
107102 AB037765 Hs.30652 KIAA1344 protein
107136 AV661958 Hs.8207 GK001 protein
107151 AW378065 Hs.8687 ESTs
107922 BE153855 Hs.61460 Ig superfamily receptor LNIR
108339 AW151340 Hs.51615 ESTs, Weakly similar to ALU7_HUMAN ALU S
109112 AW419196 Hs.257924 hypothetical protein FLJ13782
109292 AW975746 Hs.188662 KIAA1702 protein
109415 U80736 Hs.110826 trinucleotide repeat containing 9
109912 AW390822 Hs.301528 L-kynurenine/alpha-aminoadipate aminotra
110009 BE075297 Hs.6614 ESTs, Weakly similar to A43932 mucin 2 p
110915 BE092285 Hs.59724 hypothetical protein FLJ13187
111164 N46180 Hs.122489 Homo sapiens cDNA FLJ13289 fis, clone OV
111179 AK000136 Hs.10760 asporin (LRR class 1) 25.1 288 12 6.7
111190 AK002055 Hs.151046 hypothetical protein FLJ11193 6.3 63 1 5.8
111223 AA852773 Hs.334838 KIAA1866 protein 3.6 402 112 4.9
111357 BE314949 Hs.87128 hypothetical protein FLJ23309 3.8 425 111 4
112244 AB029000 Hs.70823 KIAA1077 protein 5.7 567 100 6.7
113047 AI571940 Hs.7549 ESTs 9.6 124 13 9
113702 T97307 gb:ye53h05.s1 Soares fetal liver spleen 12.3 129 11 11.7
114124 W57554 Hs.125019 lymphoid nuclear protein (LAM) mRNA 24.2 242 10 5.6
114138 AW384793 Hs.15740 Homo sapiens mRNA; cDNA DKFZp434E033 (fr 6.7 67 1 6.3
114768 AF212848 Hs.182339 ets homologous factor 13.7 137 1 8.9
114860 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 9.1 91 1 7.6
114965 AI733881 Hs.72472 BMP-R1B 35.9 359 10 29.7
114986 AA251089 gb:zs04!05.s1 NCI_CGAP_GCB1 Homo sapiens 11.5 115 1 6.9
115206 AW183695 Hs.186572 ESTs 5.8 58 1 5
115719 AW992405 Hs.59622 Homo sapiens, clone IMAGE:3507281, mRNA, 7.6 144 19 13.9
115844 AI373062 Hs.332938 hypothetical protein MGC5370 6.2 62 1 5.4
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 1.8 1047 596 1.6
116786 H25836 Hs.301527 ESTs, Moderately similar to unknown [H.s 22.5 228 9 12.4
117280 M18217 Hs.172129 Homo sapiens cDNA: FLJ21409 fis, clone C 3.9 322 83 4.4
117412 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 17.4 174 9 6.9
118472 AL157545 Hs.42179 bromodomain and PHD finger containing, 3 14.5 145 1 2.4
119271 AI061118 Hs.65328 Fanconi anemia, complementation group F 8.2 82 1 6.4
119771 AI905687 Hs.2533 EST 3.5 2073 595 2.1
120562 BE244580 Hs.302267 hypothetical protein FLJ10330 8.5 127 15 1.6
121463 AK000282 Hs.239681 hypothetical protein FLJ20275 10.3 103 1 9.3
121723 AA243499 Hs.104800 hypothetical protein FLJ10134 2.9 214 74 3.7
122963 AA478446 Hs.69559 KIAA1096 protein 7.2 72 1 5.7
123137 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 9.9 351 36 13.9
123619 AA602964 gb:no97c02.s1 NCI_CGAP_Pr2 Homo sapiens 8.5 85 1 4.3
123709 AA706910 Hs.112742 ESTs 3.9 60 16 4.8
124006 AI147155 Hs.270016 ESTs 5.8 321 55 17
124059 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 10.4 880 85 5.3
124308 AA249027 Hs.241507 ribosomal protein S6 10.5 105 1 9.9
125279 AW401809 Hs.4779 KIAA1150 protein 13.1 131 1 5.1
125617 AA287921 Hs.164950 ESTs 6.7 67 1 6
127439 D60237 Hs.14368 SH3 domain binding glutamic acid-rich pr 30.6 306 4 265
128305 AI954968 Hs.279009 matrix Gla protein 7.5 75 1 6.5
128482 AI694143 Hs.296251 programmed cell death 4 7.2 72 1 5.8
128790 AF026692 Hs.105700 secreted frizzled-related protein 4 17.4 409 24 7.8
128925 R67419 Hs.21651 Homo sapiens cDNA FLJ12900 fis, clone NT 7.1 392 56 3.6
129017 AA115333 Hs.107968 ESTs 8.2 82 1 7.4
129229 AF013758 Hs.109643 polyadenylate binding protein-interactin 7.1 71 1 6.2
129337 NM_014918 Hs.110488 KIAA0990 protein 9.5 95 1 8.5
129366 BE220806 Hs.184697 Homo sapiens clone 23785 mRNA sequence 7.1 150 21 145
129821 AB028945 Hs.12696 cortactin SH3 domain-binding protein 11.4 114 1 10
130036 BE061916 Hs.125849 chromosome 8 open reading frame 2 6.7 67 1 5.7
130057 AF027153 Hs.324787 solute carrier family 5 (inositol transp .1111
130095 AK001635 Hs.14838 hypothetical protein FLJ10773 14.6 219 15 7.6
130343 AB040914 Hs.278628 KIAA1481 protein 13.2 331 25 12.4
130385 AW067800 Hs.155223 stanniocalcin 2 72.2 722 1 1.9
130407 BE385099 Hs.334727 hypothetical protein MGC3017 6.5 65 4 5.3
130441 U63630 Hs.155637 protein kinase, DNA-activated, catalytic 6.1 61 1 5.7
130455 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 10.8 706 66 9.2
130604 AA383256 Hs.1657 estrogen receptor 1 32.2 322 1 4.7
130617 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 10 100 1 7.6
130712 AJ271881 Hs.279762 bromodomain-containing 7 17.5 175 2 12.8
131148 AW953575 Hs.303125 p53-induced protein PIGPC1 3.8 585 153 3.7
131388 NM_014810 Hs.92200 KIAA0480 gene product 7.6 76 1 5
131564 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 4.7 381 81 6.4
131742 AA961420 Hs.31433 ESTs 11.7 117 1 10.1
131877 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 6.8 68 1 5.6
131985 AA503020 Hs.36563 hypothetical protein FLJ22418 40.2 402 1 4
132316 U28831 Hs.44566 KIAA1641 protein 18.6 186 10 15
132528 T78736 Hs.50758 SMC4 (structural maintenance of chromoso 9.3 93 1 8.4
132742 AA025480 Hs.292812 ESTs, Weakly similar to T33468 hypotheti 6.5 65 1 5.6
132990 X77343 Hs.334334 transcription factor AP-2 alpha (activat 12.7 311 25 24
133015 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 4.6 427 93 10.4
133199 AF231981 Hs.250175 homolog of yeast long chain polyunsatura 3 816 275 3.9
133240 AK001489 Hs.242894 ADP-ribosylation factor-like 1 8.1 81 1 4.6
133271 Z48633 Hs.283742 H.sapiens mRNA for retrotransposon 12.4 124 6 10.8
133640 AW246428 Hs.75355 ubiquitin-conjugating enzyme E2N (homolo 8.5 85 1 72
133746 AW410035 Hs.75862 MAD (mothers against decapentaplegic, Dr 9.3 93 1 7.8
133999 AA535244 Hs.78305 RAB2, member RAS oncogene family 7.8 78 1 5.6
134110 U41060 Hs.79136 LIV-1 protein, estrogen regulated 4.5 1472 330 2.1
134485 X82153 Hs.83942 cathepsin K (pycnodysostosis) 34.3 411 12 5.1
134654 AK001741 Hs.8739 hypothetical protein FLJ10879 6.4 64 1 5.1
134880 AI879195 Hs.90606 15 kDa selenoprotein 5.7 57 1 5
135029 H58818 Hs.187579 hydroxysteroid (17-beta) dehydrogenase 7 11.5 115 1 10
135389 U05237 Hs.99872 fetal Alzheimer antigen 20.6 206 4 19.1
128305 AI954968 Hs.279009 matrix Gla protein 9.4 94 3 5.3
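
Each complete data row of Table 11 ends with the four values R1 to R4 defined in the legend. The Python sketch below splits such a row from the right, so that titles containing spaces or digits stay intact. It also applies a plausibility check that is not stated anywhere in the text: in the rows printed here, R1 is numerically close to R2 divided by R3 once R3 is floored at 10. That relationship, the floor of 10, and the 10% tolerance are assumptions inferred from the printed values, and `RatioRow`, `parse_ratio_row` and `r1_looks_consistent` are hypothetical names. The check is offered only as a way to spot scanning errors such as a lost decimal point.

```python
from typing import NamedTuple


class RatioRow(NamedTuple):
    pkey: str
    ex_accn: str
    rest: str   # UnigeneID and title, kept together because titles contain spaces
    r1: float
    r2: float
    r3: float
    r4: float


def parse_ratio_row(line: str) -> RatioRow:
    """Read Pkey and ExAccn from the left and R1..R4 from the right."""
    fields = line.split()
    if len(fields) < 6:
        raise ValueError(f"row too short: {line!r}")
    r1, r2, r3, r4 = (float(x) for x in fields[-4:])
    return RatioRow(fields[0], fields[1], " ".join(fields[2:-4]), r1, r2, r3, r4)


def r1_looks_consistent(row: RatioRow, floor: float = 10.0,
                        rel_tol: float = 0.10) -> bool:
    """Assumed check: R1 is within rel_tol of R2 / max(R3, floor)."""
    expected = row.r2 / max(row.r3, floor)
    return abs(row.r1 - expected) <= rel_tol * expected


printed = parse_ratio_row("113047 AI571940 Hs.7549 ESTs 9.6 124 13 9")
# Hypothetical row illustrating a dropped decimal point in R1.
damaged = parse_ratio_row("000000 X00000 Hs.0 example 242 242 10 5.6")

print(r1_looks_consistent(printed))  # True
print(r1_looks_consistent(damaged))  # False
```

A row that fails the check is only a candidate for review against the original document; the check cannot say which of the three values was misread.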
TABLE 11A

Table 11A shows the accession numbers for those Pkeys lacking UnigeneIDs for Table 11.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

123619 371681_1 AA602964 AA609200
113702 genbank_T97307 T97307
114988 genbank_AA251089 AA251089
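
Because the "Accession" column holds Genbank accession numbers, a simple format check can flag tokens that were damaged in reproduction. The Python sketch below is a rough filter under stated assumptions: it accepts only one letter followed by five digits, two letters followed by six digits, or an `NM_` prefix followed by six to nine digits, which covers the forms that appear in these tables but is not a complete statement of accession formats. The name `suspicious_accessions` is hypothetical, and the last two tokens in the sample are illustrative damaged values.

```python
import re
from typing import Iterable, List

# Assumed accession shapes: A12345, AB123456, NM_123456 (not exhaustive).
ACCESSION = re.compile(r"^(?:[A-Z]\d{5}|[A-Z]{2}\d{6}|NM_\d{6,9})$")


def suspicious_accessions(tokens: Iterable[str]) -> List[str]:
    """Return the tokens that do not match any assumed accession shape."""
    return [t for t in tokens if not ACCESSION.match(t)]


sample = ["AA602964", "T97307", "NM_002890", "AW3B4072", "000596"]
print(suspicious_accessions(sample))  # ['AW3B4072', '000596']
```

A flagged token is not necessarily wrong, and a token that passes may still contain a misread digit, so the filter only narrows down what to compare against the source document.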
TABLE 12: Figure 12 from BRCA 001-3 PCT

Table 12 depicts a preferred group of genes upregulated in tumor tissue compared to normal
breast tissue.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue
R2: Ratio of 90th percentile tumor to body
R3: Ratio of 75th percentile body to tumor
R4: Ratio of tumor to normal breast tissue

Pkey ExAccn UnigeneID Unigene Title R1 R2 R3 R4

100131 D12485 Hs.11951 phosphodiesterase I (PC-1) 13.2 244 19 9.9
105500 AW602166 Hs.222399 ESTs 25.4 508 20 3
112244 AB029000 Hs.70823 KIAA1077 protein 5.7 567 100 6.7
114124 W57554 Hs.125019 ESTs 24.2 242 10 5.6
119771 AI905687 Hs.2533 ESTs 3.5 2073 595 2.1
121723 AA243499 Hs.104800 ESTs 2.9 214 74 3.7
128790 AF026692 Hs.105700 secreted frizzled-related protein 4 17.4 409 24 7.8
131148 AW953575 Hs.303125 ESTs 3.8 585 153 3.7
131985 AA503020 Hs.36563 ESTs 40.2 402 1 4
133199 AF231981 Hs.250175 Homo sapiens clone 23904 mRNA sequence 3 816 275 3.9
TABLE 13: Table 1 from BRCA 001-5 US

Table 13 depicts a preferred group of genes upregulated in breast cancer cells.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue

Pkey ExAccn UnigeneID Unigene Title R1

100038 M97935 control 16.7
100039 M97935 control 6.3
100040 M97935 control 8.3
100041 M97935 control 14.8
100082 AB003103 Hs.4295 proteasome (prosome; macropain) 26S sub 7.5
100091 AF000177 Hs.111783 Lsm1 protein 4.9
100100 AF006084 Hs.11538 actin related protein 2/3 complex; subunit 4.7
100103 AF007875 Hs.5085 dolichyl-phosphate mannosyltransferase p 13.4
100114 D00596 Hs.82962 thymidylate synthetase 15.9
100121 D10495 Hs.155342 protein kinase C; delta 4.6
100123 D10523 Hs.168669 oxoglutarate dehydrogenase (lipoamide) 7.5
100128 D11094 Hs.61153 proteasome (prosome; macropain) 26S sub 4.4
100131 D12485 Hs.11951 phosphodiesterase I/nucleotide pyrophosp 8.7
100137 D13627 Hs.15071 chaperonin containing TCP1; subunit 8 (t 9.5
100144 D13643 Hs.75616 Human mRNA for KIAA0018 gene; comp 6
100147 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin I-like 8.5
100154 D14657 Hs.81892 KIAA0101 gene product 10.5
100164 D14812 Hs.173714 MORF-related gene X 4.6
100169 D14878 Hs.82043 D123 gene product 7.9
100190 D21090 Hs.178658 RAD23 (S. cerevisiae) homolog B 5.6
100203 D25538 Hs.172199 adenylate cyclase 7 9.9
100209 D26308 Hs.76289 biliverdin reductase B (flavin reductase (N 4.9
100215 D26598 Hs.82793 proteasome (prosome; macropain) subunit 14.2
100216 D26599 Hs.1390 proteasome (prosome; macropain) subunit 113
100219 D28137 Hs.118110 bone marrow stromal cell antigen 2 5.7
100227 D28915 Hs.82316 interferon-induced; hepatitis C-associated 5.7
100246 D31888 Hs.78398 KIAA0071 protein 7.4
100287 D43950 Hs.1600 chaperonin containing TCP1; subunit 5 (e 5.6
100294 D49396 Hs.75454 antioxidant protein 1 12.9
100307 D50525 Hs.699 hypothetical protein 8.4
100335 D63391 Hs.6793 6.8
100340 D63487 Hs.82553 KIAA0153 protein 4.4
100355 D78129 Hs.71465 Homo sapiens mRNA for squalene epoxid 12.6
100363 D78514 Hs.78563 ubiquitin-conjugating enzyme E2G 1 (hom 4.6
100368 D79987 Hs.153479 extra spindle poles; S. cerevisiae; homolo 6.5
100372 D79997 Hs.184339 KIAA0175 gene product 8.4
100375 D80004 Hs.75909 KIAA0182 protein 4.5
100379 D62060 Hs.278721 Ke4 gene; mouse; human homolog of 8.1
100387 D83777 Hs.75137 KIAA0193 gene product 10.7
100393 D84145 Hs.39913 novel RGD-containing protein 7.2
100398 D84557 Hs.155462 minichromosome maintenance deficient (m 11
100405 D86425 Hs.82733 nidogen 2 5.4
100406 D86479 Hs.118397 AE-binding protein 1 4.3
100409 D86957 Hs.80712 KIAA0202 protein 119
100421 D86985 Hs.79276 Human mRNA for KIAA0232 gene; comp 9.7
100446 D87464 Hs.10037 KIAA0274 gene product 6.4
100447 D87465 Hs.74583 KIAA0275 gene product 10
100448 D87469 Hs.57652 EGF-like-domain; multiple 2 6.2
101478 M23379 Hs.758
101484 M24594 Hs.20315
101539 M30818 Hs.926
101540 M30938 Hs.84981
101544 M31169
101552 M31642 Hs.82314
101580 M34677 Hs.83363
101600 M37583 Hs.119192
101663 M60750 Hs.2178
101664 M60752 Hs.121017
101667 M60856 Hs.79110
101684 M63256 Hs.75124
101702 M64929 Hs.179574
101754 M77142 Hs.239489
101758 M77836 Hs.79217
101767 M81057
ATPase; H+ transporting; lysosomal (vacu

fucosyltransferase 8 (alpha (1;6) fucosyltr

Ras-Like Protein Tc4

Carboxyl Methyltransferase, Aspartate, A

Collagen, Type VEl. Alpha 1

Ribosomal Protein L39 Homolog

Epican, Alt. Splice 11

Epican, Alt. Splice 12

Fibronectin, Alt. Splice 1

Mucin 1, Epithelial, Alt. Splice 9

Basic Transcription Factor, 44 Kda Subun

Cytosolic Acetoacetyl-Coenzyme A Thio

Rad2

Protein Kinase Pitslre, Alpha, Alt. Splice

Cathepsin B

Transcription Factor IIIa

Guanosine 5'-Monophosphate Synthase

Ras Inhibitor Inf

Endothelial Cell Growth Factor 1
Oncogene E6-Ap, Papillomavirus
lymphocyte cytosolic protein 1 (L-plastin



interferon; gamma-inducible protein 30
diaphorase (NADH/NADPH) (cytochrom
acid phosphatase 5; tartrate resistant
biglycan

matrix metalloproteinase 9 (gelatinase B;
replication protein A2 (32kD)
Human MHC class II HLA-DObeta mRN
glucose phosphate isomerase
voltage-dependent anion channel 1
chemokine (C-X-C motif); receptor 4 (fus
Human neuropeptide Y receptor Y1 (NPY
heat shock 70kD protein 4



H2A histone family; member O
cyclin-dependent kinase inhibitor 3 (CDK
sorbitol dehydrogenase
glycogen synthase kinase 3 beta
proteasome (prosome; macropain) 26S sub
inner membrane protein; mitochondrial (m
Homo sapiens (cell line HL-6) alpha topo
phosphomevalonate kinase
COX17 (yeast) homolog; cytochrome c ox
interferon-stimulated protein; 15 kDa
proliferating cell nuclear antigen
heterogeneous nuclear ribonucleoprotein C
apolipoprotein C-I

NADH dehydrogenase (ubiquinone) flavo
Human somatic cytochrome c (HCS) gene
protective protein for beta-galactosidase (
RAS p21 protein activator (GTPase activa
interferon-induced protein 56
myxovirus (influenza) resistance 2; homol
X-ray repair complementing defective rep
Human propionyl-CoA carboxylase beta-s
hypoxanthine phosphoribosyltransferase 1
DNA segment on chromosome X (unique)
H2A histone family; member Z
H2B histone family; member A
H2A histone family; member A
nucleoli



protein phosphatase 2 (formerly 2A); regu
TIA1 cytotoxic granule-associated RNA-b
pyrroline-5-carboxylate reductase 1
carboxypeptidase B1 (tissue)
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101770 M81601 Hs.78869 transcription elongation factor A (Sll); 1 

101791 M83822 Hs.62354 cell division cycle 4-fike 

101803 M86546 Hs.155691 pre-B-cefi leukemia transcription factor 1 

101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 

101839 M93036 Hs.692 membrane component; chromosomal 4; su 

101851 M94250 Hs.82045 midkine (neurite growth-promoting factor 

101688 M99701 Hs.95243 transcriptfon elongation factor A {Sl\\4skB 

101973 S82597 Hs.80120 UDP-N-acetyl-a!pha^>galadosamine:po . 

101991 U00968 Hs.166 Human SREBP-1 mRNA; complete cds 

102009 U02680 Hs.82643 protein tyrosine kinase 9 

102025 U03911 Hs.78934 mutS (E. cofi) homolog 2 (colon cancer; n 

102047 U07158 Hs.83734 syntaxin. 4A (placental) 

102051 U07550 Hs.1197 heat shock 10kD protein 1 (chaperonin 10 

102083 U10323 ,Hs.75117 Interleukln enhancer Wndlng factor 2; 45k 

102095 U11313 Hs.75760 sterol carrier protein 2 

102130 U15009 Hs.1575 • small nuclear ribonudeoprotein D3 polyp 

102133 U15173 Hs.155596 BCL2/adenovirus E1B 19kD-lnteracting p 

102148 U16954 Hs.75823 ALL1-fused gene from chromosome 1q 

102179 U19713 Hs.76364 allograft inflammatory factor 1 

102180 U19716 Hs.83551 ' microfibrillar-associated protein 2 
102193 U20758 Hs.313, secreted phosphoprotein 1 (osteopontin; b 
102198 U21090 HsJA59S polymerase (DMA directed); delta 2; regu 
102202 U21931 Hs.574 fructose-bisphosphatase 1 

102209 . U22970 Hs.265827 interferon; alpha-inducibte protein (done 

102211 U23070 Hs.78776 putative transmembrane protein 

102220 U24389 Hs.65436 tysyl oxidase-fike 1 

102224 U24704 Hs.148495 proteasome (prosome; macropain) 26S sub 

102234 U26312 Hs.278554 chromobox homolog 3 (Drosophfla HPi g 

102250 U28014 Hs.74122 caspase 4; apoptosls-related cysteine prot 

102260 U28386 Hs.159557. karyophertoalpha2(RAG<»hort1;lmpo 

102261 U28488 Hs.155935 complement component 3a receptor 1 
102273 U30888 Hs.75981 ubiquitin spedfic protease 14 (tRNA-guan 
102298 U32849 Hs.54483 N-myc (and STAT) interactor 

102302 U33052 Hs.69171 protein kinase C-Uke 2 

102305 U33288 Hs.90073 chromosome segregation 1 (yeast homblo 

102320 U34683 Hs.823Z7 glutathione synthetase 

102330 U35451 Hs.77254 chromobox homolog 1 (Drosophila HP1 b 

102348 U37519 Hs.87539 aldehyde dehydrogenase 8 

102361 U39400 Hs.75859 chromosome 1 1 open reading frame 4 

102362 U39412 Hs.75932 N-ethylmaleimide-sensitive factor attachm 
102369 U39840 Hs.299867 hepatocyte nudear factor 3; alpha 
102395 U41767 Hs.92208 a disintegrin and metalloproteinase domai 
102409 Hs. 118725 seJenophosphate synthetase 2 

102418 U43923 Hs.79058 suppressor of Ty (Sxerevislae) 4 homolog 

102425 U44772 Hs.3873 palmitoyl-protein thloesterase (cerafaMipo 

102457 U48807 Hs.2359 dual specificity phosphatase 4 

102465 U49352 Hs.81548 2;4-dienoyl CoA reductase 1; milochondri 

102495 U51240 Hs79356 Lysosomal-associated murtispanning mem 

102534 U56833 Hs.198307 von HippeWJndau binding protein 1 

102546 U57877 Hs.3577 succinate dehydrogenase complex; subunl 

102549 U58046 Hs.198899 eukaryotic translation Initiation factor 3; s 

102557 U58766 Hs.264428 tissue specific transplantation antigen P35 

102562 U59309 Hs.75653 fumarate hydratase 

102568 U59877 Hs.223025 RAB31; member RAS oncogene family 

102580 U60808 Hs.152981 CDP-diacylglycerol synthase (phosphatid 

102581 U61145 Hs.77256 enhancer of zeste (Drosophila) homolog 2 

102590 U62136 Hs.79300 Homo sapiens enterocyte differentiation a 

102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-bindi 

102592 U62389 Hs.11223 Human putative cytosolic NADP-depende 

102617 U65928 Hs.198767 Jun activation domain binding protein 

102618 U65932 Hs.81071 extracellular matrix protein 1 

102638 U67319 Hs.9216 caspase 7; apoptosis-related cysteine prot 

102663 U70322 Hs.168075 karyopherin (importin) beta 2 

102666 U70660 Hs.2799W ATX1 (antioxidant protein 1; yeast) homo 

102679 U72661 Hs.11342 ninjurin 1; nerve injury-induced protein-1 

102687 U73379 Hs.93002 ubiquitin carrier protein E2-C 

102704 U76638 Hs.54089 BRCA1 associated RING domain 1 
X96506 
X97065 
Hs.80658 
Hs.77274 

Hs.202833 

Hs.74649 
Hs.14601 
Hs.118638 
Hs.2707 
Hs.1408 
Hs.81134 
Hs.117950 
Hs.83169 
Hs.155324 
Hs.74077 
Hs.2934 
Hs.82932 
Hs.296281 
Hs.76913 
Hs.4147 
Hs.171834 
Hs.5337 
Hs.99995 
Hs.2942 
Hs.2903 
Hs.75724 
Hs.78580 
Hs.2642 
Hs.77367 

Hs.31314 
Hs.16003 
Hs.44313 
Hs.74637 
Hs.289114 
Hs.80680 
Hs.9078 
Hs.324728 
small inducible cytokine subfamily A (Cy 
Human clone 23759 mRNA; partial cds 
asparaginyl-tRNA synthetase 
Human clone 23801 mRNA sequence 
Human clone 23948 mRNA sequence 
tumor susceptibility gene 101 
nucleolar protein p40 
E4F transcription factor 1 
pyridoxal (pyridoxine; vitamin B6) kinase 
nuclear RNA helicase; DECD variant of D 
Human clone 23773 mRNA sequence 
carboxypeptidaseO 

chaperonin containing TCP1; subunit 2 (b 
Human uncoupling protein homolog (UCP 
Human D9 splice variant 8 mRNA; comp 
Homo sapiens Nedd-4-like ubiquitin-prot 
plasminogen activator, urokinase 
heme oxygenase (decycling) 1 
aldolase A; fructose-bisphosphate 
cytochrome c oxidase subunit VIc 
hematopoietic cell-specific Lyn substrate 
non-metastatic cells 1; protein (NM23A) 
G1 to S phase transition 1 
trefoil factor 1 (breast cancer; estrogen-ind 
interleukin 1 receptor antagonist 
multifunctional polypeptide similar to SA 
matrix metalloproteinase 1 (interstitial col 
matrix metalloproteinase 11 (stromelysin 
proteasome (prosome; macropain) subunit 
ribonucleotide reductase M1 polypeptide 
cyclin D1 (PRAD1: parathyroid adenomat 
interleukin enhancer binding factor 1 
proteasome (prosome; macropain) subunit 
translocating chain-associating membrane 
PCTAIRE protein kinase 1 
isocitrate dehydrogenase 2 (NADP+); mit 
intercellular adhesion molecule 3 
macrophage stimulating 1 receptor (c-met 
protein phosphatase 4 (formerly X); cataly 
coatomer protein complex; subunit beta 2 
DEAD/H (Asp-Glu-Ala-Asp/His) box pol 
eukaryotic translation elongation factor 1 
monokine induced by gamma interferon 
Human endogenous retrovirus mRNA for 
retinoblastoma-binding protein 7 
retinoblastoma-binding protein 4 
v-rel avian reticuloendotheliosis viral onco 
testis enhanced gene transcript 
hexabrachion (tenascin C; cytotactin) 
lung resistance-related protein 
immature colon carcinoma transcript 1 
coatomer protein complex; subunit beta 
SMA5 

small nuclear ribonucleoprotein polypepti 
serine/threonine kinase 9 
uracil-DNA glycosylase 
SULT1C sulfotransferase 
chloride channel; nucleotide-sensitive; 1A 
synaptobrevin-like 1 
methionine-tRNA synthetase 
cofilin 1 (non-muscle) 
DR1-associated protein 1 (negative cofact 
Sec23 (S. cerevisiae) homolog B 
adaptor-related protein complex 2; sigma 
H.sapiens mRNA for Ptg-12 protein 
translocase of inner mitochondrial membr 
103464 Y00285 Hs.76473 insulin-like growth factor 2 receptor 4.2 

103470 Y00796 Hs.174103 Integrin; alpha L (antigen CD11A (p180); 4.5 

103494 Y0S991 Hs.83050 phosphatidylinositol 3-kinase-associated p 4.1 

103505 Y09912 Hs.33102 transcription factor AP-2 beta (activating 4.5 

103547 Z14982 Hs.180062 proteasome (prosome; macropain) subunit 4.3 

103551 Z15115 Hs.75248 topoisomerase (DNA) II beta (180kD) 4 

103565 Z22548 Hs.146354 thioredoxin-dependent peroxide reductase 7.6 

103567 Z29083 Hs.82128 5T4 oncofetal trophoblast glycoprotein 14.6 

103621 Z47727 Hs.150675 polymerase (RNA) II (DNA directed) pol 6.3 

103622 Z48042 Hs.278672 membrane component chromosome 11; 3 4.4 
103658 Z74615 Hs.172928 collagen; type I; alpha 1 5.9 
103680 Z93784 Homo sapiens DNA sequence from PAC 4.4 
103772 AA092473 Hs.278554 chromobox homolog 3 (Drosophila HP1 g 4.9 
103774 AA092898 Hs.92918 ESTs; Weakly similar to R07G3.8 [C.eleg 6.1 
103821 AA157623 Hs.198793 KIAA0750 gene product 23.3 
103835 AA172215 Hs.93748 ESTs; Moderately similar to TRANSCRIP 4 
103886 AA236384 Hs.105737 ESTs; Weakly similar to gene 9306 protei 4.9 
103890 AA236843 Hs.72085 ESTs; Weakly similar to unknown [S.cere 7.8 
103892 AA243523 Hs.239189 ESTs 4.8 
104054 AA393432 Hs.7100 hypothetical protein 5.3 
104115 AA428090 Hs.26102 ESTs 28.7 
104136 AA442669 Hs.268371 zv68f5.r1 Soares_total_fetus_Nb2HF8_9w 5.7 
104147 AA451992 Hs.283037 ESTs; Highly similar to HSPC039 protein 6.9 
104173 AA476564 Hs.76561 ESTs; Weakly similar to finger protein HZ 5.2 
104181 AA479521 Hs.283740 ESTs 7.8 
104183 AA480838 Hs.114309 ESTs 5.1 
104192 AA486946 Hs.21321 Homo sapiens mRNA; cDNA DKFZp564 4.3 
104209 AB000221 Hs.16530 small inducible cytokine subfamily A (Cy 12.3 
104234 AB002357 Hs.168212 kinesin family member 3B 6.2 
104271 C01687 Hs.7381 ATP synthase; H+ transporting; mitochon 4.2 
104278 C02582 Hs.109253 ESTs; Highly similar to N-terminal acetyl 4.5 
104307 D52818 Hs.111680 endosulfine alpha 4.7 
104309 D55869 Hs.284123 Homo sapiens mRNA full length insert cD 4.2 
104370 H19378 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 6.4 
104446 L44497 Hs.7351 ESTs 4.9 
104453 M19169 Hs.123114 cystatin SN 11.6 
104476 N33807 Hs.324275 protease; serine; 15 5.6 
104558 R56678 Hs.88959 Human DNA sequence from clone 967N2 6.3 
104592 R81003 Hs.325620 serine protease; umbilical endothelium 13.6 
104634 AA004274 Hs.19151 ESTs . 1 6.3 
104636 AA004415 Hs.106106 ESTs 10.1 
104658 AA007145 Hs.27268 Homo sapiens mRNA; cDNA DKFZp564 4.3 
104667 AA007234 Hs.30098 ESTs 16.6 
104675 AA009596 Hs.301553 ESTs; Moderately similar to !!!! ALU SU 4.6 
104767 AA025534 Hs.8852 ESTs 4.8 
104785 AA027163 Hs.7942 ESTs 8.1 
104791 AA029046 Hs.301871 ESTs; Moderately similar to cAMP induc 10.9 
104804 AA031357 Hs.31803 ESTs; Weakly similar to N-WASP (H.sap 5.5 
104807 AA032147 Hs.23296 ESTs 10.4 
104837 AA039469 Hs.21126 ESTs; Weakly similar to KIAA0299IH A 4.6 
104849 AA040270 Hs.241507 Homo sapiens mRNA; cDNA DKFZp564 4.3 
104867 AA045481 Hs.225979 Human gene from PACs 37M17 and 305B 4.5 
104884 AA053021 Hs.14511 SCO (cytochrome oxidase deficient; yeast 4.7 
104906 AA055809 Hs.26802 ESTs; Weakly similar to phosphoprotein [ 8.8 

104919 AA057193 Hs.25252 ESTs 6.5 

104921 AA057839 Hs.1508 ESTs 4.2 

104926 AA058846 Hs.33363 DKFZP434N093 protein 7 

104938 AA064627 Hs.318725 ESTs; Highly similar to CGI-72 protein [H 7.1 

104943 AA065217 Hs.114218 ESTs 5.7 

104957 AA074919 Hs.10026 ESTs; Weakly similar to ORF YJL063c [S 4.7 

104961 AA076672 Hs.33905 ESTs 5.5 

104968 AA084602 Hs.29669 ESTs 4.3 

104975 AA086071 Hs.50758 chromosome-associated polypeptide C 8.3 

104977 AA088228 Hs.18272 ESTs 62 

104978 AA088458 Hs.19322 ESTs 6.7 
104987 AA101723 Hs.11861 ESTs 92 
105002 AA113266 Hs.182704 ESTs; Moderately similar to alternatively 6.9 
105376 
105386 
105397 
105399 
105400 
105404 
105409 
105436 
105483 
105493 
105495 
105496 
105500 
105507 
105538 
105544 
105546 
105549 
105551 
105560 
105565 
105566 
105575 
105584 
105596 
105604 
105610 
105621 
105627 
105638 
105645 
105650 
105666 
105674 
105687 
105700 
105705 
105709 



AA116036 
AA121879 
AA126855 
AA127964 
AA128486 
AA130349 
AA134968 
AA142858 
AA147884 
AA148859 
AA149051 
AA152302 
AA158132 
AA159501 
AA165333 
AA171736 
AA176690 
AA191512 
AA205072 
AA211388 
AA227428 
AA227448 
AA227871 
AA227926 
AA228122 
AA233451 
AA233790 
AA233854 
AA235286 
AA236559 
AA236950 
AA242868 
AA243007 
AA243052 
AA243303 
AA243562 
AA252172 
AA255874 
AA256268 
AA256317 
AA256323 
AA256485 
AA256678 
AA258860 
AA261954 
AA262032 
AA262417 
AA262477 
AA262783 
AA278302 
AA278323 
AA278717 
AA279012 
AA279418 
AA279787 
AA279991 
AA280865 
AA281245 
AA281599 
AA282138 
AA282347 
AA283930 
AA284755 
AA286809 
AA287643 
AA290767 
AA291268 



Hs.9329 

Hs.9280 

Hs.13268 

Hs.274329 

Hs.8859 

Hs.36475 

Hs.36529 

Hs.37810 

Hs.9812 

Hs.179909 

Hs.32405 

Hs.25035 

Hs.301957 

Hs.247280 

Hs.24808 

Hs.35947 

Hs.4084 

Hs.28005 

Hs.227743 

Hs.7750 

Hs.972? 

Hs.5003 

Hs.6361 

Hs.6682 

Hs.281866 

Hs.183858 

Hs.4104 

Hs.23348 

Hs.157078 

Hs.8766 

Hs.8115 

Hs.7395 

Hs.16420 

Hs.65648 

Hs^1187 

Hs.301855 

Hs.237856 

Hs.23458 

Hs.10283 

Hs.28785 

Hs.301997 

Hs.222399 

Hs.226318 

Hs.32597 

Hs.24678 

Hs.268281 

Hs.5415 

Hs.25292 

Hs.306915 

Hs.18349 

Hs.17481 

Hs.12772 

Hs.3454 

Hs.18490 

Hs.15467 

Hs.99872 

Hs.6375 

Hs.23317 

Hs.247817 

Hs.11325 

Hs.25635 

Hs.34906 

Hs.279789 

Hs.28423 

Hs.35254 

Hs.101282 

Hs.26761 



chromosome 20 open reading frame 1 
proteasome (prosome; macropain) subunit 
ESTs 

TP53 target gene 1 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA 
ESTs 

DKFZP566G223 protein 

ESTs; Weakly similar to contains similari 

HBV associated factor 

ESTs 

methyl-CpG binding domain protein 4 
KIAA1025 protein 

Homo sapiens mRNA; cDNA DKFZp564 



ESTs 

ESTs; Weakly similar to KIAA0512 prote 
KIAA0456 protein 
MEK partner 1 
ESTs 

ATPase; H+ transporting; lysosomal (vaco 
transcriptional intermediary factor 1 
ESTs; Weakly similar to cDNA EST yk38 
S-phase kinase-associated protein 2 (p45) 
ESTs 

ESTs; Weakly similar to !!!! ALU SUBFA 
ESTs 

ESTs; Weakly similar to house-keeping p 

ESTs; Highly similar to SH3 domain-bind 

RNA binding motif protein 8 

ESTs 

ESTs 

ESTs; Moderately similar to cAMP induc 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp586 
DKFZP434N126 protein 
CGI-96 protein 

ESTs; Moderately similar to CCR4-associ 
ring finger protein (C3H2C3 type) 6 
ESTs 

ESTs; Weakly similar to 62D9.a [D.melan 
ESTs 

ribonuclease HI; large subunit 
ESTs 

ESTs; Weakly similar to partial CDS [C.e 
Homo sapiens clone 24606 mRNA sequen 
ESTs 

ESTs; Weakly similar to KIAA0665 prote 
ESTs 

ESTs; Moderately similar to putative pho 
ESTs; Weakly similar to trithorax homolo 
Homo sapiens mRNA; cDNA DKFZp564 
ESTs 

Homo sapiens mRNA for for histone H2B 
ESTs 

ESTs; Highly similar to HSPC003 [H.sap 
ESTs 

CDW52 antigen (CAMPATH-1 antigen) 
ESTs 

ESTs; Weakly similar to hypothetical pro 
Homo sapiens mRNA; cDNA DKFZp434 
DKFZP586L0724 protein 
ESTs; Weakly similar to F55C125 [C.ele 45 
106459 AA449741 Hs.4029 glioma-amplified sequence-41 48 
106462 AA449912 Hs.30532 ESTs; Highly similar to CGI-77 protein [H 5^ 
106466 AA450047 Hs.14770 ESTs 6.6 
106479 AA450351 Hs.75251 ESTs 12.4 
106494 AA452108 Hs.18387 transcription factor AP-2 alpha (activating 4.5 
106503 AA452411 Hs.29879 ESTs; Highly similar to mediator [H.sapie 5.1 
106507 AA452584 Hs.267819 protein phosphatase 1; regulatory (inhibito 4.9 
106533 AA453786 Hs.145998 ESTs 8.3 
106568 AA455970 Hs.28285 patched related protein translocated in ren 7.6 
106586 AA456598 Hs.57787 ESTs 62 
106589 AA456646 Hs.28661 ESTs 4.8 
106606 AA457730 Hs.283437 Homo sapiens clone 23851 mRNA sequen 4.4 
106611 AA458904 Hs.26267 ESTs; Weakly similar to torsinA [H.sapie 7 
106614 AA458934 Hs.256150 ESTs 45 
106628 AA459657 Hs.12311 Homo sapiens clone 23570 mRNA sequen 63 
106637 AA459961 Hs.250824 ESTs 5.5 
106644 AA460239 Hs.12680 ESTs 4.4 
106664 AA460969 Hs.7510 mitogen-activated protein kinase kinase W 8.4 

106698 AA463746 Hs.29403 ESTs; Weakly similar to PROBABLE AT 5.3 

106719 AA465171 Hs.236844 ESTs 6.6 

106726 AA465339 Hs.3886 ESTs 10.1 

106747 AA476473 Hs.171957 triple functional domain (PTPRF interacti 10.4 

106759 AA477263 Hs.25584 ESTs 4.2 

106765 AA477717 Hs.306117 interleukin 13 receptor; alpha 1 6.9 

106784 AA478558 Hs.227913 API5-like 1 5.1 

106831 AA482014 Hs.29463 centrin; EF-hand protein; 3 (CDC31 yeast 5.1 

106836 AA482112 Hs.238707 ESTs 4.8 

106840 AA482548 Hs.5534 ESTs 10.3 

106856 AA486183 Hs.285123 ESTs; Weakly similar to similar to oxyste 6.2 

106865 AA487228 Hs.19479 ESTs 45 

106878 AA488872 Hs.12314 Homo sapiens mRNA; cDNA DKFZp586 7.9 

106888 AA489101 Hs.24734 oxysterol binding protein 6.4 

106895 AA489665 Hs.25245 ESTs 4.6 

106909 AA490323 Hs.250747 SUMO-1 activating enzyme subunit 1 4.2 

106919 AA490885 Hs.21766 ESTs 12.3 

106920 AA490899 Hs.296323 ESTs 62 

106941 AA496204 Hs.237971 ESTs 4 

106942 AA496347 Hs.31314 retinoblastoma-binding protein 7 4.8 
106948 AA496788 Hs.21077 KIAA0532 protein 4 
106968 AA504631 Hs.26813 ESTs; Weakly similar to hypothetical 43.2 4.4 
106973 AA505141 Hs.11923 Human DNA sequence from clone 167A1 5.4 

106980 AA521121 Hs.8858 bromodomain adjacent to zinc finger dom 4.1 

106981 AA521157 Hs.74101 ESTs 5.7 
106998 AA598461 Hs.195464 insulin-like growth factor binding protein 18.7 
107008 AA598710 Hs.23740 ESTs 62 
107028 AA599214 Hs.24143 ESTs 4.1 
107032 AA599472 Hs.247309 succinate-CoA ligase; GDP-forming; beta 5.3 

107052 AA600134 Hs.12482 glyceronephosphate O-acyltransferase 4.8 

107053 AA600147 Hs.5741 ESTs; Weakly similar to NADH-cytochro 5.8 
107056 AA600310 Hs.18720 programmed cell death 8 (apoptosis-induc 4.9 
107080 AA609210 Hs.19221 ESTs 8.4 
107102 AA609723 Hs.30652 ESTs 8 
107109 AA609943 Hs.32793 ESTs 95 
107129 AA620553 Hs.4756 flap structure-specific endonuclease 1 4.9 
107132 AA620598 Hs.9052 ESTs 5.3 
107136 AA620795 Hs.8207 ESTs 4 
107140 AA620889 Hs.170088 ESTs 6.7 
107151 AA621169 Hs.8687 ESTs 19 
107159 AA621340 Hs.10600 ESTs; Weakly similar to ORF YKR081c [ 8.1 
107174 AA621714 Hs.25338 ESTs 8.5 
107217 D51095 Hs.35861 DKFZP586E1621 protein 7.2 
107252 D59971 Hs.25925 ESTs 7.9 
107295 T34527 Hs.80120 UDP-N-acetyl-alpha-D-galactosamine:po 5.6 
107299 T40327 Hs.30661 lung resistance-related protein 8.4 
107324 T81665 Hs.276422 DKFZP586G1122 protein 7.5 

107372 U85625 Hs.8297 ribonuclease 6 precursor 4.7 

107373 U85773 Hs.154695 phosphomannomutase 2 4.8 
107481 W58247 Hs.279766 Homo sapiens kinesin superfamily motor 6.3 
107531 Y13936 Hs.17883 protein phosphatase 1G (formerly 2C); ma 8.3 
107859 AA024835 Hs.47584 potassium voltage-gated channel; delayed 7.3 
107890 AA026030 Hs.61311 ESTs; Weakly similar to CALPAIN 2; LA 7.3 
107908 AA026894 Hs.42826 ESTs 4.9 

108039 AA041341 Hs.46670 ESTs 5.4 

108040 AA041551 Hs.159971 ESTs 8.4 
108102 AA046424 Hs.49433 ESTs; Weakly similar to HYPOTHETICA 6.6 
108217 AA058686 Hs.62588 ESTs 7.7 
108255 AA063157 Hs.172608 ESTs 4 
108358 AA071514 Hs.1634 ESTs 4 
108609 AA100694 Hs.69499 Human DNA sequence from BAC 15E1 o 5.5 
108647 AA112396 Hs.44276 ESTs; Moderately similar to HOMEOBO 143 
108676 AA115562 Hs.274417 Homo sapiens mRNA; cDNA DKFZp564 5.2 
108687 AA120785 Hs.54347 ESTs 5.6 
108695 AA121315 Hs.70823 KIAA1077 protein 10.5 
108733 AA126422 zn84f1.s1 Stratagene lung carcinoma 9372 4.4 
108774 AA128125 Hs.71040 ESTs; Moderately similar to CHI GROW 

108828 AA131584 Hs.273344 DKFZP564O0463 protein 

108872 AA134063 Hs.111680 ESTs 

108884 AA134958 Hs.293591 ESTs 

108893 AA135894 Hs.194691 retinoic acid induced 3 

109008 AA156360 Hs.87128 ESTs 

109010 AA156460 Hs.44229 dual specificity phosphatase 12 

109011 AA156542 Hs.72127 ESTs 

109042 AA159525 Hs.71779 Homo sapiens DNA from chromosome 19 

109086 AA166695 Hs.270737 tumor necrosis factor (ligand) superfamily 

109090 AA167006 Hs.70499 ESTs 

109101 AA167708 Hs.52184 ESTs 

109112 AA169379 Hs.257924 ESTs 

109160 AA179387 Hs.301997 DKFZP434N126 protein 

109166 AA179845 Hs.73625 RAB6 interacting; kinesin-like (rabkinesin 

109178 AA181600 Hs.283707 ESTs 

109179 AA161902 Hs.192789 ESTs; Weakly similar to !!!! ALU SUBFA 
109261 AA195255 Hs.61779 ESTs 

109270 AA195515 Hs.3585 ESTs; Weakly similar to alternatively spli 

109277 AA196332 Hs.86043 ESTs 

109313 AA206800 Hs.86276 ESTs; Moderately similar to zinc finger p 

109415 AA227219 Hs.110826 trinucleotide repeat containing 9 

109454 AA232255 Hs.295232 ESTs 

109467 AA232904 Hs.63187 ESTs 

109481 AA233342 Hs.289069 ESTs; Weakly similar to WD40 protein C 

109508 AA233892 Hs.55902 ESTs; Weakly similar to !!!! ALU SUBFA 

109514 AA234087 Hs.262346 ESTs; Weakly similar to ORF2: function 

109572 F02027 Hs.171937 ESTs 

109632 F04165 Hs.235873 ESTs; Weakly similar to K11C4.2 [C.eleg 

109644 F04477 Hs.291531 ESTs; Moderately similar to GLYCERAL 

109703 F09684 Hs.24792 ESTs; Weakly similar to ORF YOR283W 

109726 F10009 Hs.9196 ESTs 

109747 F10161 Hs.22969 ESTs 

109799 F10770 Hs.180378 Homo sapiens clone 669 unknown mRNA 

109814 F10979 Hs.153106 Homo sapiens clone 23728 mRNA sequen 

110189 H20543 Hs.6278 DKFZP586B1621 protein 

110240 H25577 Hs.176588 ESTs; Weakly similar to CYTOCHROME 

110280 H29285 Hs.32468 ESTs 

110520 H56965 Hs.4082 yr09f06.s1 Soares fetal liver spleen 1 NFL 

110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CO 

110707 H95079 Hs.15617 ESTs; Weakly similar to !!!! ALU SUBFA 

110734 H98714 Hs.24131 ESTs 

110770 N22262 Hs.131705 ESTs 

110780 N23174 Hs.22891 solute carrier family 7 (cationic amino aci 

110787 N24716 Hs.12244 ESTs; Weakly similar to C44B9.1 [C.eleg 

110794 N25262 Hs.27931 ESTs 

110799 N26101 Hs.323401 Human ring zinc-finger protein (ZNF127- 

110818 N29454 Hs^7552 ESTs; Weakly similar to putative p150[H 

110839 N30856 Hs.30246 solute carrier family 19 (thiamine transpo 

110844 N31952 Hs.167531 Homo sapiens mRNA full length insert cD 

110854 N32919 Hs.27931 ESTs 

110856 N33063 ESTs; Weakly similar to S164 [H.sapiens 

110860 N33438 Hs.170065 ESTs 

110897 N39148 Hs.6880 DKFZP434D156 protein 

110915 N46252 Hs.29724 ESTs 

110935 N48787 Hs.305979 protease inhibitor 1 (anti-elastase); alpha- 

110970 N51374 Hs.96870 Homo sapiens mRNA full length insert cD 

111006 N53375 Hs.166146 Homer; neuronal immediate early gene; 3 

111008 N53388 Hs.7222 ESTs 

111018 N54067 Hs.3628 mitogen-activated protein kinase kinase W 

111084 N59543 Hs.15456 PDZ domain containing 1 

111100 N62522 Hs.20450 ESTs 

111125 N63823 Hs.269115 ESTs 

111132 N64378 Hs.83293 ESTs; Highly similar to unknown function 

111139 N64683 Hs.290943 ESTs 

111164 N66857 Hs.14808 ESTs; Weakly similar to !!!! ALU CLASS 

111172 N67102 Hs.21851 Homo sapiens mRNA; cDNA DKFZp586 
111178 N67227 Hs.24633 ESTs 

111179 N67239 Hs.10760 ESTs 

111181 N67278 Hs.171802 ESTs; Weakly similar to hypothetical pro 

111184 N67437 Hs.243901 Homo sapiens mRNA; cDNA DKFZp564 

111221 N68859 Hs.15119 ESTs 

111223 N68921 Hs.297939 ESTs; Weakly similar to neogenin [H.sap 

111229 N69113 Hs.110855 ESTs 

111241 N69514 Hs.288880 ESTs; Weakly similar to CGI-82 protein [ 

111268 N70481 Hs.26118 Homo sapiens clone 24766 mRNA sequen 

111295 N73275 Hs.21275 ESTs; Weakly similar to ubiquitin-conjug 

111299 N73808 Hs.24936 ESTs 

111336 N79565 Hs.29894 ESTs 

111357 N91023 Hs.87128 ESTs 

111370 N92915 Hs.94631 brefeldin A-inhibited guanine nucleotides 

111806 R33468 Hs.279008 ESTs 

111825 R35885 Hs.286148 stromal antigen 1 

111836 R36228 Hs.25119 ESTs 

111890 R38678 Hs.12365 ESTs 

111923 R39995 Hs.25925 Homo sapiens clone 23860 mRNA sequen 

111942 R40576 Hs.21590 ESTs 

111987 R42036 Hs.6763 KIAA0942 protein 

112101 R44793 Hs.296341 adenylyl cyclase-associated protein 2 

112134 R46025 Hs.7413 ESTs 

112197 R49482 Hs.5637 ESTs 

112244 R51309 Hs.70823 KIAA1077 protein 

112253 R51818 Homo sapiens mRNA; cDNA DKFZp566 

112305 R54822 Hs.26244 ESTs 

112449 R63802 Hs.124186 ring finger protein 2 

112483 R66534 Hs.285885 ESTs 

112519 R68631 Hs.11861 ESTs 

112610 R79392 Hs.23643 ESTs 

112693 R88741 Hs.91065 ESTs; Moderately similar to proliferation 

112751 R93507 Hs.8207 ESTs 

112801 R97486 Hs.157160 protein kinase; DNA-activated; catalytic p 

112869 T03313 Hs.4747 dyskeratosis congenita 1; dyskerin 

112871 T03352 Hs.12285 ESTs 

112908 T10065 Hs.3530 TLS-associated serine-arginine protein 

112968 T17119 Hs.102548 glucocorticoid receptor DNA binding fact 

112971 T17185 Hs.83883 ESTs 

112995 T23528 Hs.7155 ESTs; Weakly similar to TYW protein [M 

113047 T25867 Hs.7549 ESTs 

113075 T34660 Hs.6986 ESTs; Weakly similar to !!!! ALU SUBFA 

113117 T47819 Hs.159153 ESTs 

113206 T58044 Hs.241471 ESTs; Moderately similar to !!!! ALU SU 

113248 T63857 yd 6e1 .s1 Stratagene lung (#93721) Homo 

113260 T64896 Hs.287420 ESTs 

113277 T65797 Hs.11774 protein (peptidyl-prolyl cis/trans isomeras 

113278 T65802 Hs.11135 yd1h10.s1 Stratagene lung (#937210) Ho 
113440 T86121 Hs.191445 ESTs 

113523 T90037 Hs.95549 ESTs 

113604 T92735 Hs.296083 ESTs 

113702 T97307 ESTs; Moderately similar to !!!! ALU SU 

113783 W19222 Hs.7041 ESTs; Weakly similar to !!!! ALU SUBFA 

113794 W37382 Hs.11090 ESTs 

113808 W44735 Hs.9286 ESTs 

113811 W44928 Hs.6994 ESTs 

113822 W47350 Hs.17466 retinoic acid receptor responder (tazaroten 

113823 W47388 Hs.55099 rab6 GTPase activating protein (GAP and 
113836 W56792 Hs.12040 ESTs; Weakly similar to KIAA0881 prote 
113857 W65477 Hs.5297 Homo sapiens mRNA; cDNA DKFZp564 
113886 W72471 Hs.23920 ESTs 

113895 W73738 Hs.12921 ESTs 

113923 W80763 Hs.3849 ESTs; Weakly similar to FK506-binding p 

113931 W81205 Hs.3496 ESTs 

113950 W85765 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434 

113970 W86748 Hs.8109 ESTs 

114051 W94942 Hs.177534 dual specificity phosphatase 10 
14057 W86222 Hs.34192 ESTs 4.8 

14086 Z38266 Hs.288649 Homo sapiens PAC clone DJ0777O23 fro 5.1 

14098 Z38347 Hs.118338 ESTs; Weakly similar to similar to S. cere 6.2 

14109 Z38435 Hs.184108 ribosomal protein L21 4.6 

14124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 prote 22 

14138 Z38763 Hs.15740 amyloid beta (A4) precursor protein-bindi 8.8 

14149 Z38814 Hs.27196 ESTs 4 

14162 Z38909 Hs.22265 ESTs 7.2 

14177 Z39062 Hs.23740 ESTs 5.3 

14196 Z39211 Hs.150926 fucose-1-phosphate guanylyltransferase 4.4 

14208 Z39301 Hs.7859 ESTs 5.1 

14250 Z39897 Hs.13297 ESTs 12 

14251 Z39898 Hs.21948 ESTs 147 
14292 Z40715 Hs.184641 delta-6 fatty acid desaturase 194 
14297 Z40758 Hs.173091 DKFZP434K151 protein 8.9 
14334 Z41342 Hs.22941 ESTs 13.7 
14460 AA024604 Hs.26102 ESTs 10.1 
14471 AA028074 Hs.104613 ESTs 5.7 
14480 AA032243 Hs.151678 UDP-N-acetyl-alpha-D-galactosamine:po 7.3 
14518 AA046407 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like 1 4.3 
14542 AA055768 Hs.293?80 ESTs 11.7 
14549 AA056484 Hs.292833 ESTs 7.3 
14652 AA101416 Hs.107149 ESTs; Weakly similar to PTB-ASSOCIAT 6.1 
14673 AA113303 Hs.95583 transmembrane 4 superfamily member (te 4.3 
14698 AA126951 Hs.110857 ESTs; Highly similar to putative DNA-dir 7.1 
14767 AA148885 Hs.154443 minichromosome maintenance deficient (S 5.3 
14799 AA159323 Hs.109929 ESTs 42 
14804 AA160363 Hs.269956 ESTs 4.8 
14811 AA161161 Hs.95907 multiple inositol polyphosphate phosphata 7.1 
14821 AA165313 Hs.55468 ESTs 4.4 
14852 AA235035 Hs.38260 ESTs; Moderately similar to ubiquitin spe 5 

14901 AA236276 Hs.196437 ESTs; Weakly similar to R26660J; parti 16.9 

14902 AA236359 Hs.39504 ESTs 5.1 
14940 AA243012 Hs.75928 ESTs 8.5 
14965 AA250737 Hs.72472 ESTs 35.1 
15047 AA252627 Hs.82916 homeoboxBS 5.7 
15054 AA252863 Hs.87729 ESTs 6.2 
15061 AA253217 Hs.41271 ESTs 13 
15082 AA255557 Hs.198269 NADH dehydrogenase (ubiquinone) 1 afo 28.2 
15116 AA256486 Hs.62275 ESTs 1 8.8 
15140 AA258030 Hs.279938 ESTs; Weakly similar to supported by 6E 4.1 

15205 AA262470 Hs.284216 ESTs 8.3 

15206 AA262491 Hs.186572 ESTs 5.1 
15239 AA278650 Hs.73291 ESTs; Weakly similar to similar to the bet 4.6 
15242 AA278755 Hs.283732 ESTs 8.3 
15249 AA278961 Hs.71124 ESTs 10.1 
15259 AA279071 Hs.13453 splicing factor 3b; subunit 1; 155kD 9.5 
15285 AA279799 Hs.293736 ESTs 5.8 
15291 AA279943 Hs.122579 ESTs 5.1 
15357 AA281793 Hs.72988 ESTs 5 
15377 AA282247 Hs.193063 ESTs 6.1 
15400 AA283198 Hs.89113 ESTs 4.9 
15439 AA284561 Hs.193090 ESTs 5.8 
15471 AA287138 Hs.59346 ESTs; Weakly similar to ASPARTYL-TR 11.7 
15506 AA292537 Hs.45207 Human DNA sequence from clone 620E1 6.6 
15522 AA331393 Hs.47378 ESTs 5.8 
15572 AA398392 Hs.59594 ESTs; Weakly similar to F33G123 gene p 9.7 
15587 AA399264 Hs.283037 ESTs; Highly similar to HSPC039 protein 87 
15600 AA400247 Hs.42173 ESTs 4 
15612 AA400948 Hs.71243 ESTs; Weakly similar to zinc finger prote 8.4 
15646 AA404352 Hs.305971 ESTs 5.3 
15652 AA405098 Hs.38176 ESTs 16.1 

15657 AA405620 Hs.55158 ESTs; Weakly similar to weak similarity t 4.7 

15658 AA405625 Hs.183056 Human DNA sequence from clone 34B21 5.1 
15675 AA406546 Hs.82065 Homo sapiens mRNA; cDNA DKFZp564 20.5 
15721 AA417102 Hs.90960 ESTs 4.8 
15763 AA421560 ESTs 7 
ESTs 
Hs.44238 


ESTs 
Hs.83883 


Human DNA sequence from clone 718J7 


7.5 
Hs.42644 


thioredoxin-like 
ESTs 

ESTs 
ESTs 


4.9 

16222 AA478415 Hs.47144 46 
1R246 AA479961 Hs.250646 ESTs; Highly similar to ubiquitin-conjuga 4 
AA480886 ESTs; Weakly similar to OXYSTEROL-B 9.1 
AA481256 Hs.88201 ESTs; Weakly similar to lysophospholipa 84 
AA482594 Hs.272239 7.2 
16265 AA482595 Hs.55189 ESTs; Weakly similar to F25B5^ [C.eleg 11.1 
AA486550 Hs.204501 ESTs; Weakly similar to Wiskott-Aldrich 6.2 
IO£vO AA489046 Hs.94109 ESTs 49 
ifi^nn AA489194 Hs.159471 ESTs; Weakly similar to snRNP protein B 46 
AA490959 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564 5.8 
AA491457 Hs.48948 ESTs 43 
IQOO/ AA496127 Hs.44070 ESTs 8.4 
IOw9 1 AA504116 Hs.82501 Homo sapiens mRNA; cDNA DKFZp434 5.3 
1R3T7 AA504806 Hs.90797 Homo sapiens clone 73620 mRNA sequen 5.2 
AA609204 Hs.27973 KIAA0874 protein 66 
16443 AA620313 Hs.190488 ESTs; Weakly similar to KERATIN; TYP 45 
4A47fl C13992 Hs.83484 ESTs 45 
1R4ftA C14088 glyceraldehyde-3-phosphate dehydrogena 56 
IW/O D51272 Hs.75337 nucleolar phosphoprotein p130 41 
D51276 Hs.81915 leukemia-associated phosphoprotein p18 58 
IDQZD F02028 Hs.81907 ESTs 49 
1P6A7 F03069 Hs.15395 ESTs; Weakly similar to ARGINYL-TRN 61 
F04816 Hs.92127 ESTs 106 
16680 F08813 Hs.273829 LINE retrotransposable element 1 42 
16700 F09983 Hs.317589 ESTs 13 
16724 F13665 Hs.65641 ESTs 85 
16726 F13681 Hs.53913 ESTs 55 
16732 F13779 Hs.165909 ESTs 11.6 
16734 F13789 Hs.93796 DKFZP586D2223 protein 5.4 
16760 H11054 Hs.155342 protein kinase C; delta 45 
16780 H22566 Hs.30098 ESTs 5.7 
16786 H25836 Hs.301527 tumor necrosis factor (ligand) superfamily 8.8 
16787 H28581 Hs.15641 ESTs 85 
16790 H29532 Hs.101174 microtubule-associated protein tau 222 
16803 H47357 Hs.109701 ESTs; Moderately similar to weak similar 6.7 
16877 H68116 Hs.168732 ESTs 65 
16921 H72948 Hs.821 biglycan 20.7 
17216 N20083 Hs.42792 ESTs 4.4 
17232 N20579 Hs.61153 ESTs 7.4 
17284 N22162 Hs.183779 ESTs; Weakly similar to cDNA EST yk33 4.1 
11/044 N24Q4B flS./lU/Uo ESTs 7 A. 
117367 N24954 Hs.42502 ESTs 
117392 N26175 Hs.93405 ESTs O.O 
117394 N26257 Hs.39871 KIAA0727 protein 0.4 
117412 N26722 Hs.42645 ESTs 10.1 
117498 N31726 Hs.44268 ESTs; Highly similar to myelin gene expr 0.0 
117557 N33920 Hs.44532 diubiquitin 
117634 N36421 Hs.13323 ESTs; Weakly similar to SODIUM- AND 4.4 
117639 N36923 Hs.44833 ESTs 0 
117754 N47469 Hs.59757 ESTs 7.8 
117852 N49408 Hs.136102 KIAA0853 protein 5.9 
117879 N50050 Hs.303025 ESTs; Weakly similar to keratin; 67K typ 7.9 
117924 N51056 Hs.38891 ESTs 7.9 

.A .A ftftl Afttt A A. a A Aft AA J 

117950 N51394 


a • «h»A A*«>A 

•Hs.75478 


l/t A ■ A A ^ A 1 •__ 

KIAA0956 protein 


5 


117992 N52000 


ft ft iltAAAA 

Hs.172089 


ft | • Mi || Avill ft\| ^f*^_k#* 

Homo sapiens mRNA; cDNA DKFZp586 


7 


118138 N57773 


Hs.93560 


ESTs; Weakly similar to trg [R.norveglcu 


4.8 


A J A\ AA A m> a a AA J A *W 

118215 N62195 


ft • ftMA A A 

Hs.77910 


3-hydroxy^^ethylglutaryl-CoenzymeA 


13.4 


118229 N62339 


Hs.166254 


i A ft 1 A At ftfftft A 1 A 1 i_ 

heat shock 90kD protein 1 ; alpha 


5.4 


118265 N62827 


Hs.48645 , 


EST 


. 42 


118336 N63604 


Hs.47166 


ESTs 


7.2 . 


a A A* aa A M k ■ A A J «ft «■ 

118363 N64168 


■ ft J AAA A 

Hs.48938 


ESTs 


6 


118429 N66158 


Hs.74649 


ESTs 


4.1 


118470 N66769 


Hs.291033 


ESTs 


C A 

5.4 


118472 N66818 


Hs.42179 


ESTs 


10.8 


118475 N66845 




ft* ■ ft ■ ftft I m i ft t hi A ft ft ft Al A 

ESTs; Weakly similar to 111) ALU CUSS 


4.5 


118493 N67149 


Hs.50115 


ESTs 


53 


118526 N67889 


Hs.49397 


ESTs 


10.4 


118542 N68010 


Hs.49427 


■ftp AAft 

ESTs 


7.9 


118600 N69222 




ESTs 


9.2 


118695 N71781 


Hs.50081 


ft 1 • lAhi I A AT ftft ft ftft_ V ft _ >^ 

Homo sapiens mRNA foD length Insert cD 


9.8 


118698 N72113 


Hs.50187 


ESTs 


ft o 

4.3 


118901 N90719 


Hs.94445 


ESTs 


0.1 


118952 N92966 




WAw A*. ft aft l | ■ ft* m a^ ^ftLft AA ^ 1 t rt 1 

ESTs; Highly similar to CGI-90 protein [H 


12^ 


118976 N93629 


Hs.93391 


ESTs 


5 


A A A^. a. A A. k ■ A a f\ fm Am 

118986 N94362 


■ 1 A A wt AAA 

Hs.1 25830 


ESTs 


7.3 


118989 N94439 


Hs.45105 


ESTs 


a rs 

82 


119027 N99256 


Hs.1 14611 


ESTs 


A 

5 


119042 R05316 


Hs.5472 


ESTs 


4 


119075 R36451 


Hs.287820 


fibronectin 1 


6 


119260 T15916 


Hs.102950 


ESTs; Highly similar to coat protein gamm 


4.1 


119271 T16387 


Hs.65328 


ESTs 


12.1 


119298 T23820 


Hs.155478 


cyclinT2 



5.6 


119302 T25725 




ESTs 



14.3 


119341 T62571 


Hs.146388 


microtubule-associated protein 7


4


119495 W35390 


Hs.55533 


ESTs 


5.3 


119580 W42451 


Hs.92260 


high-mobility group protein 2-like 1


5.6 


119602 W46286 


Hs.233694 


ESTs; Weakly similar to ZK1058.5 [C.ele 



6.5 


119620 W47620 


Hs.56009 


2'-5' oligoadenylate synthetase 3


8.1 


119576 W60473 


Hs.57787 


ESTs 


5.5 


119717 W69134 


Hs.57987 


ESTs 


4.6 


119729 W69747 


Hs.94806 



KIAA1062 protein 


4 


119805 W73788 


Hs.43213 


ESTs 


4 


119859 W80702 


Hs.58461 


ESTs 


4.8 


119867 W80852


Hs.250696 


KDEL (Lys-Asp-Glu-Leu) endoplasmic re 


4.2


119873 W81129 


Hs.44865 


Homo sapiens mRNA; cDNA DKFZp586


4.8 


119899 W84767 


Hs.58698 


ESTs 


5.9 


119940 W86779 


Hs.272531 


DKFZP586B0319 protein 


9 


119943 W86835 


Hs.14158


copine III


4.8


119970 W87812 


Hs.93581 


Homo sapiens mRNA; cDNA DKFZp586 


4 


120131 Z38656 


Hs.75887 


coatomer protein complex; subunit alpha


4.2 


120150 Z39549 


Hs.153746


ESTs 


11 


120206 Z40805 


Hs.91668 


ESTs 


82 


120241 Z41815


Hs.65946 


ESTs 


15.6 


120255 AA169752 Hs.5672 


ESTs; Weakly similar to Similarity to Yea


42 


120314 AA194166 


Hs.221040 


KIAA1038 protein 


6.8 


120325 AA195651 


Hs.104106 


ESTs 


152 


120352 AA211400 Hs.193172 


ESTs 


6.8 
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A ^| A JAM A M AAAA#*A 

120428 AA236822 



Hs.173694 


KIAA1097 protein 


0.0 


120524 AA261852 


Hs.192905 


ESTs 


c c 
o.o 



120528 AA262107 


Hs.104413 


ESTs 


A C 


120571 AA280738 


Hs.34892 



ESTs 



4.9 


120649 AA287115 


Hs.192843 


ESTs 


A r 


120655 AA287347 


Hs.238205 


ESTs 


6.7 


120668 AA287833 


Hs.292913 


ESTs 


8.3 


120712 AA292654 


Hs.102506 


eukaryotic translation initiation factor 2 a)

4.6 


120713 AA292655 


Hs.96557 


ESTs 


10.6 


120724 AA293470 


Hs.100747 


ESTs 


5.4


120873 AA358015 




EST 


7.1 


120885 AA365515 


Hs.301872 


ESTs; Moderately similar to !!!! ALU SU

4.6 


120919 AA381125 


Hs.301444 


ESTs 


8.2 


120948 AA397822 


Hs.104650 


ESTs; Highly similar to similar to mago n 


8.6 


120969 AA398116 


Hs.129206 


casein kinase 1; gamma 3 


10.5 


120977 AA398155 


Hs.97600 


ESTs 


10.9 


121103 AA398936 


Hs.97697 


EST 


7.4 


121291 AA401753 


Hs.8166 


lung cancer candidate


5.3 


121320 AA403008 


Hs.301927 


T-cell receptor; alpha (V;D;J;C) 


13.5 


121463 AA411745 


Hs.239681 


ESTs; Weakly similar to KIAA0554 prote


8.9 


121596 AA416740 


Hs.174104 


ESTs 


22.6 


121723 AA419622 


Hs.104800 


ESTs; Weakly similar to Mouse 19.5 mRN


8 


121748 AA421171 


Hs.234545 


ESTs 


5.6 


122125 AA434411 


Hs.98806 


ESTs 


5.3 


122522 AA449444 


Hs.98969 


ESTs 


4 


122655 AA454756 


Hs.97837 


ESTs 


4 


122704 AA456326 


Hs.99445 


ESTs 


62 


122782 AA459894 


Hs.99472 


ESTs 


5.3 


122856 AA463740 


Hs.75367 


Src-like-adapter

13.1 


122882 AA465381 


Hs.108812 


ESTs; Weakly similar to B0041.5 [C.eleg


5.5 


122928 AA476578


Hs.101840 


ESTs 


6.3 


122974 AA478625 


Hs.194215 


ESTs 



6 


122997 AA479295 


Hs.106290 


Kelch motif containing protein 



12.5 


123016 AA480103 


Hs.323231 


ESTs; Weakly similar to alternatively spli 


4.4 


123107 AA486071 


Hs.104207 


ESTs 


8.3 


123111 AA486273 


Hs.191721 


ESTs 



4.2 


123114 AA486407


Hs.129928


ESTs; Moderately similar to KIAA0454 p

52 


123136 AA487449 


Hs.194024 


ESTs 



4.2 


123137 AA487468 


Hs.100686 


ESTs; Weakly similar to secreted cement 


14.6 


123169 AA488692 




ESTs; Weakly similar to Gag-Pol polypro

4.5 


123176 AA489020 


Hs.69233 


ESTs 


52 


123338 AA504249 


Hs.187585 


ESTs 


4 


123436 AA598714 


Hs.223014 


protease; serine; 15 


7.3 


123442 AA598803 


Hs.111496 


ESTs 


5.9 


123449 AA598899


Hs.112493


Homo sapiens mRNA; cDNA DKFZp564


4.1


123494 AA599786 


Hs.112110 


ESTs 


4 


123503 AA600121 


Hs.293156 


ESTs 


12.8 


123533 AA608751 




ESTs; Weakly similar to !!!! ALU SUBFA 


7.9 


123619 AA609200 




ESTs 


23.1 


123673 AA609471 


Hs.158549 


ESTs 


6.6


123729 AA609770


nS.t/QOfc 


membrane component; chromosome n; s


47 


123819 AA620636 


Hs.112264


ESTs 


4 


123960 AA621785 


Hs.287733 


methylmalonate-semialdehyde dehydroge


7.6 


124000 D57317


Hs.74861 


activated RNA polymerase II transcription


4.4 


124006 D60302 


Hs.270016 


ESTs 


20.6 


124012 D80240 


Hs.241471 


HUM5G11A Human fetal brain (TFujiwa 


6.7 


124021 F02859 


Hs.13974 


ESTs 


4.7 


124049 F10523 


Hs.74519 


primase; polypeptide 2A (58kD) 


4.7 


124059 F13673 


Hs.283713 


ESTs 


7.7 


124243 H66710 


Hs.133525 


ESTs 


5.5 


124308 H93575


Hs.241507 


Homo sapiens mRNA; cDNA DKFZp564 


11.4 


124314 H94877


Hs.215766 


GTP-binding protein 


13.7 


124315 H94892 


Hs.288757 


v-ral simian leukemia viral oncogene hom


14 


124350 N21359 


Hs.101282 


Homo sapiens mRNA; cDNA DKFZp434 


8.6 


124352 N21626 


Hs.102406 


ESTs 


7.2 


124357 N22401 




yw37g07.s1 Morton Fetal Cochlea Homo 


5.2 


124390 N29325 


Hs.7535 


ESTs; Highly similar to COBW-like place


7.9 
124438 N40188 Hs.11090 ESTs

124447 N48000 Homo sapiens mRNA; cDNA DKFZp586 

124457 N50114 Hs^66175 ESTs 

124539 N63172 Hs.146409 cell division cycle 42 (GTP-binding prote

124626 N74604 Hs.11090 ESTs

124632 N79515 Hs.306117 interleukin 13 receptor; alpha 1

124644 N91279 Hs.109654 ESTs; Moderately similar to outer membr

124676 R01037 Hs.181013 phosphoglycerate mutase 1 (brain)

124677 R01073 ESTs; Weakly similar to !!!! ALU CLASS
124724 R12405 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586

124773 R40923 Hs.105604 ESTs

124777 R41933 ESTs 

124792 R44357 Hs.48712 ESTs; Weakly similar to cDNA EST EMB 

124857 R63652 Hs.137190 ESTs 

124911 R88992 Hs.180612 ESTs

124955 T10598 Hs.324841 ESTs; Weakly similar to !!!! ALU SUBFA

124958 T11134 Hs.431 murine leukemia viral (bmi-1) oncogene h

125038 T78089 Hs.270134 ESTs

125092 T92544 Hs.137548 CD84 antigen (leukocyte antigen)

125132 W15495 Hs.129781 chromosome 21 open reading frame 5

125144 W37999 Hs.24336 ESTs

125154 W38419 ESTs 

125243 W86423 Hs.105413 ESTs 

125279 W93640 Hs.4779 ESTs; Moderately similar to similar to AD 

125299 Z39436 Hs.102720 ESTs

125303 Z39821 Hs.288193 ESTs

125304 Z39833 Hs.124940 GTP-binding protein

125474 AA151216 Hs.75103 tyrosine 3-monooxygenase/tryptophan 5-m

125509 AA044232 Hs.288967 ESTs

125580 AA126504 Hs.267812 sorting nexin 4

125582 AA507383 Hs.74649 cytochrome c oxidase subunit VIc

125670 AI432621 Hs.82685 CD47 antigen (Rh-related antigen; integri

125698 AA748483 Hs.191356 general transcription factor IIH; polypepti

125745 AI283493 Hs.75722 ribophorin II

125852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564

125972 AA434562 Hs.35406 ESTs 

126160 N90960 Hs.265398 ESTs; Weakly similar to transformations 

126257 N99638 Hs.124084 tumor necrosis factor receptor superfamily

126337 AI066486 Hs.40500 similar to S. cerevisiae RER1

126405 U46278 Hs.122489 ESTs

126537 W40262 Hs.146310 ESTs; Weakly similar to putative p150 [H

126590 W78968 Hs.181307 H3 histone; family 3A

126712 AA205862 Hs.7942 ESTs

126721 T72569 Hs.125359 Thy-1 cell surface antigen

126764 AI334393 Hs.102178 ESTs

126804 AI203334 Hs.160628 ESTs 

126819 AA305536 Hs.279607 ESTs 

126877 AI052047 Hs.26102 ESTs 

126991 R31652 Hs.821 biglycan 

127479 AA513722 Hs.179729 collagen; type X; alpha 1 (Schmid metaph

127514 AA826926 Hs.204214 ESTs

127663 W07286 Hs.10340 ESTs; Weakly similar to weak similarity t

127677 AA916752 Hs.264190 ESTs; Highly similar to MEM3 [M.muscu

127814 AA761755 Hs.136713 ESTs; Weakly similar to V4-1 [H.sapiens

127997 AI281549 Hs.311054 ESTs

128092 AA904617 Hs.166229 ESTs

128218 H02682 Hs.292154 ESTs; Moderately similar to recombinatio

128466 D59653 Hs.241471 EST

128482 U83908 Hs.296251 programmed cell death 4

128517 AA280617 Hs.100861 ESTs; Weakly similar to p60 katanin [H.s

128530 AA504343 Hs.183475 Homo sapiens clone 25061 mRNA sequen

128559 AA226801 Hs.101448 metastasis associated 1

128574 AA412048 Hs.38260 keratin 8

128595 U31875 Hs.152677 short-chain alcohol dehydrogenase family

128610 L38608 Hs.10247 activated leucocyte cell adhesion molecule

128629 AA399187 Hs.102708 DKFZP434A043 protein 

128649 AA142853 Hs.103106 Homo sapiens mRNA for G7b protein (G 
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7 
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128651 


AA446990 


Hs.103135 


ESTs 


C 4 
0.1 


128653 


R48943 


Hs.10315 


solute carrier family 7 (cationic amino ac

4.4 


128656 


AA458542 


Hs.10326 


coatomer protein complex; subunit epsilon


143 


128717 


T30617 


Hs.104222 



Homo sapiens mRNA; cDNA DKFZp566 


24.5 


128727 


M64174 


Hs.50651 


Janus kinase 1 (a protein tyrosine kinase) 



7.3 


128764 


N49308 


Hs.104938 


ESTs; Weakly similar to alpha 1 (XVIII) c

9.2 


128781 


X85372 


Hs.105465 


small nuclear ribonucleoprotein polypepti

5.4 


128793 


W93562 


Hs.105749


KIAA0553 protein

4.6 


128835 


W15528 


Hs.106390 



Homo sapiens mRNA; cDNA DKFZp586 



4 


128845 


AA455658 


Hs.10649 


basement membrane-induced gene 


6.9 


128871 


AA400271 


Hs.106778 



Homo sapiens mRNA for putative Ca2+4 


4.5 


128922 


AA252023 Hs.9589 


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Hs,181301 

Hs.125849 

Hs.146428 

Hs.91379 

Hs.147097 

Hs.197955 

Hs.14992 

Hs.1504 

Hs.21635 

Hs.151469 

Hs.151761 

Hs.15250 

Hs.23703 

Hs.153053 

Hs.169149 

Hs.153937 

Hs.154320 

Hs.154332 

Hs.154668 

Hs.155017 

Hs.8768 

Hs.155188 

Hs.166071 

Hs.155291 

Hs.155356 

Hs.155410

Hs.241392 

Hs.155485 

Hs.178710 

Hs.155637 

Hs.155956 

Hs.180446 

Hs.158286 

Hs.1584 

Hs.252587 

Hs.15984 

Hs.16085 

Hs.293907 

Hs.16331 

Hs.1657 

Hs.16697 

Hs.12532 

Hs.16846

Hs.260720 

Hs.1695 

Hs.1697 

Hs.1706 

Hs.17132 

Hs.17767 

Hs.17820 

Hs.17962 

Hs.18103 

Hs.201673 

Hs.279762 

Hs.18212 

Hs.171952 

Hs.18747 

Hs.6879 

Hs.18879 

Hs.19525 

Hs.19574 

Hs.143323 

Hs.20478 

Hs.2055 

Hs.20830 

Hs.20991 
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Hs.21276 
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Hs.21486 

Hs.2178 

Hs.21893 

Hs.21992 

Hs.169341 

Hs.2248 

Hs.22880 

Hs.23606 

Hs.23960 

Hs.293732 

Hs.24210 

Hs.95549 

Hs.236522 

Hs.24763 

Hs.24766 

Hs.24908

Hs.255?0 

Hs.25812 

Hs.92200 

Hs.279836 

Hs.27258 

Hs.27263 

Hs.8207 

Hs.2795
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Hs.28309 

Hs.28555 
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Hs.28777 

Hs.28792

Hs.26966 
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Hs.30327 
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Hs.1 10796 
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Hs.31433 

Hs.107767 
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Hs.3260 
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Hs.180628 
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Hs.3622 
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ESTs 
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ESTs 

calcyclin binding protein
ESTs

H3 histone; family 3A
lactate dehydrogenase A
ESTs

UDP-glucose dehydrogenase

ESTs; Weakly similar to B0511.8 [C.eleg
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H2A histone family; member L 
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MRS1 protein 
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ESTs 
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ESTs 

KIAA0205 gene product 
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Homo sapiens mRNA; cDNA DKFZp586 
procollagen-proline; 2-oxoglutarate 4-diox
132021 T68246 Hs.306079 chaperonin containing TCP1; subunit 5 (e

132065 D82226 Hs.211594 proteasome (prosome; macropain) 26S sub

132085 D44486 Hs.3887 proteasome (prosome; macropain) 26S sub

132089 AA131971 Hs.39122 ESTs

132109 AA599801 Hs.40098 ESTs

132143 AA257056 Hs.7972 KIAA0871 protein 

132149 T10822 Hs.324743 ESTs 

132153 N90141 Hs.41066 ESTs; Moderately similar to ELONGATI

132160 AA281770 Hs.295923 seven in absentia (Drosophila) homolog 1

132164 U84573 Hs.41270 procollagen-lysine; 2-oxoglutarate 5-dioxy

132180 AA405569 Hs.418 fibroblast activation protein; alpha; sepras

132183 L19183 Hs.199695 hypothetical protein

132225 AA128980 ESTs

132227 AA412620 Hs.4248 ESTs

132235 F09058 Hs.42656 ESTs

132256 AA608856 Hs.431 murine leukemia viral (bmi-1) oncogene h 

132298 N41849 Hs.7120 Homo sapiens cytokine receptor related p 

132314 AA285290 Hs.44499 small EDRK-rich factor 2

132325 N37065 Hs.44856 ESTs

132384 AA479933 Hs.46967 Human DNA sequence from clone 167A1

132387 R70914 Hs.281434 heat shock 70kD protein 1

132393 W85888 Hs.47334 ESTs; Moderately similar to !!!! ALU SU

132406 F09979 Hs.4774 ESTs

132407 AA431459 Hs.47783 ESTs

132413 AA132969 Hs.260116 KIAA1104 protein

132446 AA426218 Hs.48764 ESTs 

132465 AA047896 Hs.49169 ESTs 

132482 AA429478 Hs.238126 ESTs; Highly similar to CGI-49 protein [H 

132492 T03749 Hs.4990 KIAA1089 protein 

132528 AA283006 Hs.50758 chromosome-associated polypeptide C

132540 AA488987 Hs.5097 synaptogyrin 2 

132543 AA417152 Hs.5101 protein regulator of cytokinesis 1 

132580 L37042 Hs.283738 casein kinase 1; alpha 1 

132586 AA412452 Hs.52515 DKFZP434N024 protein 

132608 AA199588 Hs.5321 ARP3 (actin-related protein 3; yeast) hom

132616 AA386264 Hs.283558 isocitrate dehydrogenase 2 (NADP+); mit

132617 AA171913 Hs.5338 carbonic anhydrase XII

132618 AA253330 Hs.279916 adaptor-related protein complex 1; gamma

132640 U33821 Hs.5437 Tax1 (human T-cell leukemia virus type I

132668 AA453614 Hs.5460 KIAA0776 protein

132694 M60830 Hs.5509 ecotropic viral integration site 2B 

132700 N47109 Hs.5521 ESTs 

132724 AA417962 Hs.55498 geranylgeranyl diphosphate synthase 1 

132738 W42674 Hs.264636 ESTs; Moderately similar to neuronal thre 

132742 AA490862 Hs.292812 ESTs; Weakly similar to C43H8.1 [C.eleg

132744 X54326 Hs.55921 glutamyl-prolyl-tRNA synthetase

132795 H99152 Hs.57079 ESTs

132807 AA331777 Hs.57301 mutL (E. coli) homolog 1 (colon cancer; h

132811 U25435 Hs.57419 transcriptional repressor

132817 AB004884 Hs.57553 tousled-like kinase 2

132840 N23817 Hs.5807 Homo sapiens clone 23675 mRNA sequen 

132845 D62588 Hs.5813 ESTs 

132847 T48195 Hs.58189 eukaryotic translation initiation factor 3; s

132856 W79865 Hs.58367 glypican 4

132869 N26855 Hs.203961 ESTs

132874 AA425776 Hs.58609 ESTs 

132880 AA444369 Hs.177537 ESTs 

132894 D82422 Hs.5944 ESTs 

132900 N56451 Hs.5978 LIM domain only 7

132903 AA235404 Hs.5985 Homo sapiens clone 25186 mRNA sequen

132904 X83618 Hs.59889 3-hydroxy-3-methylglutaryl-CoenzymeA 

132906 AA142857 Hs.234896 ESTs; Highly similar to geminin [H.sapie 

132914 AA496037 Hs.60293 ESTs 

132918 AA252605 Hs.6051 KIAA0616 protein 

132936 AB002305 Hs.6111 KIAA0307 gene product

132951 U04209 Hs.61418 microfibrillar-associated protein 1

132957 AA234791 Hs.61469 Human gene from PAC 753P9; chromoso 
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132959 AA028103 Hs.61472 ESTs; Weakly similar to unknown [S.cere 18.9 

132988 N77151 Hs.61638 myosin X 5.8 

132984 H80409 Hs.62112 zinc finger protein 207 4.3 

132990 AA458761 Hs.18387 transcription factor AP-2 alpha (activating 4.2 

132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose 26.4 

132998 Y00062 Hs.170121 protein tyrosine phosphatase; receptor typ 4.4 

133002 AF006082 Hs.42915 ARP2 (actin-related protein 2; yeast) horn 4.7 

133005 C21400 Hs.278605 KIAA0970 protein 6.6

133015 AA047036 Hs.246315 ESTs 7.9 

133016 W81298 Hs.6289 growth factor receptor-bound protein 2 5.2 
133039 X62055 Hs.63489 protein tyrosine phosphatase; non-recepto 4 
133050 S67325 Hs.63788 propionyl Coenzyme A carboxylase; beta 5.2
133056 AA071387 Hs.6396 jumping translocation breakpoint 5
133062 R33663 Hs.64056 ESTs 5.4
133083 N70633 Hs.6456 chaperonin containing TCP1; subunit 2 (b 6
133091 AA122147 Hs.64691 KIAA0483 protein 5
133093 AA598749 Hs.285996 ESTs 5.6
133124 AA156049 Hs.267923 ESTs 4.1
133126 D16469 Hs.6551 ATPase; H+ transporting; lysosomal (vacu 6.2
133196 R37367 Hs.6727 Ras-GTPase activating protein SH3 doma 5.1
133214 Y10659 Hs.285115 interleukin 13 receptor; alpha 1 6.2
133225 Z41415 Hs.6823 ESTs; Weakly similar to intrinsic factor-B 8.3
133228 N90029 Hs.6831 Homo sapiens clone 1400 unknown prote 4.7

133239 AA059405 Hs.179882 Homo sapiens clone 24655 mRNA sequen 5.5

133240 D31161 Hs.242894 ESTs 9 
133257 AF006086 Hs.6895 actin related protein 2/3 complex; subunit 7.7 
133264 W72187 Hs.69192 ESTs; Weakly similar to cDNA EST yk37 6.7
133274 AA488886 Hs.6949 ESTs 4.2
133281 AA421079 Hs.69594 ESTs; Weakly similar to Sox-like transcri 4.9
133283 AA410507 Hs.6968 ESTs 4.3
133287 L15702 Hs.69771 B-factor; properdin 9.3
133294 R79723 Hs.69997 zinc finger protein 238 30.4
133297 AA600057 Hs.70266 KIAA0905 protein 10.4
133318 AA256168 Hs.152316 ESTs 8.5
133362 H06195 Hs.7194 ESTs; Highly similar to CGI-59 protein [H 14
133370 AA156897 Hs.72157 DKFZP564I1922 protein 5
133391 X57579 Hs.727 inhibin; beta A (activin A; activin AB alp 13.9
133395 AA491296 Hs.72805 ESTs 4.3 
133422 N79516 Hs.73287 ESTs; Weakly similar to eyelid p.melano 4.5 
133431 AA255438 Hs.7358 Homo sapiens mRNA; cDNA DKFZp566 8
133435 T23983 Hs.323966 ESTs 5
133449 AA094989 Hs.7381 voltage-dependent anion channel 3 8.7
133468 X03068 Hs.73931 major histocompatibility complex; class II 5
133484 X78710 Hs.211581 metal-regulatory transcription factor 1 5.3
133506 AA316868 Hs.74346 ESTs; Weakly similar to 140G1 1 .h pine 6.8
133517 X52947 Hs.74471 gap junction protein; alpha 1; 43kD (conn 5.7
133551 D63480 Hs.278634 KIAA0146 protein 4.8
133569 AA313977 Hs.172772 transcription elongation factor B (SIII); po 9.5
133572 W94333 Hs.279915 translocase of inner mitochondrial membr 5
133577 F03717 Hs.75063 human immunodeficiency virus type I enh 7.4
133589 L37368 Hs.75104 RNA-binding protein S1; serine-rich dom 5
133608 D13315 Hs.75207 glyoxalase I 4.2
133617 AA148318 Hs.75249 KIAA0069 protein 4.5
133627 U09587 Hs.75280 glycyl-tRNA synthetase 10

133633 D21262 Hs.75337 nucleolar phosphoprotein p130 4.5

133634 U24166 Hs.234279 microtubule-associated protein; RP/EB fa 152
133640 D83004 Hs.75355 ubiquitin-conjugating enzyme E2N (homo 9.1
133644 D89077 Hs.75367 Src-like-adapter 6.4
133649 AA479139 Hs.75393 acid phosphatase 1; soluble 4.8
133652 AA287383 Hs.7540 ESTs 4.2
133674 AA458946 Hs.75497 ESTs 4.3
133700 K01396 Hs.297681 protease inhibitor 1 (anti-elastase); alpha- 8.3
133705 N21648 Hs.75659 MpV17 transgene; murine homolog; glom 4.6
133716 Y00282 Hs.75722 ribophorin II 7.5
13372) L27841 Hs.75737 pericentriolar material 1 9.4
133752 U49278 Hs.75875 ubiquitin-conjugating enzyme E2 variant 4.5
133765 D21255 Hs.75929 cadherin 11 (OB-cadherin; osteoblast) 6.4
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133772 
133774 
133776 
133784 
133814 
133829 
133834 
133839 
133842 
133845 
133859 
133867 
133868 
133871 
133893 
133914 
133918 
133944 
133946 
133963 
133980 
133990 
133999 
134030 
134032 
134045 
134046 
134064 
134070 
134087 
134090 
134098 
134110 
134132 
134168 
134170 
134208 
134258 
134288 
134310 
134326 
134329 
134331 
134351 
134357 
134363 
134367 
134374 
134375 
134376 
134381 
134388 
134395 
134399 
134401 
134405 
134415 
134417 
134419 
134421 
134423 
134438 
134446 
134453 
134470 
134487 
134495 



W73693 Hs.76038 isopentenyl-diphosphate delta isomerase

Z23090 Hs.76067 heat shock 27kD protein 1

J03473 Hs.177766 ADP-ribosyltransferase (NAD+; poly (AD

AA214305 Hs.301064 ESTs

M33882 Hs.76391 myxovirus (influenza) resistance 1; homol

AA453783 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564

AA147510 Hs.288660 serine protease; umbilical endothelium

M59815 Hs.170250 complement component 4A

U73477 Hs.285013 putative human HLA class II associated p

T68510 Hs.76704 ESTs

U86762 Hs.178761 26S proteasome-associated pad1 homolog

D43948 Hs.76989 KIAA0097 gene product

U58090 Hs.183874 cullin 4A

AA454597 Hs.182793 ESTs 

X01060 Hs.77356 transferrin receptor (p90; CD71) 

N32811 Hs.77542 ESTs 

W72783 Hs.58382 ESTs; Weakly similar to C13F10.5 [C.ele

AA045870 Hs.7780 Homo sapiens mRNA; cDNA DKFZp564

AA156565 Hs.173878 4-nitrophenylphosphatase domain and non

L34587 Hs.184693 transcription elongation factor B (SIII); po

D00760 Hs.25081 1 proteasome (prosome; macropain) subunit

C02374 Hs.7822 Homo sapiens mRNA; cDNA DKFZp564

M28213 Hs.78305 RAB2; member RAS oncogene family

J03077 Hs.78575 prosaposin (variant Gaucher disease and v

Z81326 Hs.78589 protease inhibitor 12 (neuroserpin)

S82470 Hs.78768 BB1

D28473 Hs.172801 isoleucine-tRNA synthetase

D87685 Hs.78893 KIAA0244 protein

H98621 Hs.78946 cullin 3

U51166 Hs.173824 thymine-DNA glycosylase

M22382 Hs.79037 heat shock 60kD protein 1 (chaperonin)

X06323 Hs.79086 ribosomal protein; mitochondrial; L3

U41060 Hs.79136 UV-1 protein; estrogen regulated 

U32519 Hs.220689 Ras-GTPase-activating protein SH3-doma 

AA398908 Hs.181634 Human Chromosome 16 BAC clone CIT9 

M63138 Hs.79572 cathepsin D (lysosomal aspartyl protease)

U88871 Hs.79993 peroxisomal biogenesis factor 7

L28010 Hs.808 heterogeneous nuclear ribonucleoprotein F

AA430008 Hs.8117 ESTs

AA313414 Hs.8148 Homo sapiens clone 24856 mRNA sequen

U16306 Hs.81800 chondroitin sulfate proteoglycan 2 (versic

D38551 Hs.81848 RAD21 (S. pombe) homolog

AA452020 Hs.111222 ESTs; Weakly similar to CGI-128 protein

R82074 Hs.82109 syndecan 1

L43575 Hs.82171 Human clone 191B7 placenta expressed m

M37033 Hs.82212 CD53 antigen

X54199 Hs.82285 phosphoribosylglycinamide formyltransfe

D62633 Hs.8236 ESTs

AA412720 Hs.82389 ESTs; Highly similar to CGI-118 protein

X02874 Hs.82396 2',5'-oligoadenylate synthetase 1

U56637 Hs.184270 capping protein (actin filament) muscle Z-

M15841 Hs.82575 small nuclear ribonucleoprotein polypepti

L09717 Hs.8262 lysosomal-associated membrane protein 2 

H99801 Hs.82689 tumor rejection antigen (gp96) 1 

AA243746 Hs.211577 kinectin 1 (kinesin receptor)

J04177 Hs.82772 collagen; type XI; alpha 1

AA329274 Hs.82911 protein tyrosine phosphatase type IVA; m

D87969 Hs.82921 solute carrier family 35 (CMP-sialic acid t

L08044 Hs.82961 trefoil factor 3 (intestinal) 

AA122386 Hs.82985 collagen; type V; alpha 2 

W96151 Hs.83006 ESTs; Highly similar to CGM39 protein 

AA449984 Hs.246857 ESTs; Highly similar to proteine kinase JN 

T25732 Hs.83419 KIAA0252 protein 

X70683 Hs.83484 SRY (sex determining region Y)-box 4 

X54942 Hs.83758 CDC28 protein kinase 2 

R38185 Hs.83954 Homo sapiens unknown mRNA 

D63477 Hs.84087 KIAA0143 protein 
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134498 
134508 
134529 
134570 
134582 
134600 
134623 
134654 
134655 
134675 
134711 
134714 
134722 
134776 
134806 
134810 
134840 
134643 
134853 
134866 
134868 
134885 
134982 
134989 
134992 
134993 
135010 
135015 
135029 
135032 
135037 
135059 
135071 
135083 
135117 
135144 
135154 
135218 
135237 
135243 
135335 
135349 
135367 
135389 
135400 
135411 
300019 
300021 
300022 
300089 
300107 
300254 
300328 
300549 
300711 
300921 
301124 
301165 
301576 
301604 
301704 
301782 
301884 
301936 
302002 
302032 
302067 



M63180 Hs.84131 
U45328 Hs.84285 
H24460 Hs.848 
U66615 Hs.172280 
AA234966 Hs.86041 
R68884 Hs.86347 
X7442S Hs.86978 
W23625 Hs.8739 
AA454070 Hs.123090 
AA250745 Hs.87773 
X04011 Hs.88974 
U89922 Hs.890 



threonyl-tRNA synthetase 
ubiquitin-conjugating enzyme E2I (homol
FK506-binding protein 4 (59kD)
SWI/SNF related; matrix associated; actin
CGG triplet repeat binding protein 1 
ESTs; Weakly similar to predicted using Q 



W47183 

J05582 

Z49099 

M27394 

U51477 

H60595 

D82348 

U84011 

Z39762 

N27670 

N46086 



Hs.284226 

Hs.89603 

Hs.89718 

Hs.89751 

Hs.89981 

Hs.90061 

Hs.90280 

Hs.904

Hs.90419 

Hs.907^ 

Hs.92308 



AA236324 Hs.92381 

H05625 Hs.5831 

AA282343 Hs.301005 

D59675 Hs.92927 

U54999 Hs.278338 
AA224180 

AA243497 Hs.173685 

U77948 Hs.278589 

AA598449 Hs.93832 

L08069 Hs.94 

AA495950 Hs.94262 

W52493 Hs.94694 

AA044842 Hs.95260 

AA126433 Hs.267812 

D31157 Hs.324277 

AA454930 Hs.9691 

AA215333 Hs.97101 

H20989 Hs.198281 

D83174 Hs.9930 

AAAQOW Hs.9963 

U05237 Hs.99872 

M23263 Hs.99915 

L10333 Hs.99947 
M97935 
M97935 
M97935 

AI199738 Hs.208275 

AI694585 Hs.270464 

AW079607 Hs.188417 

AW015860 Hs.224623 

AA699328 Hs.298119 

AI492179 Hs.166244 

AW293224 Hs.232165 

T79326 Hs.298262 

N85789 Hs.150186 

AI682905 Hs.270431

AA373124 Hs.24809

AA526313 Hs.293591

N99399 Hs.143046 

AA312082 Hs.105445 
NM_004694 Hs.114924

AF013956 Hs.5637
NM_001992 Hs.128087

K05698 Hs.222399 



ESTs; Weakly similar to ORF YGR200c [ 
ESTs 

protein kinase; cAMP-dependent; catalyti
cytochrome b-245; beta polypeptide (chro
lymphotoxin beta (TNF superfamily; mem
ESTs; Weakly similar to neural F box pro 
mucin 1; transmembrane 
spermine synthase 

membrane-spanning 4-domains; subfamily 
diacylglycerol kinase; zeta (104kD) 
progesterone binding protein
5-aminoimidazole-4-carboxamide ribonuc
amylo-1;6-glucosidase; 4-alpha-glucanotr 
KIAA0882 protein 

progesterone membrane binding protein 
ESTs 

Homo sapiens mRNA; chromosome 1 spe 
ESTs 

purine-rich element binding protein B 
ESTs 

LGN protein 

ESTs; Moderately similar to 17-beta-hydr 

Human DNA sequence from clone 30M3

general transcription factor II; i 

Homo sapiens clone 24483 unknown mRN 

heat shock protein; DNAJ-like 2

ESTs

Homo sapiens clone 24837 mRNA sequen
Homo sapiens mRNA; cDNA DKFZp586
sorting nexin 4

ESTs; Weakly similar to growth factor-res 
ESTs 

putative G protein-coupled receptor 
pyruvate kinase; muscle 
collagen-binding protein 2 (colligin 2)
TYRO protein tyrosine kinase binding pro
fetal Alzheimer antigen
androgen receptor (dihydrotestosterone re
reticulon 1

AFFX control: STAT1 

AFFX control: STAT1 

AFFX control: STAT1 

ESTs; Weakly similar to !!!! ALU CLASS

ESTs; Weakly similar to !!!! ALU CLASS

ESTs; Weakly similar to ZnT-3 [H.sapien

ESTs

ESTs

ESTs; Weakly similar to cDNA EST yk40
ESTs

ESTs; Weakly similar to dJ88J8.1 [H.sapi
ESTs; Weakly similar to PTERIN-4-ALP
ESTs; Weakly similar to !!!! ALU SUBFA
ESTs; Weakly similar to C17G10.1 [C.ele
ESTs

EST cluster (not in UniGene) with exon h
GDNF family receptor alpha 1
EST cluster (not in UniGene) with exon h
chromobox homolog 4 (Drosophila Pc cla
EST cluster (not in UniGene) with exon h
ESTs; Weakly similar to protein-tyrosine



6.1 

4.6 

8.2

4.8 

4.7 

5.8 

4.5 

13.7 

5.8 

8.9 

6.8 

35.7 

8.1 

6.2

4.2 

7 

4.1 

4.7 

10.2 

12.1 

6 

5 

4.1 

16.8 

4 

4.4 
7 

4.8 
13.6 
4 
8 

5.4 

9.3 

6.7 

10.2 

6.6 

7.4 

62 

19.5 

8.8 

12.4 
302145 NM_003613 Hs.151407

302236 AI128606 Hs.6557 
302276 NM_004448 Hs.323910

302290 AL117607 Hs.176563
302326 NM_004271 Hs.184018

302342 AB023141 Hs.190386

302372 AL117406 Hs.200102

302422 AB021227 Hs.3743 

302431 AF129530 Hs.226434 

302501 AF022726 Hs.251446

302505 AL049650 Hs.247874 

302533 L36149 Hs.246116 

302638 AA463798 Hs.102696 

302656 AW293005 Hs.70704 

302792 AA343696 Hs.46621

302820 X04588 Hs.85844 

302838 U66049 Hs.82171 

302892 N58545 Hs.42346

302977 AW263124 Hs.315111

302989 N46406 Hs.84700

303007 AA478876 Hs.317714 

303052 AF140242 Hs.279926 

303131 AW081061 Hs.103180 

303132 AI929819 Hs.4055 
303153 U09759 Hs.246857

303387 AA908797 Hs.180799 

303499 AI815990 Hs.293515 
303502 AA488528 

303576 T07216 Hs.301226 

303620 AA397546 Hs.119151

303634 AI953377 Hs.28444 

303642 AW299459 Hs.111977 

303654 AA436942 Hs.288529 

303733 AW502498 Hs.15220 

303780 AI424014 Hs.18995

303792 C75094 Hs.199839 

303842 AI337304 Hs.126268 

303951 AW475081 Hs.172928 
304465 AA421948 

304507 AA456426
304591 AA505702 
304601 AA507875 
304659 AA533185 

305040 AA630582 Hs.169476 

305134 AA653159 Hs.179661

305415 AA725116 Hs.78465 
305453 AA738110 
305898 AA872838 
305913 AA876109 

305950 AA884479

306004 AA889992 Hs.2186 

306009 AA894560 Hs.283370 

306060 AA906161 Hs.76277 

306398 AA970548 Hs.297681 

306505 AA987722 Hs.172928

306576 AA995761 Hs.276092 

307117 AI184111 Hs.76067 

307138 AI185516 Hs.172928 

307187 AI190870 Hs.276417 

307542 AI280859 Hs.62954

307554 AI281603 Hs.172928

307806 AI351739 Hs.276726

308079 AI472733 Hs.270208 

308307 AI581398 Hs.172928 

308511 AI687580 Hs.169476

308615 AI738593 Hs.101774 
308677 AI761173 



EST cluster (not in UniGene) with exon h 
zinc finger protein 161 
EST cluster (not in UniGene) with exon h
Homo sapiens mRNA; cDNA DKFZp564
EST cluster (not in UniGene) with exon h
KIAA0924 protein

Homo sapiens mRNA; cDNA DKFZp434
matrix metalloproteinase 24 (membrane-in
EST cluster (not in UniGene) with exon h
EST cluster (not in UniGene) with exon h
multiple UniGene matches
chemokine (C motif) XC receptor 1
ESTs; Weakly similar to C11D2.4 [C.eleg
ESTs

ESTs; Weakly similar to putative [H.sapie

EST cluster (not in UniGene) with exon h 

EST cluster (not in UniGene) with exon h 

histone deacetylase 3

EST cluster (not in UniGene) with exon h 

EST cluster (not in UniGene) with exon h 

pallid (mouse) homolog; pallidin

EST cluster (not in UniGene) with exon h

actin-like 6

ESTs 

mitogen-activated protein kinase 9 

ESTs 

ESTs 

EST cluster (not in UniGene) with exon h
EST cluster (not in UniGene) with exon h 
ESTs 

ESTs; Weakly similar to predicted using G 
EST cluster (not in UniGene) with exon h
ESTs

ESTs; Weakly similar to zinc finger prote

ESTs; Moderately similar to KIAA0456 p

ESTs; Highly similar to NG22 [H.sapiens

ESTs; Weakly similar to similar to PDZ d 

collagen; type I; alpha 1 

EST singleton (not in UniGene) with exon 

EST 

EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
glyceraldehyde-3-phosphate dehydrogena
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon 
keratin 8 

EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon
EST singleton (not in UniGene) with exon 
heat shock 27kD protein 1 
collagen; type I; alpha 1 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not In UniGene) with exon 
ESTs 

collagen; type I; alpha 1 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
EST singleton (not in UniGene) with exon 
316136 AA830808 Hs.124366 
316177 AI908272 Hs.293102 
316313 AA741300 Hs.202599 
316405 AA757900 Hs.270823 
316480 AI749921 Hs.205377 
316564 AI743571 Hs.168799 

316714 AA809792 Hs.123307 

316715 AI440266 Hs.170673 
316828 AA828116 Hs.173076 

316869 AI954880 Hs.134604 
316905 AW138241 Hs.210846 
316943 AW014875 Hs.137007 
316949 AA856749 Hs.124620 
317008 AW051597 Hs.143707 

317028 AA962623 Hs.189144 
317067 AI805392 Hs.325335 
317069 AI732892 Hs.190489 
317210 AA490718 
317298 AI922374 Hs.158549 

317658 AW139077 Hs.202217 
317674 AW294909 Hs.132208 
317685 AI798630 Hs.149997 
317836 AA983913 Hs.128929 
317681 AI827248 Hs.224398 

317902 AI828602 Hs.211265 
317916 AI565071 Hs.159983 
318042 AW294522 Hs.149991 
318053 AI074465 Hs.133469 
318064 AW296888 Hs.170939 

318070 AI024594 Hs.248942 
318073 AW167087 Hs.131562 
318146 AI040125 Hs.150521 
318186 AW016773 Hs.3709 
318481 AI291584 Hs.145921 

318566 AI335361 Hs.226376 
318817 AW247252 Hs.75514 
318662 AI285898 Hs.294014 
318691 AW192139 Hs.181307 
318740 NM_002543 Hs.77729 

318744 AI793124 Hs.144479 
318948 AA317274 Hs.13996 
319163 F15257 Hs.27 
319478 R08841 Hs.270307 
319545 R83716 Hs.14355 

319668 NM_002731 Hs.87773 
319763 AA460775 Hs.6295 
319913 AA179304 Hs.271586 
319936 W22152 Hs.282929 
319951 AA307665 Hs.14559 

319962 H06350 Hs.135056 
319977 AA632632 
320074 AA321166 Hs.278233 
320092 AF022799 Hs.113292 
320107 AA836461 Hs.291712 

320133 D63271 

320167 AA984373 Hs.90790 
320187 T99949 Hs.303428 
320211 AL039402 Hs.125783 
320401 U90449 Hs.152717 

320458 AI884396 Hs.24131 
320488 R31386 Hs.191791 
320521 N31464 Hs.24743 
320661 AA864846 Hs.115175 
320691 R61576 Hs.313951 

320699 R63161 Hs.118249 
320727 U96044 Hs.181125 
320993 AL050145 Hs.225986 
ESTs 
ESTs 
EST cluster (not in UniGene) 
ESTs 
ESTs 

glycine dehydrogenase (decarboxylating; 

EST cluster (not In UniGene) 

ESTs 

EST cluster (not in UniGene) 
ESTs 

ESTs; Moderately similar to !!!! ALU SU 
EST cluster (not in UniGene) 
ESTs 
ESTs 

EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
calpain 9 (nCL-4) 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 
DEME-6 protein 
EST cluster (not in UniGene) 
EST cluster (not in UniGene) 

hypothetical protein 

EST cluster (not in UniGene) 

EST cluster (not in UniGene) 

Homo sapiens mRNA; cDNA DKFZp586 
321012 AA737314 Hs.194324 EST cluster (not in UniGene) 
321050 AW393497 EST cluster (not in UniGene) 
321051 AF134149 Hs.240395 EST cluster (not in UniGene) 
321171 AI769410 Hs.221461 ESTs 
321192 AA295304 Hs.297939 ESTs; Weakly similar to neogenin [H.sap 
321354 AA078493 EST cluster (not in UniGene) 
321387 H68014 Hs.141278 ESTs; Weakly similar to !!!! ALU SUBFA 
321412 AW366305 Hs.22891 EST cluster (not in UniGene) 
321489 AW392474 Hs.172759 ESTs; Moderately similar to !!!! ALU SU 
321539 N98619 Hs.42915 ARP2 (actin-related protein 2; yeast) hom 
321593 H84762 Hs.253197 ESTs 
321666 028390 Hs.272697 EST cluster (not in UniGene) 
321891 AW157424 Hs.165954 ESTs 
321910 H67085 Hs.271530 ESTs; Weakly similar to !!!! ALU SUBFA 
321953 AW068268 Hs.292833 ESTs; Weakly similar to !!!! ALU CLASS 
321978 N77342 Hs.21851 EST cluster (not in UniGene) 
322017 AA310039 Hs.9192 ESTs 
322026 AA233527 Hs.283675 low density lipoprotein receptor (familial 
322035 AL137517 Hs.306201 EST cluster (not in UniGene) 
322171 AF085968 Hs.48474 EST cluster (not in UniGene) 
322175 AF085975 EST cluster (not in UniGene) 
322236 AL134970 Hs.104222 follistatin-like 1 
322303 W07459 Hs.157601 EST cluster (not in UniGene) 
322735 AA086123 Hs.297856 EST cluster (not in UniGene) 
322777 AA679082 Hs.269947 ESTs 
322818 AW043782 Hs.293616 ESTs 
322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2 
322975 C16391 EST cluster (not in UniGene) 
322991 C18965 Hs.159473 ESTs 
323011 AA580288 EST cluster (not in UniGene) 
323091 AW014094 Hs.210761 ESTs 
323107 AI301107 Hs.150790 ESTs 
323136 AL120351 Hs.30177 EST cluster (not in UniGene) 
323168 AL120862 Hs.124165 ESTs 
323195 AI064982 Hs.117950 multifunctional polypeptide similar to SA 
323201 AL049370 Hs.13350 Homo sapiens mRNA; cDNA DKFZp586 
323203 AA203135 Hs.130186 ESTs 
323243 W44372 Hs.110771 EST cluster (not in UniGene) 
323244 T70731 Hs.193620 EST cluster (not in UniGene) 
323328 AA228078 Hs.255096 EST cluster (not in UniGene) 
323332 AI829520 Hs.227513 ESTs 
323333 AA228883 Hs.208558 EST cluster (not in UniGene) 
323570 AL038623 Hs.208752 ESTs; Weakly similar to !!!! ALU SUBFA 
323604 AI751438 Hs.41271 ESTs; Weakly similar to !!!! ALU SUBFA 
323685 AA344205 Hs.289088 EST cluster (not in UniGene) 
323753 AA327102 Hs.70266 EST cluster (not in UniGene) 
323817 AA410943 EST cluster (not in UniGene) 
323845 AI684674 Hs.41127 ESTs; Weakly similar to wadaw p.melan 
323930 AA570698 Hs.8173 ESTs 
323997 AA844907 Hs.274454 EST cluster (not in UniGene) 
324047 AA378201 Hs.271340 EST cluster (not in UniGene) 
324261 AL044891 Hs.269350 EST cluster (not in UniGene) 
324302 AA543008 Hs.292471 ESTs; Weakly similar to !!!! ALU SUBFA 
324338 AL138357 Hs.145078 ESTs 
324344 AW502000 Hs.46677 EST cluster (not in UniGene) 
324432 AA464510 Hs.152812 EST cluster (not in UniGene) 
324495 AW501411 Hs.122489 ESTs; Weakly similar to !!!! ALU CLASS 
324497 AW152624 Hs.136340 ESTs 
324598 AA502659 Hs.163986 ESTs 
324603 AW016378 Hs.292934 ESTs 
324620 AA448021 Hs.94109 EST cluster (not in UniGene) 
324727 AI610425 Hs.19597 ESTs 
324774 AI031771 Hs.132586 ESTs 
324783 AA640770 Hs.2G0994 EST cluster (not in UniGene) 
324824 AI826999 Hs.224624 ESTs 
324826 AA704806 Hs.143842 ESTs 
324902 D31323 Hs.271492 ESTs 
CH.21_hs gi|6117842 14.7 
327196 CH.01_hs gi|[illegible] 5.1 
327283 CH.01_hs gi|5867478 4.3 
327313 CH.01_hs gi|5867501 4.8 
327450 CH.02_hs gi|5867766 4.1 
328059 CH.06_hs gi|6117819 6.2 
328304 CH.07_hs gi|6004478 5.4 
328492 CH.07_hs gi|5868455 7 
328857 CH.07_hs gi|6381927 5.2 
329367 CH.X_hs gi|5868842 7.6 
329373 CH.X_hs gi|6682537 12 
329655 CH.14_p2 gi|6448516 4 
329899 CH.15_p2 gi|6563505 4 
329960 CH.16_p2 gi|5091594 7.6 
330084 CH.19_p2 gi|6015302 4 


330384 M23263 androgen receptor (dihydrotestosterone re 5.8 
330385 AA449749 ESTs; Highly similar to secreted apoptosi 10.2 
330387 H14624 ESTs; Highly similar to secreted apoptosi 4.4 
330388 X03363 HER2 receptor tyrosine kinase (c-erbB-2; 17.7 
330409 D50692 Hs.78221 c-myc binding protein 10.1 
330460 TIGR:HT544 Hs.73946 Endoth 
330486 M13755 Hs.833 interferon-stimulated protein; 15 kDa 67 
330494 M29696 Hs.237868 interleukin 7 receptor 6 
330500 M34423 Hs.79222 galactosidase; beta 1 13.1 
330510 M75099 Hs.227729 FK506-binding protein 2 (13kD) 29 
330513 M81057 Hs.180884 carboxypeptidase B1 (tissue) 38.5 
330541 U22970 Hs.265827 multiple UniGene matches 7.4 
330542 U23942 Hs.226213 cytochrome P450; 51 (lanosterol 14-alpha 15 
330547 U32989 Hs.183671 tryptophan 2;3-dioxygenase 11 
330551 U39840 Hs.299867 hepatocyte nuclear factor 3; alpha 6.5 
330562 U49082 Hs.76460 transporter protein 7.7 
330573 U62800 Hs.83393 cystatin E/M 4 
330673 D57823 Hs.321403 Sec23 (S. cerevisiae) homolog A 10.5 
330711 AA164687 Hs.177576 mannosyl (alpha-1;3-)-glycoprotein beta-1 24.3 
330814 AA015730 Hs.265398 ESTs; Weakly similar to transformation-r 44.1 
330850 AA075298 Hs.322710 ESTs 4.4 
330874 AA127474 Hs.191157 ESTs; Weakly similar to !!!! ALU SUBFA 8.1 
330884 AA133457 Hs.102548 ESTs 5.2 
330912 AA195936 Hs.82719 general transcription factor IIA; 1 (37kD a 5 
330924 AA232136 Hs.159737 Homo sapiens mRNA; cDNA DKFZp434 9.1 
330997 H55762 Hs.9302 ESTs 7.6 
331014 H98597 Hs.30340 ESTs 13.5 
331024 N32919 Hs.27931 ESTs 9.1 
331046 N66563 Hs.191358 ESTs 10.5 
331135 R61398 Hs.4197 ESTs 7.4 
331145 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME 41.9 
331148 R73816 Hs.17385 ESTs 4.7 
331222 T98531 Hs.173904 ESTs 4.1 
331230 W69807 Hs.16537 hypothetical protein; similar to (U06944) 4.9 
331306 AA252079 Hs.63931 dachshund (Drosophila) homolog 15.1 
331327 AA281076 Hs.109221 ESTs 4.8 
331337 AA287662 Hs.50495 ESTs 7.6 
331341 AA303125 Hs.23240 ESTs; Weakly similar to !!!! ALU SUBFA 13 
331344 AA357927 Hs.126550 ESTs 12.4 
331362 AA417956 Hs.40782 ESTs 6.5 
331363 AA421562 Hs.91011 anterior gradient 2 (Xenopus laevis) homo 282 
331376 AA443802 Hs.41007 ESTs; Weakly similar to cDNA EST yk47 15.1 
331384 AA456001 Hs.93847 ESTs 7.9 
331478 N26606 Hs.40839 ESTs 7 
331526 N49987 Hs.46624 ESTs 19.8 
331533 N51517 Hs.47282 ESTs 6.5 
331681 W85712 Hs.119571 collagen; type III; alpha 1 (Ehlers-Danlos 13.8 
331686 W88502 Hs.182258 ESTs 9.9 
331750 AA284372 Hs.111471 ESTs 5.6 
331751 AA284840 Hs.143818 ESTs 5.8 
331760 AA292721 Hs.154434 ESTs; Weakly similar to unknown [H.sap 7.4 
331763 AA312861 Hs.96704 ESTs 7.8 
331825 AA411144 Hs.292862 ESTs 152 
331890 AA432166 Hs.3577 succinate dehydrogenase complex; subuni 24.3 
331952 AA454756 Hs.97837 ESTs 5 
332015 AA487910 Hs.208800 ESTs; Weakly similar to !!!! ALU CLASS 10.5 
332043 AA490831 Hs.125056 ESTs 11.4 
332060 AA504779 Hs.191402 ESTs 13.6 
332071 AA598594 Hs.205293 ESTs 9.1 
332093 AA608794 Hs.112592 ESTs 8.8 
332139 AA620669 Hs.112879 EST 9 
332219 N22508 Hs.139315 ESTs 7.1 
332225 N33213 Hs.100425 ESTs 122 
332246 N57927 Hs.120777 ESTs; Weakly similar to RNA POLYME 15.6 
332247 N58172 Hs.109370 ESTs 16.9 
332260 N70088 Hs.138467 ESTs 4 
332269 N91279 Hs.109654 ESTs; Moderately similar to outer membr 8.2 
332336 T96130 Hs.137551 ESTs 7.7 
332340 W15495 Hs.129781 chromosome 21 open reading frame 5 14.1 
332347 W60326 Hs.288684 ESTs 4.4 
332362 W93640 Hs.4779 ESTs; Moderately similar to similar to AO 165 
332467 AA489630 Hs.119004 KIAA0665 gene product 4.8 
332499 M12036 Hs.323910 Human tyrosine kinase-type receptor (HE 10.4 
332513 AA018162 Hs.154424 deiodinase; iodothyronine; type II 5.6 
332526 AA281753 Hs.77515 inositol 1;4;5-triphosphate receptor; type 19 
332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHAN 153 
332565 AA234896 Hs.25272 E1A binding protein p300 12.3 
332607 R41791 Hs.36566 LIM domain kinase 1 11.1 
332640 AA417152 Hs.5101 protein regulator of cytokinesis 1 182 
332694 AA262768 Hs.243901 KIAA1067 protein 152 
332702 H9396& Hs.75725 transgelin 2 4.7 
332705 T59161 Hs.76293 thymosin; beta 10 5.5 
332749 AA479968 Hs.88251 arylsulfatase A 9.8 


332927 CH22_FGENES.38_1 17.7 
332929 CH22_FGENES.38_3 4.7 
332930 CH22_FGENES.38_4 7.4 
332955 CH22_FGENES.48_12 5.4 
332958 CH22_FGENES.48_15 17.8 
332961 CH22_FGENES.48_18 10.6 
332983 CH22_FGENES.54_5 4.3 
333009 CH22_FGENES.61_1 52 
333010 CH22_FGENES.61_2 8.1 
333013 CH22_FGENES.61_5 8.5 
333108 CH22_FGENES.79_14 5.6 
333139 CH22_FGENES.83_16 6.3 
333254 CH22_FGENES.118_ J 2 6.8 
333305 CH22_FGENES.137_2 11.4 
333343 CH22_FGENES.139J2 5.1 
333388 CH22_FGENES.144_3 12.7 
333456 CH22_FGENES.157_5 4.2 
333459 CH22_FGENES.157_8 7.6 
333517 CH22_FGENES.173_2 8.2 
333585 CH22_FGENES.203_4 5 
333679 CH22_FGENES.247_6 4.3 
333743 CH22_FGENES.264_1 13.4 
333758 CH22_FGENES.268J 4 
333767 CH22_FGENES.271_6 5.6 
333768 CH22_FGENES.271_7 122 
333769 CH22_FGENES.271_8 483 
333795 CH22_FGENES.275_1 6.1 
333796 CH22_FGENES.275_3 [illegible] 
333892 CH22_FGENES.292_14 
333904 CH22_FGENES.294_2 0.5 
333905 CH22_FGENES.294_3 9.3 
333921 CH22_FGENES.296_12 [illegible] 
333968 CH22_FGENES.307_4 15.9 
334102 CH22_FGENES.327J50 7.1 
334222 CH22_FGENES.360_3 6.7 
334223 CH22_FGENES.360_4 33.5 
334264 CH22_FGENES.367_15 10.5 
334343 CH22_FGENES.375_25 6.1 
334360 CH22_FGENES.378_5 6.1 
334784 CH22_FGENES.432_9 4.8 
334789 CH22_FGENES.432_14 5.1 
334794 CH22_FGENES.434.J2 7 
334889 CH22_FGENES.452_3 12.4 
335004 CH22_FGENES.472_8 7.9 
335115 CH22_FGENES.496_2 183 
335287 CH22_FGENES.526_11 4.5 
335342 CH22_FGENES.536_1 5.3 
335491 CH22_FGENES.570_23 24 
335495 CH22_FGENES.570_28 7 


335498 CH22_FGENES.571_7 12.2 
335544 CH22_FGENES.576_5 8.4 
335610 CH22_FGENES.583_4 12.9 
335653 CH22_FGENES.590_4 6.7 
335682 CH22_FGENES.595J2 12.1 
335687 CH22_FGENES.596J 13.9 
335755 CH22_FGENES.604_4 11[illegible] 
335782 CH22_FGENES.609_4 17.9 
335791 CH22_FGENES.611_7 27.3 
335809 CH22_FGENES.617_6 19[illegible] 
335822 CH22_FGENES.619_7 19.1 
335823 CH22_FGENES.619_8 4.5 
335824 CH22_FGENES.619_11 40.2 
335825 CH22_FGENES.619_12 34.3 
335895 CH22_FGENES.635_3 102 
335917 CH22_FGENES.636_13 6 
335920 CH22_FGENES.636_16 8.8 
336035 CH22_FGENES.678_6 5.9 
336042 CH22_FGENES.679_4 5.8 
336093 CH22_FGENES.691_2 11.6 
336098 CH22_FGENES.691_5 7.6 
336150 CH22_FGENES.706J 6.3 
336152 CH22_FGENES.706_9 10.5 
336416 CH22_FGENES.823_38 5 
336444 CH22_FGENES.827_10 4.8 
336449 CH22_FGENES.829_6 13.6 
336471 CH22_FGENES.829_30 6.9 
336512 CH22_FGENES.834_7 21.4 
336558 CH22_FGENES.842J3 82 
336560 CH22_FGENES.842_5 9 


336676 CH22_FGENES.43-4 9.4 
336959 CH22_FGENES.367-[illegible]3 19 
337968 CH22_EM:AC005500.GENSCAN.103-2 13.4 
338008 CH22_EM:AC005500.GENSCAN.127-9 15.2 
338057 CH22_EM:AC005500.GENSCAN.16[illegible]-1 13.9 
338410 CH22_EM:AC005500.GENSCAN.[illegible]41-6 8 
338451 CH22_EM:AC005500.GENSCAN.[illegible]59-[illegible] 11.6 
338588 CH22_EM:AC005500.GENSCAN.432-1 10.3 
338665 CH22_EM:AC005500.GENSCAN.464-2 4.8 
338689 CH22_EM:AC005500.GENSCAN.47[illegible] 6.7 
338832 CH22_DJ246D7.GENSCAN.6-9 4.8 
338980 CH22_DA59H18.GENSCAN.2-4 5.1 
339352 CH22_BA354I12.GENSCAN.29-7 6.9 
339373 CH22_BA232E17.GENSCAN.1-29 4.3 
TABLE 13A 

Table 13A shows the accession numbers for those pkeys lacking UniGene IDs in Table 13. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 



Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 
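
The three columns defined above map onto a simple record. The following is a minimal sketch in Python, assuming each row is whitespace-separated with the Pkey first, the CAT number second, and every remaining field a Genbank accession. The names `ClusterRow`, `parse_cluster_row` and `index_by_pkey` are hypothetical and are not part of the original disclosure; the two sample rows are taken from this table.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ClusterRow:
    """One row of Table 13A (field names are illustrative assumptions)."""
    pkey: int              # unique Eos probeset identifier number
    cat_number: str        # gene cluster number, kept as text (e.g. "4380_7")
    accessions: List[str]  # Genbank accession numbers in the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split a whitespace-separated row into Pkey, CAT number and accessions."""
    fields = line.split()
    if len(fields) < 2 or not fields[0].isdigit():
        raise ValueError(f"not a Table 13A row: {line!r}")
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


def index_by_pkey(lines: List[str]) -> Dict[int, ClusterRow]:
    """Build a Pkey -> row lookup so that Table 13 entries can be resolved."""
    return {row.pkey: row for row in map(parse_cluster_row, lines)}


sample_rows = [
    "312772 4380_7 AW962299 AA310349 AW962294 H63791 H63751",
    "118600 genbank_N69222 N69222",
]
table_13a = index_by_pkey(sample_rows)
print(table_13a[312772].cat_number, len(table_13a[312772].accessions))
```

Keeping the CAT number as a string is a deliberate choice in this sketch, since values such as `genbank_N69222` and `4380_7` are identifiers rather than quantities.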



Pkey CAT number Accession 



123619 371681_1 AA602964 AA609200 
103207 30635_-4 X72790 
103349 11052_-2 X89059 
110856 19346J4 AA992380 N33063 N21418 H79958 R21911 H79957 
113248 328626_1 T63857 AW971220 AA493469 T63699 
123169 44573_2 AI950087 N70208 R97040 N36809 AI308119 AW967677 N35320 AI251473 H59397 AW971573 R97278 W01059 AW967671 
AA908598 AA251875 AI820501 AI820532 W87691 T85904 U71456 T82391 BE328571 T75102 R34725 AA884922 BE328517 
AI219788 AA884444 N92578 F13493 AA927794 AI560251 AW874068 AL134043 AW235363 AA663345 AW008282 AA488964 
AA283144 AI890387 AI950344 AI741346 AI689062 AA282915 AW102898 AI872193 AI763273 AW173586 AW150329 AJ653832 
AI762688 AA988777 AA488892 AI356394 AW103813 AI539642 AA642789 AA856975 AW505512 AI961530 AW629970 
BE612881 AW276997 AW513601 AW512843 AA044209 AW856538 AA180009 AA337499 AW961101 AA251669 AA251874 
AI819225 AW205862 AI683338 AI858509 AW276905 AI633006 AA972584 AA908741 AW072629 AW513996 AA293273 
AA969759 N75628 N22388 H84729 H60052 T92487 AI022058 AA780419 AA551005 W80701 AW613456 AI373032 AI564269 
F00531 H83488 W37181 W78802 R66056 AI002839 R67840 AA300207 AW959581 T63226 F04005 
123533 genbank_AA608751 AA608751 
116480 genbank_C14088 C14088 
132225 genbank_AA128980 AA128980 
125154 genbank_W38419 W38419 
116475 genbank_N66845 N66845 
102919 25180_2 

M21191 AL035748 AA021266 AA323126 AA180515 AI613029 D28356 NM_000034 M11560 AW401425 AW248248 AA012956 
AA323294 W04965 H38759 AA206622 AA580747 AI541172 AA381075 AA354229 AW402353 AW405575 AW404021 AW406207 
AA075752 AW176066 AA287222 AA195818 T20243 R87945 AA295539 AA402533 AA232419 AA224515 AW401583 AA331367 
AW402140 AW249079 R31488 AA075757 X05236 AW239490 AA338036 AW239495 AA357262 AA431005 AA306726 R33804 
AA216544 AW275288 AA227044 AL038124 AA243300 C03242 AA315615 AL035840 R64336 AA313917 AA018963 AA001385 
AA054395 H30840 AW498825 AA086141 AI557324 AA121576 H39128 R77161 AA019688 AA380987 AA348140 AA348257 
AW176086 AA362432 AA171389 AA362416 AA299938 AA319093 AA337972 C04921 AA345696 R89640 AA085425 AA481708 
AA313637 AL039229 H84490 H86153 F00656 AA326668 AA347304 R65890 H41949 AA339309 AW402002 AW404854 
AA192582 AA112802 H09248 N83165 H38367 AA356339 AA455763 R66853 AA294935 H85911 AA310414 H93436 N87014 
AA001186 H83640 AA411328 AA317929 C04192 AW406288 U46335 AA323179 AA427649 AI366131 H14328 AA197161 
AA379497 AA311816 AA017206 AA001137 AA017420 AA012990 AW163775 AA021397 AA295513 AA355248 AA374921 
AW380419 AA345864 AA318058 AA371711 AA363255 AA057094 R88057 AA394045 AW362741 AA479579 AW362789 
AW362775 AA223624 AW362809 AW362746 AW362753 AW380412 AA057104 AA295271 X06352 AW362771 X12447 R59553 
R87925 AA292864 R83920 F01120 AA496740 AA852551 AL039209 AA093339 AA333961 N86416 C03325 C03831 AA194750 
AA359270 AA359460 AW245492 AA339400 AA330784 AI908970 N89223 AW361938 AI940063 AA134323 AA371741 F01267 
AW372970 AA341973 AA346098 AW372969 AA337549 AA327342 H93855 AI074079 F29118 AA852940 F35696 AA345963 
AA079578 AA113785 C02989 AA095945 C03145 C05199 AA346024 AA190506 AW361659 AI909845 AW374374 AW374382 
AW374401 AW374373 AW374370 AI909831 AW374367 AA353658 F01041 C02843 AA375948 AW374414 AA213946 AI525039 
H13744 R31007 AA112044 AA134404 H47935 AW177018 AA429768 AA336873 AA112875 H46393 AA191267 D59131 
AW406037 AA055244 AA341880 AA179024 AA308537 AW406985 AW327311 H30301 AA300705 H43788 AW364149 
AA806213 AA481936 C04941 AW375299 M21190 AA410818 AA250940 AI354547 AA317422 AA250903 AI865497 AA890603 
AA366197 AW498538 M78072 AW406461 C03092 F00308 H56488 AA336320 AW406501 AA354102 AA382942 AA096393 
AW376830 AA383446 F01259 AW081388 T94036 AA379643 H43842 AI524063 AA621727 AA379099 AA371417 R66811 
H44129 N84794 F01135 AA477852 AA293062 AW361595 H27194 AI831650 H43253 H24797 AI564680 AA380090 W20057 
AI921586 AI192549 AW090808 H25967 AA918121 AI626060 H20221 AA812572 H42178 AA887222 H96000 C03180 F00946 
C03986 AI318091 AI860172 AA582179 AI633388 AA557193 R68075 F24105 AW518239 W56622 AI625219 AI925243 
AW468046 AI921828 AA339164 AI144391 AA643334 AA459631 AA873247 AW373432 AA604384 H27600 AI680458 AA159956 
AA610836 AA364298 AW373435 AA604225 W73754 AW087924 AA599776 N89227 AI630871 AI633128 AW514329 AA010455 
AA563928 AI571596 AI128394 W73707 AI423575 AA583809 AA657988 AI950837 AA169782 AA600009 AI885540 AA771884 
AI978829 AA505408 AA533937 AA481469 AA610869 AA775241 AW273870 AW070909 AI905695 AA480115 AA574051 
AI889185 AA773167 AA331375 AA001437 AA194324 AA194300 AA558632 AI038538 AA411329 AA781570 AI833176 
AA935520 AW074197 AA583063 AW073099 AW001198 AA948025 AA587857 AA191540 AI460085 AA193244 AI538037 
AA515572 AA758587 AI149311 AA508610 AA206409 AA534004 AA994600 AA827543 AI916349 AW245129 AW517804 D25663 
AA781985 AA284536 AI819422 H16040 H27531 AA456564 AA645555 AI423596 AA012908 AA889439 AA716311 AA968868 
AA320508 AA725731 AA834202 AA935997 AA724815 AA769353 AA594803 F00827 AI342442 AI003519 AKJ02503 AI347597 
AI040946 AA197162 AA987883 AA292865 AW001944 AI640711 AW244044 AA456784 F30588 AA290829 H24754 AI978683 
AA483686 AA583939 AA121382 AA833831 AA477102 AA977322 AA666379 F35456 AA993537 AI749610 AA226934 AA716204 
AW513025 AA628543 AA583705 F25702 AI368748 AI124097 AI880086 AA477513 AI758834 AI690753 AA477746 F37761 
AA642243 AA159957 AA250844 AA459406 AA427566 F25054 AI569314 AA961665 AI922050 AI759000 AA555236 AA514432 
AA293474 AA001129 AA826789 AA641390 AA134405 F35585 AA477416 AW193359 AI361315 AA284988 F36340 AI361322 
F26969 AA991922 AA021267 F26973 AI361314 F35891 AI918509 AA250964 AA190992 AA577139 AA865535 AA134324 
AW192842 AI224046 F18975 AA779626 AA856894 AW269997 AW014614 H95554 F31378 AA374868 F26343 AA654007 
AI830942 AA113195 F26432 W56652 AA464690 AA055263 AA340654 AA031448 AA976399 AA972526 AA063476 R83921 
T16240 AA533290 N91545 H44053 AA883451 AA513761 AA086477 H09249 F20482 F26737 AA054148 AA857063 AA017259 
AA179789 AA088908 H43704 AA194320 F35950 AI880127 F20441 F32878 AA962483 H39094 H56489 H44621 F19390 
AI302232 F25162 AA826965 AA086052 AA917410 AA454513 R59554 AA196755 AA086369 AA079530 H28106 AA243301 
AI025737 AA101239 AA088887 AI214910 AA974886 F16089 F26054 AA515092 F33436 F32829 M78061 AA235645 F19715 
F37529 AI811549 AA665180 AA708200 F01124 F32382 AA346220 AA627361 F30741 F30010 F28543 AA211715 F20245 
AA331222 F25634 F21996 W28215 F21911 R65793 AI192566 H20130 H84491 AA719223 AA557435 F16967 F26989 F30353 
AA857159 AA291918 F28234 F20840 F25176 F22437 T27904 AA480355 F19528 R87926 H14286 F27532 AW337864 F28411 
H13692 F25651 AA975454 F24229 F29657 F18024 AA464779 F17588 F34954 AA947328 F18C63 AA657777 AA459644 N91455 
F27850 F29608 F27206 F18418 F31459 F18564 F33496 F16376 F29740 F29843 F29904 F29866 F19135 AW276602 T40337 
F24835 F34672 F26474 AI926215 AA464185 F18217 N26193 AL043256 T41197 F33055 F00386 F29500 F34191 F33297 
AI937207 F22724 F15909 F26232 F18889 AA318627 F29085 AA872104 F17509 F23373 F15660 F17552 F17412 F16863 
F34033 F21515 F17364 F18383 F16546 F17561 F17260 AA292000 F15723 T47438 F16798 F18046 F18319 F17978 F17566 
F34230 F33258 F20860 F17998 AI695701 

118600 genbank_N69222 N69222 

118952 genbank_N92966 N92966 

120873 genbank_AA358015 AA358015 

113702 genbank_T97307 T97307 

129982 221J267 Z14221 AW381862 M97920 AW401444 Z66542 M29470 AW406502 X61011 M34024 AA327072 Z14166 Z14167 Z14165 
AW403806 Z14200 AA383972 Z14205 Z14201 M18513 Z14202 AW403684 X14584 AF062221 U43760 X65892 X65883 X62107 
Z80847 X65885 X65893 AF062142 X65891 X17675 Z47274 Z47277 Z47276 X65888 Z47275 X62109 AF062140 L01278 
AF062134 AF062139 X81723 Z80840 X81733 X81743 X81744 X81732 Z80843 AW402942 AW403516 X65919 AF062190 
AF062177 AF062222 AF062115 Z47240 AF062263 AF062261 AF062223 AF062211 Z47238 AW401714 AW404008 AW404991 
AF062280 M99595 Z47214 Z47232 Z47218 M26995 AF062184 X65895 L38433 X81731 Z11946 Z47226 AF062205 AF174012 
L01276 AF062168 AF062136 X81755 X81748 AF174019 AF062285 L06924 L11699 AW402665 L09070 L28049 L08090 
AW407843 Z14171 AW402944 L08083 AW405627 L33035 L26907 M17750 L28052 M17751 AJ239360 U19885 L14821 X56526 
AB014341 L12087 L12098 U68231 L12184 AF062242 AF022000 U64499 U00570 AI268604 Y15773 X64239 X62969 U00506 
X73605 X99360 U00577 D83677 AB021539 AF035796 Z33899 U00588 AJ239353 AF174062 Z33901 X98899 AF174058 X63080 
D83676 AB021529 Z18318 U00488 L01412 X81746 U21262 U21272 U00560 AF174060 U00547 U00561 Z18321 S73957 
M26435 AF115130 AF115117 Z92896 U21254 Z92895 AF115112 AF062290 T28938 Y09386 AF174067 U27189 Y09384 
U77373 AF174057 M17749 X69692 AF174038 U64478 U64486 AF174098 X99365 AF174077 AF174083 AF174089 AF015133 
AF127792 AJ006171 U00510 U58144 AF004323 AF115109 AF004324 AF015130 AF090414 AF090418 AB021536 AB021519 
Z96957 AF021986 Z92898 Z96956 L34164 AF062251 AF052524 U00549 U00541 D83687 AF052522 AF087420 AF052520 
AF022011 L43085 U00536 L19915 L43083 AF090420 AB021530 AB021534 Z96955 Z96954 Z33903 X62964 AF103282 U00509 
AF062298 AF062289 AF062307 AW408326 L33037 L04337 L04328 X81742 L04336 L03385 AF032360 S56184 AF062191 
AF191092 AI906954 AF103184 L04343 AW364860 AF001424 AF103163 Z98717 AF103143 AJ008250 L04323 AF103321 
AF103335 L04342 L03818 L03817 AW404978 AW403690 U86801 AF103150 AJ010334 AF035027 AJ007327 AF103115 
AF017458 AJ008207 AJ008183 AJ008196 AJ008241 AJ008208 AF103210 AF068668 AF068670 AF068671 AF068664 
AF068669 AF068666 AF068665 AF068672 AW371244 AW403670 AW408074 AW404575 AW362153 AW403803 AW406702 
AW351514 D78345 T29140 J00231 NM_002179 AW405146 AA301091 X04646 H64660 AW402990 AW406534 T93007 
AI857980 AW368899 AI905833 AW406586 AA482084 AI872299 AA715266 AW404328 AI831674 AI709348 AA603112 
AW514864 AA485775 H64492 AW404789 AA487630 AA715498 AA295885 T27613 T98113 

115763 genbank_AA421560 AA421560 

124357 genbank_N22401 N22401 

108733 504187_1 AA121022 AA126422 

101544 entrez_M31169 M31169 

124447 genbank_N48000 N48000 

124677 genbank_R01073 R01073 

124777 genbank_R41933 R41933 

119302 genbank_T25725 T25725 
103680 entrez_Z93784 Z93784 
135029 H58818_at H58818 
112253 genbank_R51818 R51818 
Pkey CAT number Accession 
322175 46877_1 AF085975 H53458 H53459 
323011 139750_1 AA580288 AA315655 AA133031 AA377748 
322975 1510563_1 C16391 C16413 
317210 211994_1 AW881145 AA490718 M85637 AA304575 T06067 AA331991 
323817 233566_1 AA410943 AW948953 AA334202 AA332882 
309583 1046029_-2 AW170035 
324961 376239_1 AA613792 AW182329 T05304 AW858385 
303502 325188_1 BE174240 AA488528 AL042253 
320133 447553_1 BE151746 BE336853 D63271 T94955 AA774994 
311935 174129_1 AA216387 T63548 AA228676 
321050 502195_1 C05928 AW393497 
319977 345248_1 AA534222 AA632632 T81234 
312772 4380_7 AW962299 AA310349 AW962294 H63791 H63751 
321354 116028_-2 AA078493 

336512 CH22_3941FG_834_7_LINK_DJ 
336558 CH22_3992FG_842XUNK£U 
336560 CH22_3994FG_842_5_LINK_DJ 
329367 c_X_hs 
329373 c_X_hs 
336676 CH22_4154FG_43_4 
338008 CH22_6490FG__LINK_EM:AC00 
338057 CH22_6558FG__LINK_EM:AC00 
329655 c14_p2 
336959 CH22_4764FG_367J3_ 
329899 c15_p2 
329960 c16_p2 
338410 CH22_7067FG__LINK_EM:AC00 
338451 CH22_7124FG__LINK_EM:AC00 
338588 CH22_7331FG__LINK_EM:AC00 
338665 CH22_7438FG__LINK_EM:AC00 
338689 CH22_7464FG__LINK_EM:AC00 
308677 AI761173 
338832 CH22_7678FG__LINK_DJ246D7 
338980 CH22_7859FG__LINK_DA59H18 
333009 CH22_233FG_61_1_LINK_EMA 
333010 CH22_234FG_61_2_LINK_EMA 
333013 CH22_237FG_61_5_LINK_EMA 
308981 AI873242 
308995 AI880172 
333108 CH22_336FG_79_14_LINK_EM: 
333139 CH22_368FG_83_16_LINK_EM: 
333254 CH22_495FG_118JLLINK_EM: 
333305 CH22_550FG_137_2_LINK_EM: 
333343 CH22_589FG_139J2_LINK_EM 
333388 CH22_634FG_144_3_LINK_EM: 
326213 c17_hs 
333456 CH22_706FG_157_5_LINK_EM: 
333459 CH22_709FG_157_8_LINK_EM: 
333517 CH22_773FG_173_2_LINK_EM: 
333585 CH22_846FG_203_4_LINK_EM: 
333679 CH22_941FG_247_6_LINK_EM: 
326474 c19_hs 
333743 CH22_1009FG_264_1_LINK_EM 
333758 CH22_1024FG_268J_LINK_EM 
333767 CH22_1034FG_271_6_LINK_EM 
333768 CH22_1035FG_271_7_LINK_EM 
333769 CH22_1036FG_271_8_LINK_EM 
333795 CH22_1063FG_275_1_LINK_EM 
333796 CH22_1065FG_275_3_LINK_EM 
335004 CH22_2326FG_472_8_LINK_EM 
333892 CH22_1163FG_292_14_LINK_E 
335115 CH22_2447FG_496_2_LINK_EM 
333904 CH22_1176FG_294_2_LINK_EM 
333905 CH22_1177FG_294_3_LINK_EM 
333921 CH22_1194FG_296J2_LINK_E 
333958 CH22_1245FG_307_4_LINK_EM 
328059 c_6_hs 
335287 CH22_2629FG_526_11_LINK_E 
326816 c20_hs 
326817 c20_hs 
335342 CH22_2689FG_536_1_LINK_EM 
335491 CH22_2843FG_570_23_LINK_E 
335495 CH22_2847FG_570_28_LINK_E 
335498 CH22_2850FG_571_7_LINK_EM 
328304 c_7_hs 
305453 AA738110 
335544 CH22_2899FG_576_5_LINK_EM 
335610 CH22_2969FG_583_4_LINK_EM 
335653 CH22_3013FG_590_4_LINK_EM 
335682 CH22_3043FG_595J_LINK_EM 
335687 CH22_3048FG_596J_LINK_EM 
328492 c_7_hs 
335755 CH22_3122FG_604_4_LINK_EM 
335782 CH22_3151FG_609_4_LINK_EM 
335791 CH22_3160FG_611_7_LINK_EM 
335809 CH22_3181FG_617_6_LINK_EM 
335822 CH22_3195FG_619_7_LINK_EM 
335823 CH22_3196FG_619_8_LINK_EM 
335824 CH22_3197FG_619_11_LINK_E 
335825 CH22_3198FG_619_12_LINK_E 
335895 CH22_3272FG_635_3_LINK_EM 
335917 CH22_3294FG_636_13_LINK_E 
335920 CH22_3297FG_636_16_LINK_E 
305898 AA872838 
305913 AA876109 
305950 AA884479 
328857 c_7_hs 
330084 c19_p2 
337968 CH22_6419FG__LINK_EM:AC00 
309177 AI951118 
309198 AI955915 
309226 AI969897 
339352 CH22J317FG__LINK_BA354I1 
309279 AI990102 
339373 CH22J34«FG__LINK_BA232E1 
325622 c14_hs 
334102 CH22_1380FG_327_60_LINK_ 
332927 CH22_148FG_38_1_LINK_C20H 
332929 CH22_150FG_38_3_LINK_C20H 
332930 CH22_151FG_38_4_LINK_C20H 
332955 CH22_179FG_48_12_LINK_EM: 
332958 CH22_182FG_48_15_LINK_EM: 
332961 CH22_185FG_48_18_LINK_EM: 
332983 CH22_207FG_54_5_LINK_EMA 
334222 CH22_1506FG_360_3_LINK_EM 
334223 CH22_1507FG_360_4_LINK_EM 
334264 CH22_1551FG_367_15_LINK_E 
327110 c21_hs 
334343 CH22_1636FG_375_25_LINK_E 
334360 CH22_1654FG_378_5_LINK_EM 
327196 c_1_hs 
327283 c_1_hs 
327313 c_1_hs 
304465 AA421948 
304507 AA456426 
327450 c_2_hs 
304591 AA505702 
304601 AA507875 
304659 AA533185 
334784 CH22_2096FG_432_9_LINK_EM 
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334789 CH22J2101FGJ32J4_UNKj 

334794 CH22J106FGJ34JLUNKJM 

336035 CH22J420FG_678_6JJNKJXI 

336042 CH22.3427FG 679_4_UNKJ>J 

336093 CH22JM81FG 691JJJNK.DJ 

336096 CH22 3484FG 691 JJJNKJX) 

334889 CH22.2206FG 452_3JJNK_EM 

336150 CH22_3540FGJ06J_UNKJ)A 

336152 CH22_3543FGJ06_9_UNKJ>A 

336416 CH22_3833FG 823J8_UNK_B 

336444 CH22.3864FGJ27J0J.INK.D 

336449 CH22J870FG 829_6JJNKJW 

336471 CH22_3894FG_829J0JLINKJ> 
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TABLE 13B 



Table 13B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 13. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

332955 Dunham, I. et al. Plus 2508896-2508992
332958 Dunham, I. et al. Plus 2516164-2516310
332961 Dunham, I. et al. Plus 2521424-2521555
333139 Dunham, I. et al. Plus 3369495-3369571
333254 Dunham, I. et al. Plus 2521424-2521555
333305 Dunham, I. et al. Plus 4630388-4630645
333388 Dunham, I. et al. Plus 4913749-4913805
333517 Dunham, I. et al. Plus 5570729-5570925
333585 Dunham, I. et al. Plus 6234778-6234894
333679 Dunham, I. et al. Plus 7068795-7068896
333767 Dunham, I. et al. Plus 7694407-7694623
333768 Dunham, I. et al. Plus 7695440-7695697
333769 Dunham, I. et al. Plus 7696625-7696707
333795 Dunham, I. et al. Plus 7807688-7807795
333796 Dunham, I. et al. Plus 7808253-7808319
333892 Dunham, I. et al. Plus 8156825-8157001
333921 Dunham, I. et al. Plus 8380325-8380441
333968 Dunham, I. et al. Plus 8681004-8681241
334102 Dunham, I. et al. Plus 9995140-9996373
334264 Dunham, I. et al. Plus 13234447-13234544
334343 Dunham, I. et al. Plus 13655828-13656307
334794 Dunham, I. et al. Plus 16374312-16374458
334889 Dunham, I. et al. Plus 19286024-19286515
335287 Dunham, I. et al. Plus 22299047-22299299
335491 Dunham, I. et al. Plus 24128651-24128827
335495 Dunham, I. et al. Plus 24140688-24140872
335498 Dunham, I. et al. Plus 24172082-24172161
335653 Dunham, I. et al. Plus 25329710-25329802
335687 Dunham, I. et al. Plus 25445952-25446064
335809 Dunham, I. et al. Plus 26310772-26310909
335822 Dunham, I. et al. Plus 26364087-26364196
335823 Dunham, I. et al. Plus 26365925-26366004
335824 Dunham, I. et al. Plus 26376860-26376942
335825 Dunham, I. et al. Plus 26378175-26378268
336035 Dunham, I. et al. Plus 29016748-29017410
336093 Dunham, I. et al. Plus 29556922-29557002
336096 Dunham, I. et al. Plus 29578878-29579047
336444 Dunham, I. et al. Plus 34190585-34190718
336959 Dunham, I. et al. Plus 13233040-13233126
338008 Dunham, I. et al. Plus 7697068-7697236
338057 Dunham, I. et al. Plus 8526397-8526522
338410 Dunham, I. et al. Plus 19292807-19292916
338568 Dunham, I. et al. Plus 22896767-22896920
338665 Dunham, I. et al. Plus 24472654-24472853
338832 Dunham, I. et al. Plus 27775128-27775290
338980 Dunham, I. et al. Plus 29896789-29896874
339352 Dunham, I. et al. Plus 33544784-33545121
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332929 


Dunham, 


I. BiaL 


Mini w 

Minus 


. 2U2U/&^2Ua*JoM 


332930 Dunham, I. et al. Minus 2022555-2022497
332983 Dunham, I. et al. Minus 2631933-2631797
333009 Dunham, I. et al. Minus 2766043-2765856
333010 Dunham, I. et al. Minus 2766207-2766119
333013 Dunham, I. et al. Minus 2772278-2772039
333108 Dunham, I. et al. Minus 3240494-3240389
333343 Dunham, I. et al. Minus 4692886-4692753
333456 Dunham, I. et al. Minus 2631933-2631797
333459 Dunham, I. et al. Minus 5144548-5144344
333743 Dunham, I. et al. Minus 7573216-7573060
333758 Dunham, I. et al. Minus 7666413-7666091
333904 Dunham, I. et al. Minus 8217374-8217261
333905 Dunham, I. et al. Minus 8217796-8217670
334222 Dunham, I. et al. Minus 12732417-12732289
334223 Dunham, I. et al. Minus 12734365-12734269
334360 Dunham, I. et al. Minus 13728850-13728751
334784 Dunham, I. et al. Minus 16294548-16294360
334789 Dunham, I. et al. Minus 16306095-16305996
335004 Dunham, I. et al. Minus 20581911-20581794
335115 Dunham, I. et al. Minus 21388250-21388146
335342 Dunham, I. et al. Minus 22597448-22597284
335544 Dunham, I. et al. Minus 24650505-24650403
335610 Dunham, I. et al. Minus 25068943-25068841
335682 Dunham, I. et al. Minus 25421215-25421093
335755 Dunham, I. et al. Minus 25763806-25763747
335782 Dunham, I. et al. Minus 25908578-25908440
335791 Dunham, I. et al. Minus 25948563-25948411
335895 Dunham, I. et al. Minus 26975307-26975239
335917 Dunham, I. et al. Minus 27028481-27028377
335920 Dunham, I. et al. Minus 27034927-27034811
336042 Dunham, I. et al. Minus 29041694-29041500
336150 Dunham, I. et al. Minus 30150423-30150256
336152 Dunham, I. et al. Minus 30156053-30155870
336416 Dunham, I. et al. Minus 34047408-34047311
336449 Dunham, I. et al. Minus 34204707-34204577
336471 Dunham, I. et al. Minus 34215091-34214976
336512 Dunham, I. et al. Minus 34278373-34278275
336558 Dunham, I. et al. Minus 34375825-34375698
336560 Dunham, I. et al. Minus 34376814-34376596
336676 Dunham, I. et al. Minus 2022565-2022497
337968 Dunham, I. et al. Minus 7095797-7095680
338451 Dunham, I. et al. Minus 20174286-20174193
338689 Dunham, I. et al. Minus 24893073-24692972
339373 Dunham, I. et al. Minus 33860127-33860047
325622 5867000 Plus 69994-70075
329655 6448516 Minus 35565-35843
329899 6563505 Minus 111058-111783
329960 5091594 Minus 1031-1162
326213 5867224 Minus 60751-60927
326474 5867405 Plus 16995-18101
330084 6015302 Minus 57019-59337
326816 6552458 Plus 198354-198436
326817 6552458 Plus 199909-200001
327110 6117842 Plus 94608-84785
327196 5867446 Plus 180921-181333
327283 5867478 Minus 567-962
327313 5867501 Minus 89734-89838
327450 5867766 Minus 47928-48076
328059 6117819 Plus 37052-37204
328492 5868455 Minus 46094-46241
328304 6004478 Minus 3884-3952
328857 6381927 Minus 80557-81051
329367 5868842 Minus 87201-87587
329373 6682537 Minus 38950-39301



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TABLE 14: Table 2 from BRCA 001-5 US 

Table 14, a subset of table 13, depicts a preferred group of genes highly upregulated in breast 
cancer cells. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
R1: Ratio of tumor to normal breast tissue
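
The meaning of the R1 column can be illustrated with a short sketch. It assumes R1 is a simple quotient of a tumor expression measurement over a normal-tissue measurement; the function names, the sample signal values, and the 15-fold cutoff are hypothetical choices for illustration and are not specified by the table.

```python
def tumor_to_normal_ratio(tumor_signal: float, normal_signal: float) -> float:
    """Return tumor expression divided by normal expression (the R1 idea)."""
    if normal_signal <= 0:
        raise ValueError("normal_signal must be positive")
    return tumor_signal / normal_signal


def select_highly_upregulated(ratios: dict, cutoff: float = 15.0) -> list:
    """Return the pkeys whose ratio meets the cutoff, in ascending pkey order.

    The default cutoff is an assumption made for this sketch only.
    """
    return sorted(pkey for pkey, ratio in ratios.items() if ratio >= cutoff)


# Hypothetical (tumor, normal) signal pairs keyed by made-up probeset keys.
signals = {1: (300.0, 10.0), 2: (45.0, 15.0), 3: (160.0, 10.0)}
ratios = {pkey: tumor_to_normal_ratio(t, n) for pkey, (t, n) in signals.items()}
selected = select_highly_upregulated(ratios)
print(selected)
```

A subset table such as Table 14 would then correspond to the probesets returned by a filter of this kind applied to the larger table.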



Pkey ExAccn UniGeneID Unigene Title R1

100038 M97935 AFFX control: STAT1 16.7
100114 D00596 Hs.82962 thymidylate synthetase 15.9
100975 J02923 Hs.76508 lymphocyte cytosolic protein 1 (L-plastin) 30.1
101031 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B; 9 37.2
101104 L07615 Hs.169266 Human neuropeptide Y receptor Y1 (NPYY 18.3
101143 L12723 Hs.90093 heat shock 70kD protein 4 17.4
101332 L47276 Hs.156346 Homo sapiens (cell line HL-6) alpha topois 18.9
101378 M13755 Hs.833 interferon-stimulated protein; 15 kDa 18.1
101809 M86849 Hs.323733 Homo sapiens connexin 26 (GJB2) mRNA 22.5
102618 U65932 Hs.81071 extracellular matrix protein 1 23.2
102721 U79241 Hs.118666 Human clone 23759 mRNA; partial cds 15
102817 U90904 Hs.83724 Human clone 23773 mRNA sequence 15.2
102907 X06985 Hs.202833 heme oxygenase (decycling) 1 22.7
102985 X17644 Hs.2707 G1 to S phase transition 1 20.6
103060 X57766 Hs.155324 matrix metalloproteinase 11 (stromelysin 3 17.8
103180 X69433 Hs.5337 isocitrate dehydrogenase 2 (NADP+); mito 18.9
103208 X72755 Hs.77367 monokine induced by gamma interferon 15.1
103821 AA157623 Hs.198793 KIAA0750 gene product 23.3
104115 AA428090 Hs.26102 ESTs 28.7
104667 AA007234 Hs.30098 ESTs 16.6
105186 AA191512 Hs.28005 Homo sapiens mRNA; cDNA DKFZp564G 19.3
106103 AA421104 Hs.12094 ESTs 15.4
107151 AA621169 Hs.8687 ESTs 19
109415 AA227219 Hs.110826 trinucleotide repeat containing 9 20.1
110189 H20543 Hs.6278 DKFZP586B1621 protein 16.6
110561 H59617 Hs.5199 ESTs; Weakly similar to UBIQUITIN-CON 19.5
110734 H98714 Hs.24131 ESTs 30.2
110915 N46252 Hs.29724 ESTs 23.2
111179 N67239 Hs.10760 ESTs 37
111357 N91023 Hs.87128 ESTs 15
112134 R46025 Hs.7413 ESTs 17.4
113970 W86748 Hs.8109 ESTs 15
114124 Z38595 Hs.125019 ESTs; Highly similar to KIAA0886 protein 22
114292 Z40715 Hs.184641 delta-6 fatty acid desaturase 19.4
114901 AA236276 Hs.196437 ESTs; Weakly similar to R26660_1; partial 16.9
114965 AA250737 Hs.72472 ESTs 35.1
115652 AA405098 Hs.38178 ESTs 16.1
115875 AA433943 Hs.43946 ESTs; Weakly similar to Weak similarity t 33.5
116790 H29532 Hs.101174 microtubule-associated protein tau 711
116921 H72948 Hs.821 biglycan 20.7
117412 N26722 Hs.42645 ESTs 18.1
120241 Z41815 Hs.65946 ESTs 15.6
120325 AA195651 Hs.104106 ESTs 152
121596 AA416740 Hs.174104 ESTs 22.6
123619 AA609200 ESTs 23.1
124006 D60302 Hs.270016 ESTs 20.6
125852 H09290 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B 25.9
126160 N90960 Hs.265398 ESTs; Weakly similar to transformation-rel 16.4
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15 



127677 AA916752 Hs.264190 ESTs; Highly similar to MEM3 [M.muscul 17.3
128595 U31875 Hs.152677 short-chain alcohol dehydrogenase family m 27.1
128717 T30617 Hs.104222 Homo sapiens mRNA; cDNA DKFZp566L 24.5
129124 AA234530 Hs.108802 N-ethylmaleimide-sensitive factor 20.7
129366 H18027 Hs.184697 plexin C1
130455 X17059 Hs.155956 N-acetyltransferase 1 (arylamine N-acetylt 26.4
130604 X03635 Hs.1657 estrogen receptor 1 39.9
130913 W03592 Hs.21198 translocase of outer mitochondrial membra 20.9
130944 M97935 Hs.21486 signal transducer and activator of transcript 18.8
131472 AA608962 Hs.27258 calcyclin binding protein 18.1
131562 U90551 Hs.28777 H2A histone family; member L 18.8
132180 AA405569 Hs.418 fibroblast activation protein; alpha; seprase 15.4
132406 F09979 Hs.4774 ESTs 15
132465 AA047896 Hs.49169 ESTs 15.4
132994 AA505133 Hs.279905 solute carrier family 2 (facilitated glucose t 26.4
133294 R79723 Hs.69997 zinc finger protein 238 30.4
133634 U24166 Hs.234279 microtubule-associated protein; RP/EB fam 15.2
134374 D62633 Hs.8236 ESTs 15.2
134405 J04177 Hs.82772 collagen; type XI; alpha 1 15.3
134470 X54942 Hs.83758 CDC28 protein kinase 2 20.3
134495 D63477 Hs.84087 KIAA0143 protein 16.1
134714 U89922 Hs.890 lymphotoxin beta (TNF superfamily; memb 35.7
135237 AA454930 Hs.9691 ESTs 19.5
301884 AA312082 Hs.105445 GDNF family receptor alpha 1 20.7
302276 NM_004448 Hs.323910 EST cluster (not in UniGene) with exon hit 21.6
302290 AL117607 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N 41.4
309177 AI951118 EST singleton (not in UniGene) with exon 24.3
309583 AW170035 EST 64.5
310438 AW022192 Hs.200197 ESTs 39.1
311166 AI821294 Hs.118599 ESTs 24.1
312153 AA759250 Hs.153028 cytochrome b-561 27.1
313915 AI969390 Hs.163443 ESTs 27.1
314506 AA833655 Hs.206868 ESTs 27.8
314558 AI873274 Hs.190721 ESTs 22.5
314691 AW207206 Hs.136319 ESTs 21.4
314943 AI476797 Hs.184572 cell division cycle 2; G1 to S and G2 to M 18.4
315196 AA972756 Hs.44898 ESTs 28.8
316177 AI908272 Hs.293102 EST cluster (not in UniGene) 32.6
318073 AW167087 Hs.131562 ESTs 157
318662 AI285898 Hs.294014 ESTs 16.3
318740 NM_002543 Hs.77729 EST cluster (not in UniGene) 21.3
318744 AI793124 Hs.144479 ESTs 35
319668 NM_002731 Hs.87773 EST cluster (not in UniGene) 25.4
320074 AA321166 Hs.278233 EST cluster (not in UniGene) 16.7
320211 AL039402 Hs.125783 DEME-6 protein 24.3
320727 U96044 Hs.181125 EST cluster (not in UniGene) 153
322818 AW043782 Hs.293616 ESTs 21
322882 AW248508 Hs.279727 DiGeorge syndrome critical region gene 2 15.3
324261 AL044891 Hs.269350 EST cluster (not in UniGene) 50.1
324432 AA464510 Hs.152812 EST cluster (not in UniGene) 16.7
324603 AW016378 Hs.292934 ESTs 23.1
324620 AA448021 Hs.94109 EST cluster (not in UniGene) 212
324988 T06997 Hs.121028 EST cluster (not in UniGene) 24.5
330388 X03363 HER2 receptor tyrosine kinase (c-erbB-2; E 17.7
330486 M13755 Hs.833 interferon-stimulated protein; 15 kDa 67
330814 AA015730 Hs.265398 ESTs; Weakly similar to transformation-rel 44.1
331145 R72427 Hs.129873 ESTs; Weakly similar to CYTOCHROME 41.9
331306 AA252079 Hs.63931 dachshund (Drosophila) homolog 15.1
331890 AA432166 Hs.3577 succinate dehydrogenase complex; subunit 243
332526 AA281753 Hs.77515 inositol 1;4;5-triphosphate receptor; type 3 19
332532 N63192 Hs.1892 EST; Highly similar to PHENYLETHANO 153
332694 AA262768 Hs.243901 KIAA1067 protein 15.2
332958 CH22_FGENES.48_15 17.8
333769 CH22_FGENES.271_8 483
333968 CH22_FGENES.307_4 15.9
334223 CH22_FGENES.360_4 33.5
334264 CH22_FGENES.367_15 183
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M 



335791 CH22_FGENES.611_7 27.3
336512 CH22_FGENES.834_7 21.4
338008 CH22_EM:AC005500.GENSCAN.127-9 15.2
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TABLE 14A 


Table 14A shows the accession numbers for those pkeys lacking UniGene IDs for Table 14.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accession

309583 1046029_-2 AW170035
336512 CH22_3941FG_834_7_LINK_DJ
338008 CH22_6490FG_LINK_EM:AC00
333769 CH22_1036FG_271_8_LINK_EM
333968 CH22_1245FG_307_4_LINK_EM
335791 CH22_3160FG_611_7_LINK_EM
309177 AI951118
332958 CH22_182FG_48_15_LINK_EM:
334223 CH22_1507FG_360_4_LINK_EM
334264 CH22_1551FG_367_15_LINK_E
123619 371681_1 AA602964 AA



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TABLE 14B 

Table 14B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 14. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey Ref Strand Nt_position

332958 Dunham, I. et al. Plus 2516164-2516310
333769 Dunham, I. et al. Plus 7696625-7696707
333968 Dunham, I. et al. Plus 8681004-8681241
334264 Dunham, I. et al. Plus 13234447-13234544
338008 Dunham, I. et al. Plus 7697068-7697236
334223 Dunham, I. et al. Minus 12734365-12734269
335791 Dunham, I. et al. Minus 25948563-25948411
336512 Dunham, I. et al. Minus 34278373-34278275
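
As an illustration of how rows of this kind might be read programmatically, the following is a minimal sketch. It assumes whitespace-separated rows whose last two fields are the strand and a hyphenated coordinate pair, and it assumes the coordinates are inclusive; the names `PredictedExon` and `parse_exon_row` are hypothetical and do not come from the table itself.

```python
from typing import NamedTuple


class PredictedExon(NamedTuple):
    pkey: int
    ref: str
    strand: str
    first: int   # first coordinate as listed in the row
    second: int  # second coordinate as listed in the row

    @property
    def length(self) -> int:
        # Minus-strand rows list the larger coordinate first, so take the
        # absolute difference; +1 assumes inclusive coordinates.
        return abs(self.second - self.first) + 1


def parse_exon_row(row: str) -> PredictedExon:
    """Split one 'Pkey Ref Strand Nt_position' row into its fields."""
    fields = row.split()
    first_text, second_text = fields[-1].split("-")
    return PredictedExon(
        pkey=int(fields[0]),
        ref=" ".join(fields[1:-2]),
        strand=fields[-2],
        first=int(first_text),
        second=int(second_text),
    )


plus_exon = parse_exon_row("332958 Dunham, I. et al. Plus 2516164-2516310")
minus_exon = parse_exon_row("334223 Dunham, I. et al. Minus 12734365-12734269")
print(plus_exon.strand, plus_exon.length)
print(minus_exon.strand, minus_exon.length)
```

The same parser handles rows whose Ref field is a numeric GI identifier, since everything between the pkey and the strand is kept as the reference string.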
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TABLE 15: Table 3 from BRCA 001-5 US 



Table 15 shows genes downregulated in breast cancer cells. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor



Pkey ExAccn UniGene ID Unigene Title R1

100115 D00632 Hs.172153 glutathione peroxidase 3 (plasma) 1.7
100499 TIGR:HT1428 Hs.283108 Globin, Beta 1.5
100502 TIGR:HT1496 Hs.169228 Adrenal-Specific Protein Pg2 2.3
100815 TIGR:HT4268 Hs.9739 L-Glycerol-3-Phosphate:Nad+ Oxidoreduct 1.7
101125 L10373 Hs.82749 transmembrane 4 superfamily member 2 1.5
101367 M12963 Hs.4 alcohol dehydrogenase 1 (class I); alpha po 2.9
101397 M15856 Hs.180878 lipoprotein lipase 1.6
101883 M98399 Hs.75613 CD36 antigen (collagen type I receptor, thr 1.6
102227 U25138 Hs.93841 potassium large conductance calcium-activ 1.6
102857 X00129 Hs.76461 retinol-binding protein 4; interstitial 3
103211 X73079 Hs.288579 polymeric immunoglobulin receptor 1.8
103496 Y09267 Hs.132821 flavin containing monooxygenase 2 1.5
103562 Z21966 Hs.2815 POU domain; class 6; transcription factor 1 1.8
104672 AA007629 glycerol-3-phosphate dehydrogenase 1 (sol 2.4
105083 AA146619 Hs.18791 ESTs; Weakly similar to CALCIUM-BIND 1.7
105138 AA164519 Hs.15248 ESTs 1.5
106075 AA417915 Hs.25930 ESTs 1.5
106870 AA487576 Hs.26530 serum deprivation response (phosphatidyls 1.6
107099 AA609645 Hs.211568 eukaryotic translation initiation factor 4 gam 2.7
107616 AA004901 Hs.261164 ESTs 1.6
107997 AA037388 Hs.82223 Human DNA sequence from clone 141H5 o 1.7
108604 AA099820 Hs.49696 ESTs 2.4
111130 N64265 Hs.19515 yz44h12.s1 Morton Fetal Cochlea Homo sa 1.7
111837 R36447 Hs.24453 ESTs 1.6
112538 R70255 ESTs 1.9
112808 R97970 Hs.281022 EST 1.5
113086 T40652 Hs.209100 DKFZP434C171 protein 1.9
115740 AA418033 Hs.283559 ESTs 1.6
115949 AA443800 Hs.43125 ESTs 2
115965 AA446661 Hs.173233 ESTs 2.2
117224 N20300 Hs.218707 ESTs 1.7
117513 N32174 Hs.44317 SRY (sex-determining region Y)-box 10 1.7
119059 R15436 Hs.77889 Friedreich ataxia region gene X123 1.7
119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activate 2.8
119359 T71021 Hs.285681 ESTs; Highly similar to WS basic-helix-loo 1.9
119798 W73386 Hs.249129 ESTs 3
120889 AA365784 Hs.97044 ESTs 1.6
121381 AA405747 Hs.97984 ESTs; Weakly similar to WASP-family pro 1.8
121750 AA421184 Hs.97549 ESTs 1.5
122127 AA434447 Hs.106771 ESTs 2.5
122348 AA443695 Hs.293410 ESTs 2.1
122485 AA448300 Hs.160318 phospholemman 1.5
123443 AA598841 Hs.167382 natriuretic peptide receptor A/guanylate cy 1.8
123505 AA600135 ESTs; Moderately similar to !!!! ALU SUB 1.5
125284 W94688 Hs.103253 perilipin 1.7
126300 D81972 HUM427D08B Human fetal brain (TFujiw 1J
126747 R72515 Hs.160318 phospholemman 1.6
127218 AA309765 Hs.116017 ESTs; Weakly similar to KIAA0795 protei 1.5
127357 AA452788 Hs.75432 zx39g11.r1 Soares_total_fetus_Nb2HF8_ 1.7
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127638 AA634406 Hs.122608 ESTs 1.5 

128213 AA972780 Hs.129194 ESTs; Weakly similar to !!!! ALU SUBFA 1.5
128351 AI092391 Hs.134886 ESTs 1.5
128842 N44757 Hs.20340 ESTs 1.6
128870 R71403 Hs.75309 eukaryotic translation elongation factor 2 1.7
129146 AA459944 Hs.108924 DKFZP586P1422 protein 1.5
129285 T62068 Hs.11006 ESTs 2.1
129331 N93465 Hs.279772 ESTs; Highly similar to CGI-38 protein [H 1.5
130085 M52402 Hs.274313 insulin-like growth factor binding protein 6 1.7
130400 M25079 Hs.283108 hemoglobin; beta 1.7
131267 AA211776 Hs.2504 myomesin 1 (skelemin) (185kD) 3.8
131277 AA131466 Hs.23767 ESTs 1.9
131282 M12272 Hs.4 alcohol dehydrogenase 3 (class I); gamma p 22
131304 AA295848 Hs.25475 aquaporin 7 1.7
131810 D49487 Hs.194236 leptin (murine obesity homolog) 2.5
132788 AA045503 Hs.56874 ESTs; Weakly similar to Homo sapiens p2 1.6
132931 Z41452 Hs.6090 deleted in bladder cancer chromosome regi 1.5
133120 X64559 Hs.65424 tetranectin (plasminogen-binding protein) 2
133314 U95367 Hs.70725 gamma-aminobutyric acid (GABA) A recep 1.5
133507 X74295 Hs.74369 integrin; alpha 7 1.7
133601 S95936 Hs.284176 transferrin 2.3
133702 N56898 Hs.75652 glutathione S-transferase M5 1.9
134111 N79674 Hs.8022 TU3A protein 4.6
134699 U56814 Hs.88646 deoxyribonuclease I-like 3 1.5
134749 L10955 Hs.89485 carbonic anhydrase IV 1.6
135173 M72885 Hs.95910 Human G0S2 protein gene; complete cds 1.9
300132 AW027556 Hs.156286 ESTs 1.7
300732 AI369956 Hs.257891 ESTs 1.5
300750 AA514805 Hs.293055 ESTs 1.8
301140 AI807692 Hs.129129 ESTs 1.6
301396 AA923549 Hs.224121 ESTs 2.1
302910 N77976 Hs.251577 hemoglobin; alpha 1 1.8
303798 V00505 Hs.36977 hemoglobin; delta 1.6
303831 T04868 Hs.46780 EST cluster (not in UniGene) with exon hit 1.7
303844 U94362 Hs.58589 glycogenin 2 1.5
304182 H91086 EST singleton (not in UniGene) with exon 1.5
304622 AA516384 EST singleton (not in UniGene) with exon 1.5
304662 AA550994 EST singleton (not in UniGene) with exon 1.7
305612 AA782347 Hs.272572 EST singleton (not in UniGene) with exon 1.5
306193 AA923457 EST singleton (not in UniGene) with exon 1.5
307206 AI192534 EST singleton (not in UniGene) with exon 1.6
307377 AI222691 EST singleton (not in UniGene) with exon 1.5
308023 AI452732 Hs.251577 EST singleton (not in UniGene) with exon 1.9
308359 AI612774 Hs.79372 retinoid X receptor; beta 1.5
309838 AW296073 Hs.255504 EST 1.5
310403 AI720978 Hs.148006 ESTs; Moderately similar to alternatively s 1.8
311671 AW241947 Hs.232478 ESTs 1.6
311794 AW238092 Hs.254759 ESTs 2.1
312082 T79860 Hs.118180 ESTs 1.9
312575 H25237 Hs.306814 ESTs 2.3
313076 N49684 Hs.143040 ESTs 1.8
313283 W32480 Hs.157099 ESTs 2.2
313374 AW328672 Hs.132760 ESTs 1.9
314701 AI754634 Hs.131987 ESTs 1.7
315391 AA759098 Hs.192007 ESTs 1.8
315688 AA680055 Hs.264885 ESTs 1.5
316249 AA948612 Hs.130414 ESTs 1.6
316586 AI205077 Hs.294085 ESTs 1.7
316890 AA837079 Hs.24647 ESTs 1.5
316983 AI480204 Hs.177131 ESTs 1.5
317604 AI650625 Hs.300756 ESTs 1.6
317951 AW206520 Hs.129621 ESTs 1.5
319400 W26902 Hs.154085 ESTs 1.7
320757 H22654 Hs.6382 EST cluster (not in UniGene) 1.5
321594 AA021402 Hs.11067 ESTs 1.7
322102 H45589 EST cluster (not in UniGene) 1.5
322814 AI824495 Hs^11038 ESTs 2.2
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TABLE ISA 



Table 15A shows the accession numbers for those pkeys lacking UniGene IDs for Table 15.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey CAT number Accession

126300 250375_2 D81972 BE003132
112538 504579_1 AA908813 R70255
123505 genbank_AA600135 AA600135
104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33652 R47898 AI264177 F22289 N28263
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386
322102 46708_1 H45589 H19807 AF075038 H19808 H42437

336865 CH22J590FG_305J_ 
338192 CH22_6755FG_UNK_EMAC00 
329733 c14j>2 
326120 C17J1S 
30 326139 d7Jis 
326855 c20Jis 

335352 CH22_2699FG_539_5_L!NK_EM 

335639 CH22_^999FG_584J9JJNK_E 

307206 AI192534 
35 307377 A1222691 

337494 CH22»5727FG_799J2_ 

337764 CH22L_6115FG_LINI^EM:AC0O . 
. 337983 CH2^6438FG_UNK.EMAC00 

339366 CH22 8336FG_JJNK_BA354i1 
40 325272 clljis 

325558 c12Jw 

325656 c14_hs 

334175 CH2^1455FG.349_10_UNKJ 
304182 H91086 
45 334347 CH2^1640FG_375_31J-INK_E 
327438 cJLhs 
304622 AA516384 

334737 CH22J049FG_424J2_UNK_E 
304682 AA550994 
. 50 336244 CH2^3642FG_746JULINieDA 
306193 AA923457 

336336 CH22J746FGJ14JJJNK.BA 
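
The relationship between a pkey, its gene cluster number, and the member accessions can be sketched as a simple lookup. This is only an illustration under stated assumptions: it assumes each cluster occupies one whitespace-separated line with the pkey first, the CAT number second, and any remaining fields as accessions; the function name `parse_cluster_rows` is hypothetical. Rows that carry only a single identifier after the pkey would be stored with that identifier as the CAT number and an empty accession list.

```python
def parse_cluster_rows(lines):
    """Map each pkey to (CAT number, list of Genbank accessions)."""
    clusters = {}
    for line in lines:
        fields = line.split()
        # Skip blank lines and anything that does not start with a numeric pkey.
        if len(fields) < 2 or not fields[0].isdigit():
            continue
        pkey, cat_number, *accessions = fields
        clusters[int(pkey)] = (cat_number, accessions)
    return clusters


rows = [
    "Pkey CAT number Accession",
    "126300 250375_2 D81972 BE003132",
    "322102 46708_1 H45589 H19807 AF075038 H19808 H42437",
]
clusters = parse_cluster_rows(rows)
cat_number, accessions = clusters[126300]
print(cat_number, accessions)
```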



WO 02/059377

PCT/US02/02242



TABLE 15B 



Table 15B shows the genomic positioning for those pkeys lacking UniGene IDs and
accession numbers in Table 15. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.



Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position



334347 Dunham, I. et al. Plus 13663814-13663926
334737 Dunham, I. et al. Plus 15998517-15998685
335639 Dunham, I. et al. Plus 25173591-25173696
337494 Dunham, I. et al. Plus 33339024-33339148
334175 Dunham, I. et al. Minus 11666659-11668597
335352 Dunham, I. et al. Minus 22681512-22681384
336244 Dunham, I. et al. Minus 31402729-31402583
336336 Dunham, I. et al. Minus 33797209-33797076
336865 Dunham, I. et al. Minus 8622406-8622289
337764 Dunham, I. et al. Minus 4035640-4035446
337983 Dunham, I. et al. Minus 7275495-7275271
338192 Dunham, I. et al. Minus 13248453-13248277
339366 Dunham, I. et al. Minus 33647431-33647293
325272 5866902 Minus 13247-13312
325558 6056302 Plus 70930-71030
325656 6056305 Minus 78190-78707
329733 6065783 Plus 163237-163450
326120 5867194 Plus 36116-36276
326139 5867203 Minus 218901-218960
326855 6552460 Minus 111390-111463
327438 6004454 Minus 199569-199692
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TABLE 16: Table 4 from BRCA 001-5 US 


Table 16, a subset of table 15, depicts a preferred group of genes highly downregulated in 
breast cancer cells. 



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of normal breast tissue to tumor




Pkey ExAccn UniGene ID Unigene Title

100502 TIGR:HT1498 Hs.169228 Adrenal-Specific Protein Pg2
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Hs.4 


alcohol dehydrogenase 1 (class I); alpha 


102857 X00129 Hs.76461 retinol-binding protein 4; interstitial
104672 AA007629 glycerol-3-phosphate dehydrogenase 1
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Ks.211568 


eukaryotic translation initiation factor 4 gam

108604 AA099820 Hs.49696 ESTs
115949 AA443800 Hs.43125 ESTs
115965 AA446661 Hs.173233 ESTs
119175 R71792 Hs.301002 ESTs; Weakly similar to cell death activator
119798 W73386 Hs.249129 ESTs
122127 AA434447 Hs.106771 ESTs
122348 AA443695 Hs.293410 ESTs
129285 T62068 Hs.11006 ESTs
131267 AA211776 Hs.2504 myomesin 1 (skelemin) (185kD)
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131810 D49487 




leptin (murine obesity homolog)
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3 

2.5 
2.1 
2.1 
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23 
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4.6 
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TABLE 16A 






Table 16A shows the accession numbers for those pkeys lacking UniGene IDs for Table 16.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 



104672 6735_7 AA349096 AI368018 F21390 F17759 R48772 AI421485 AI300352 H43971 AI378525 F33852 R47898 AI264177 F22289 N28263
AI276281 R48205 AI245302 AI190036 AI281050 AW242903 H42892 AA910870 AW473816 H25721 AW451438 F19647 F22375
H45809 F33447 AA774528 AA007629 H42537 C01077 F32386
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TABLE 17: Table 1 from BRCA 014 P 

Table 17 shows accession numbers representing 759 sequences of breast cancer genes or
fragments thereof encoding breast cancer modulating proteins. For each overexpressed gene
identified, a ratio of the relative amount of expression in breast tumors versus normal breast
tissue is shown.
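
The ratio column can be read as tumor expression divided by normal expression. A minimal Python sketch follows. It assumes, purely for illustration, that each tissue class is summarized by the arithmetic mean of its sample intensities; the text does not state which summary statistic was used, and the function name and the sample intensities are hypothetical.

```python
from statistics import mean


def tumor_to_normal_ratio(tumor_values, normal_values):
    """Return mean tumor expression divided by mean normal expression.

    Assumption (not taken from the source): each tissue class is
    summarized by its arithmetic mean before the ratio is formed.
    """
    if not tumor_values or not normal_values:
        raise ValueError("both sample lists must be non-empty")
    normal_mean = mean(normal_values)
    if normal_mean <= 0:
        raise ValueError("mean normal expression must be positive")
    return mean(tumor_values) / normal_mean


# Hypothetical intensities for a single probeset.
ratio = tumor_to_normal_ratio([300.0, 420.0, 360.0], [100.0, 140.0])
print(round(ratio, 1))  # 3.0
```

A ratio above 1 under this reading indicates higher expression in tumor than in normal tissue; the guard on the denominator avoids dividing by a zero or negative background-corrected mean.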



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal breast tissue
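
The column layout defined by this legend can be sketched as a small row parser. The Python example below is a hedged illustration only: the regular expression, the function name `parse_row`, and the rule that a row without an `Hs.` identifier carries its `gb:` description in the title column are assumptions made for this sketch, not a format specification from the source.

```python
import re

# Assumed row shape: Pkey, ExAccn, optional UnigeneID, title, R1.
ROW = re.compile(
    r"^(?P<pkey>\d{6})\s+"            # six-digit Eos probeset identifier
    r"(?P<exaccn>\S+)\s+"             # exemplar Genbank accession
    r"(?:(?P<unigene>Hs\.\d+)\s+)?"   # UnigeneID, absent for some rows
    r"(?P<title>.+?)\s+"              # Unigene title (may be truncated)
    r"(?P<r1>\d+(?:\.\d+)?)$"         # tumor-to-normal ratio
)


def parse_row(line):
    """Parse one table row into a dict, or return None if it does not fit."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    fields = match.groupdict()
    return {
        "pkey": int(fields["pkey"]),
        "exaccn": fields["exaccn"],
        "unigene_id": fields["unigene"],
        "title": fields["title"],
        "r1": float(fields["r1"]),
    }


row = parse_row("103558 BE616547 Hs.2785 keratin 17 3.7")
print(row["title"], row["r1"])
```

Because the title is matched lazily and the ratio is anchored to the end of the line, a title that itself ends in a number (such as "keratin 17") is still separated correctly from the R1 value.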



Pkey ExAccn UnigeneID Unigene Title



100227 AV654694 Hs.82316 interferon-induced, hepatitis C-associat
100405 AW291587 Hs.82733 nidogen 2
100406 AI962060 Hs.118397 AE-binding protein 1
100420 D86983 Hs.118893
100911 X83300 Hs.289103 SMA4
100960 J00124 Hs.117729 keratin 14 (epidermolysis bullosa simple
101011 BE387036 Hs.1211 acid phosphatase 5, tartrate resistant
101163 AA442324 Hs.795 H2A histone family, member O
101194 L20971 Hs.188 phosphodiesterase 4B, cAMP-specific (dun
101329 U66042 Hs.82171 Homo sapiens clone 191B7 placenta expres
101378 BE563085 Hs.833 interferon-stimulated protein, 15 kDa
101474 R07566 Hs.73817 small inducible cytokine A3 (homologous
101491 M25809 Hs.64173 ATPase, H+ transporting, lysosomal (vacu
101530 M29874 Hs.1360 cytochrome P450, subfamily IIB (phenobar
101602 AA353776 Hs.901 CD48 antigen (B-cell membrane protein)
101663 NM_003528 Hs.2178 H2B histone family, member Q
101758 BE019494 Hs.79217 pyrroline-5-carboxylate reductase 1
101767 M81057 Hs.180884 carboxypeptidase B1 (tissue)
101817 M89907 Hs.152292 SWI/SNF related, matrix associated, acti
101851 BE260964 Hs.82045 midkine (neurite growth-promoting factor
101878 M97815 Hs.183650 cellular retinoic acid-binding protein 2
102209 NM_002038 Hs.265827 interferon, alpha-inducible protein (clo
102214 U23752 Hs.32964 SRY (sex determining region Y)-box 11
102297 NM_001504 Hs.198252 G protein-coupled receptor 9
102299 NM_005824 Hs.155545 37 kDa leucine-rich repeat (LRR) protein
102301 NM_005651 Hs.183671 tryptophan 2,3-dioxygenase
102305 AL043202 Hs.90073 chromosome segregation 1 (yeast homolog)
102369 U39840 Hs.299867 hepatocyte nuclear factor 3, alpha
102591 U62325 Hs.324125 amyloid beta (A4) precursor protein-bind
102721 M16646 Hs.118666 hypothetical protein PP591
102739 AA363025 Hs.155572 Human clone 23801 mRNA sequence
102791 AF gb:Human endogenous retrovirus K clone 1
102804 NM_I Hs.83354 lysyl oxidase-like 2
102903 M73779 Hs.250505 retinoic acid receptor, alpha
103010 X52509 Hs.161640 tyrosine aminotransferase
103042 T81656 Hs.252259 ribosomal protein S3
103117 X63578 Hs.295449 parvalbumin
103207 X72790 gb:Human endogenous retrovirus mRNA for
103282 BE390551 Hs.77628 steroidogenic acute regulatory protein r
103284 AJ751601 Hs.8375 TNF receptor-associated factor 4
103329 X85134 Hs.72984 retinoblastoma-binding protein 5
103364 X90872 Hs.279929 gp25L2 protein
103385 NM_007069 Hs.37189 similar to rat HREV107
103456 AA496425 Hs.9629 papillary renal cell carcinoma (transloc




R1 

3 

3^ 
3.6 
3.2 
52 
103438 Y09306 Hs.30148 homeodomain-interacting protein kinase 3 3.4

103558 BE616547 Hs.2785 keratin 17 3.7
103563 L02911 Hs.150402 Activin A receptor, type I (ACVR1) (ALK 3.2
103612 BE336654 Hs.70937 H3 histone family, member A 4.5
103825 AI571835 Hs.55468 ESTs 4
104073 AW779318 Hs.88417 ESTs 3.8
104103 AW021102 Hs.21509 ESTs 4.3
104115 AF183810 Hs.26102 opposite strand to trichorhinophalangeal 7.6
104168 AA461618 Hs.31704 ESTs, Weakly similar to KIAA0227 [H.sapi 3.6
104173 AA084273 Hs.76561 ESTs, Weakly similar to S47072 finger pr 4
104181 AF173296 Hs.283740 DC6 protein 3.
104189 AB040927 Hs.301804 KIAA1494 protein 3.2
104269 AI559444 Hs.293960 ESTs 4.3
104307 AI929700 Hs.111680 endosulfine alpha 3.1
104518 H20816 Hs.112423 Homo sapiens mRNA; cDNA DKFZp586I1420 (f 3.2
104556 AV650851 Hs.96900 hypothetical protein; KIAA1830 protein 4.4
104658 AA360954 Hs.27268 Homo sapiens cDNA: FLJ21933 fis, clone H 3.2
104748 AA015879 Hs.33536 ESTs 3.2
104755 T49951 Hs.9029 DKFZP434G032 protein 4.5
104825 AA035613 Hs.141883 ESTs 6.9
104830 AW294092 Hs.21594 hypothetical protein MGC15754 11.1
104865 T79340 Hs.22575 B-cell CLL/lymphoma 6, member B (zinc fi 3.5
104906 BE298684 Hs.26802 protein kinase domains containing protei 6.5
104961 H78517 Hs.33905 ESTs 3.6
105038 AW503733 Hs.9414 KIAA1488 protein 4.5
105088 H58589 Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL 3.8
105092 AA148982 Hs.29068 ESTs 3
105093 AL1375G6 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 4.8
105304 AW134924 Hs.190325 ESTs 8.2
105397 AA814807 Hs.7395 hypothetical protein FLJ23182 3.1
105409 AW505076 Hs.301655 DiGeorge syndrome critical region gene 8 4.2
105431 AA252033 Hs.242413 hypothetical protein DKFZp434K1421 4.4
105552 AA256750 Hs.28802 centaurin-alpha 2 protein 3.2
105598 AA279439 Hs.279763 hypothetical protein FLJ10504 3.5
105650 W16741 Hs.25635 HSPC003 protein 3.7
105688 AI299139 Hs.17517 ESTs 5.5
105808 AI133161 Hs.286131 CGI-101 protein 3.5
105809 AW973653 Hs.20104 hypothetical protein FLJ00052 3.3
105909 AA195191 Hs.5111 hypothetical protein FLJ20729 3.2
105965 AA131657 Hs.23830 ESTs 3.3
106135 AL117474 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3.2
106184 W28948 Hs.10762 ESTs 3.3
106293 N39842 Hs.301444 KIAA1673 4.1
106400 BE397649 Hs.94109 Homo sapiens cDNA FLJ13634 fis, clone PL 3.1
106474 BE383668 Hs.42484 hypothetical protein FLJ10618 3.2
106484 AA351978 Hs.4943 hepatocellular carcinoma associated prot 7.8
106533 AL134708 Hs.145998 ESTs 3
106614 AA648459 Hs.335951 hypothetical protein AF301222 3.8
106636 AW958037 Hs.286 ribosomal protein L4 3.3
106661 AW499914 Hs.7579 hypothetical protein FLJ10402 3
106743 BE613328 Hs.21938 hypothetical protein FLJ12492 4.2
106844 AA485055 Hs.158213 sperm associated antigen 6 3.4
106864 AI311928 gb:qo89h04.x1 NCI_CGAP_Kid5 Homo sapiens 4.4
106865 AW192535 Hs.19479 ESTs 3.6
106871 AW472981 Hs.321130 hypothetical protein MGC2771 4.1
106942 M995351 Hs.31314 retinoblastoma-binding protein 7 3.6
106988 AF216751 Hs.26813 CDA14 5.3
107105 AW963419 Hs.155223 stanniocalcin 2 3.4
107158 N32849 Hs.31844 hypothetical protein FLJ12586 3.1
107248 AW263124 Hs.315111 nuclear receptor co-repressor/HDAC3 comp 5.9
107265 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.9
107630 AW961576 Hs.60178 ESTs 4.6
107710 AI955040 Hs.265398 ESTs, Weakly similar to transformaUorw 3
107890 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.1
107985 T40064 Hs.71968 Homo sapiens mRNA; cDNA DKFZp564F053 (fr 4.8
108000 AI263307 Hs.239884 H2B histone family, member L 3.3
106217 AA058686 Hs.62588 ESTs 3.8
108435 T82427 Hs.194101 Homo sapiens cDNA: FLJ20869 fis, clone A 3

108591 AB033073 Hs.43857 similar to glucosamine-6-sulfatases 3.3
108733 AA121022 gb:zn84f10.fi Stratagene lung carcinoma 3.9
108771 AF068290 Hs.79741 hypothetical protein FLJ10116 6.1
108819 AA011449 Hs.271627 ESTs 3.6
108912 AA136674 Hs.118681 EST 3.9
109086 AF186114 Hs.270737 tumor necrosis factor (ligand) superfami 3.7
109124 AK000684 Hs.183887 hypothetical protein FLJ22104 3.1
109132 AI970536 Hs.16603 hypothetical protein FLJ13163 3.7
109163 N23235 Hs.30567 ESTs, Weakly similar to B34087 hypotheti 4.5
109277 AA196443 Hs.86043 Homo sapiens cDNA FLJ13558 fis, clone PL 3.7
109410 AW504732 Hs.21275 hypothetical protein FLJ11011 4.6
109454 AA232255 Hs.295232 ESTs, Moderately similar to A46010 X-lin 6.4
109514 AA234087 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.8
109581 R45584 Hs.23025 ESTs, Weakly similar to ALU5_HUMAN ALU S 3.3
109632 AA325138 Hs.235873 hypothetical protein FLJ22672 3
109644 AW973964 Hs.291531 ESTs, Highly similar to 1203217A dehydro 3
109700 F09609 gb:HSC33H092 normalized infant brain cDN 3.2
109768 F06838 Hs.14763 ESTs 3.2
109807 R43646 Hs.12422 ESTs 3.8
109842 AW818436 Hs.23590 solute carrier family 16 (monocarboxylic 3.3
109895 AK001680 Hs.30488 DKFZP434F091 protein 3.6
110024 AW973152 Hs.31050 ESTs 4.2
110561 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 5.1
110675 H89355 Hs.249159 adrenergic alpha-2A-, receptor 5.3
110707 AI239832 Hs.15617 ESTs, Weakly similar to ALU4_HUMAN ALU S 3.7
110915 BE092285 Hs.29724 hypothetical protein FLJ13187 3.7
111139 N64683 Hs.290943 ESTs 4
111155 N66563 Hs.191358 ESTs 3.1
111199 AI767435 Hs.29822 ESTs 4.5
111336 AI457338 Hs.29894 ESTs 5.4
111510 R07856 Hs.16355 ESTs 3.2
111532 R08440 gb^f19f09^1 Soares fetal liver spleen 3.1
111689 AA602004 Hs.23260 ESTs 3.2
111823 R35252 Hs.24944 ESTs, Weakly similar to 2109260A B cell 3.3
111876 R38239 Hs.293246 ESTs, Weakly similar to putative p150 (H 3.1
111892 AA421081 Hs.12388 ESTs 3.4
111893 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 3.3
112125 AW379029 Hs.118338 ESTs, Weakly similar to unnamed protein 4.4
112170 BE246743 Hs.288529 hypothetical protein FLJ22635 7.3
112287 AB033064 Hs.334806 KIAA1238 protein 3.2
112300 H24334 Hs.26125 ESTs 4.4
112303 R54797 gb:yg87b07.s1 Soares infant brain 1NIB H 3.4
112478 R66067 Hs.28664 ESTs 8.2
112561 AI791493 Hs.129873 ESTs, Weakly similar to A36036 cytochrom 5.5
112631 R82040 gb:yj06b06.s1 Soares placenta Nb2HP Homo 3.9
112637 R82331 Hs.164599 ESTs 5.4
112657 AW844878 Hs.19769 hypothetical protein MGC4174 3.2
112678 AI418466 Hs.33665 ESTs 4.7
112917 AA082465 Hs.125031 choline/ethanolaminephosphotransferase 3.7
113070 AB032977 Hs.6298 KIAA1151 protein 3.1
113095 AA828380 Hs.126733 ESTs 3.4
113117 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 3.4
113187 BE613410 Hs.31575 SEC63, endoplasmic reticulum translocon 3.2
113200 T57773 Hs.10263 ESTs 3.5
113206 BE262470 Hs.241471 RNB6 6.2
113374 T79925 Hs.269165 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.7
113440 U54727 Hs.191445 ESTs 3
113494 T91451 Hs.86538 ESTs 3.4
113518 AW367788 Hs.323954 postmeiotic segregation increased 2-like 3.1
113571 AI702609 Hs.15713 hypothetical protein MGC2776 3.1
113822 NM_004585 Hs.17466 retinoic acid receptor responder (tazaro 3.9
113835 AI912410 Hs.27475 Homo sapiens cDNA FLJ12749 fis, clone NT 3
113938 W81598 gb:zd88g02.s1 Soares_fetal_heart_NbHH19W 4.6
113947 W84768 gb:zh53d03.s1 Soares_fetal_liver_spleen_ 3.1
113970 W27249 Hs.8109 hypothetical protein FLJ21080 6.9
114086 AA378776 Hs.288649 hypothetical protein MGC3077 4.3
114148 AW470411 Hs.288433 neurotrimin 4.1
114424 AW780192 Hs.267596 ESTs 3.4
114518 AW163267 Hs.106469 suppressor of var1 (S.cerevisiae) 3-like 3.1
114563 AI979168 Hs.82226 glycoprotein (transmembrane) nmb 4.8
114965 AI733861 Hs.72472 BMP-R1B 10.1
114995 AA769266 Hs.193657 ESTs 3.6
115121 AI634549 Hs.88155 ESTs 3.2
115134 AW968073 Hs.194331 ESTs, Highly similar to A55713 inositol 42
115167 AA749209 Hs.43728 hypothetical protein 3
115253 BE149845 Hs.289038 hypothetical protein MGC4126 3.6
115277 AA814100 Hs.86693 ESTs 35
115327 N46436 Hs.109221 ESTs 3.4
115354 AA281636 Hs.334827 ESTs 4.8
115657 AA405620 Hs.55158 ESTs, Weakly similar to T29520 hypotheti 3.5
115676 AA953006 Hs.88143 ESTs 9.3
115709 AW293849 Hs.58279 ESTs, Weakly similar to ALU7_HUMAN ALU S 3.4
115729 AA417812 Hs.38775 ESTs 4
115787 AI126772 Hs.40479 ESTs 3.1
115830 AW970529 Hs.86434 hypothetical protein FLJ21816 3.6
115835 AA521410 Hs.41371 ESTs 3.1
115850 NM_014937 Hs.52463 KIAA0966 protein 3
115900 AK001500 Hs.165186 hypothetical protein FLJ13852 3.2
115935 AA354549 Hs.41181 Homo sapiens mRNA; cDNA DKFZp727C191 (fr 3
115948 AL042465 Hs.43445 poly(A)-specific ribonuclease (deadenyla 3.1
116092 AB041035 Hs.93847 NM_016931:Homo sapiens NADPH oxidase 4 ( 67
116115 AL042355 Hs.70202 WD repeat domain 10 3.6
116184 AW450737 Hs.128791 CGI-09 protein 3.1
116192 AA464976 Hs.62528 ESTs, Moderately similar to A46010 X-lin 3.3
116208 AI219083 Hs.42532 ESTs, Moderately similar to ALU8_HUMAN A 3.2
116246 AF265555 Hs.250646 baculoviral IAP repeat-containing 6 3.6
116443 AW962196 Hs.321264 LBP protein 32 4.1
116470 AI272141 Hs.83484 SRY (sex determining region Y)-box 4 4.1
116726 AK001114 Hs.53913 hypothetical protein FLJ10252 8.6
116845 AA649530 gb:ns44f05.s1 NCI_CGAP_Alv1 Homo sapiens 3.2
117026 H88256 Hs.50456 ESTs, Moderately similar to ZN75_HUMAN Z 3.5
117216 AI569804 Hs.42792 ESTs, Weakly similar to I78885 serine/th 3.1
117296 AL133427 Hs.42506 Homo sapiens mRNA full length insert cDN 3.2
117403 H84455 Hs.40639 ESTs 4.7
117691 AB040959 Hs.93836 DKFZP434N014 protein 3
118229 AW968941 Hs.166254 hypothetical protein DKFZp566I133 3.3
118363 AI183838 Hs.48938 hypothetical protein FLJ21802 4.3
118416 N66028 Hs.49105 FKBP-associated protein 3.1
118470 AW970584 Hs.291033 ESTs 3.4
118502 AL157488 Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (fr 5.2
118695 AK000465 Hs.50081 KIAA1199 protein 3.4
118925 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 3.3
119025 BE003760 Hs.55209 Homo sapiens mRNA; cDNA DKFZp434K0514 (f 19.7
119036 R95872 Hs.117572 chemokine binding protein 2 3.7
119063 R16833 Hs.53106 ESTs, Moderately similar to ALU1_HUMAN A 4.1
119075 M10905 Hs.287820 fibronectin 1 3.2
119620 W47620 Hs.56009 2'-5'-oligoadenylate synthetase 3 (100 k 3.3
119741 AF041853 Hs.43670 kinesin family member 3A 3.1
119747 AI970797 Hs.64859 ESTs 5
119754 AL037824 Hs.194695 ras homolog gene family, member I 3.8
119905 AW449064 Hs.119571 collagen, type III, alpha 1 (Ehlers-Danl 3.1
120084 W94472 Hs.59529 ESTs, Moderately similar to ALU1_HUMAN A 8.4
120241 AA825686 Hs.321176 ESTs, Weakly similar to S65824 reverse t 3.6
120326 AA196300 Hs.21145 hypothetical protein RG083M05.2 3.2
120742 AA225084 gb:nc21d06.r1 NCI_CGAP_Pr1 Homo sapiens 3.6
120870 AA357172 Hs.292581 ESTs, Moderately similar to ALU1_HUMAN A 5.8
120885 AA365515 Hs.301872 hypothetical protein MGC4840 3
120970 AA398118 Hs.97579 ESTs, Weakly similar to A46010 X-linked 3.7
121054 AW976570 Hs.97387 ESTs 5.3
121095 AA320134 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 4
121103 AA398936 Hs.97697 EST 3.5
121121 AA399371 Hs.189095 similar to SALL1 (sal (Drosophila)-like 6.3
121337 AW885727 Hs.301570 ESTs 4.7
121351 AW206227 Hs.287727 hypothetical protein FLJ23132 5

121611 M31669 Hs.1735 inhibin, beta B (activin AB beta polypep 3.6
121643 AA640987 Hs.193767 ESTs 5.6
121770 NM_015902 Hs.278428 progestin induced protein 3.4
122125 AK000492 Hs.98806 hypothetical protein 4.1
122338 AA443311 Hs.98998 ESTs 3
122417 AA446965 Hs.112092 ESTs 4.7
122513 AI767879 Hs.99214 ESTs 3.8
122544 AW973253 Hs.292689 ESTs 3
122655 AA323296 Hs.97837 Homo sapiens mRNA; cDNA DKFZp547J047 (fr 5.6
122805 AA526911 Hs.82772 collagen, type XI, alpha 1 3.2
122851 AW205931 Hs.99598 hypothetical protein MGC5338 8.6
123105 AA487809 Hs.166011 catenin (cadherin-associated protein), d 3
123111 AA228776 Hs.191721 ESTs 6.9
123249 AA371307 Hs.125056 ESTs 3.6
123273 AA491253 Hs.173611 Empirically selected from AFFX single pr 7
123385 BE149685 Hs.17767 KIAA1554 protein 3.1
123419 T66087 Hs.112482 Homo sapiens unknown mRNA sequence 3.4
123485 AI308876 Hs.103849 hypothetical protein DKFZp761D112 3.1
123645 AI675944 Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 3.8
123819 AA580082 Hs.112264 ESTs 4.7
124012 AA352723 Hs.241471 RNB6 3.8
124243 H69125 Hs.133525 ESTs 4.1
124357 N22401 gb:yw37g07.s1 Morton Fetal Cochlea Homo 4.1
124359 N22508 Hs.139315 Homo sapiens cDNA: FLJ21479 fis, clone C 3.6
124567 AW451645 Hs.151504 Homo sapiens cDNA FLJ11973 fis, clone HE 3.1
124911 N34151 Hs.174195 interferon induced transmembrane protein 3.5
124972 R41396 Hs.101774 hypothetical protein FLJ23045 4.3
125006 BE065136 Hs.145696 splicing factor (CC1.3) 6
125042 T78906 Hs.269432 ESTs, Moderately similar to ALU1_HUMAN A 8.1
125184 W60326 Hs.288684 Homo sapiens cDNA FLJ11750 fis, clone HE 4.7
125243 AW970536 Hs.105413 ESTs 3.1
125286 AF086534 Hs.187561 ESTs, Moderately similar to ALU1_HUMAN A 3.3
125304 AL359573 Hs.124940 GTP-binding protein 3
125330 AW880562 Hs.114574 ESTs 3
125331 AI422996 Hs.161378 ESTs 3.2
125685 AI924630 Hs.4943 hepatocellular carcinoma associated prot 3.2
126257 N99638 gb:za39g11.r1 Soares fetal liver spleen 4
126474 AW975814 Hs.326714 Homo sapiens clone IMAGE:713177, mRNA se 4
126666 AA648886 Hs.151999 ESTs 3.8
126872 AW450979 gb:UI-H-BI3-a1*a-12-0-UI.s1 NCI_CGAP_Su 3
127431 AW771958 Hs.175437 ESTs, Moderately similar to PC4259 ferri 3.6
127980 AA961459 Hs.125644 ESTs 4.1
127997 AW068311 Hs.311054 Homo sapiens mRNA full length insert cDN 3.3
128420 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.6
128609 NM_003616 Hs.102456 survival of motor neuron protein interac 3.9
128946 Y13153 Hs.107316 kynurenine 3-monooxygenase (kynurenine 3 3.1
128955 AA775076 Hs.185807 Homo sapiens, Similar to PRO0478 protein 3.9
129092 D56365 Hs.63525 poly(rC)-binding protein 2 3.3
129270 AA357185 Hs.109918 ras homolog gene family, member H 3.1
129301 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9
129385 AA172106 Hs.110950 Rag C protein 6.2
129619 AA209534 Hs.284243 tetraspan NET-S protein 3.4
129629 AK000398 Hs.11747 hypothetical protein FLJ20391 3
129725 X56411 Hs.1219 alcohol dehydrogenase 4 (class II), pi p 3.2
130069 AI754813 Hs.146428 collagen, type V, alpha 1 5.4
130092 X03363 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 4.4
130298 AI347487 Hs.132781 class I cytokine receptor 4.6
130382 NM_003450 Hs.155204 zinc finger protein 174 5.6
130622 AI582291 Hs.16846 ESTs, Weakly similar to 04HUD1 debrisoqu 3
130703 R77776 Hs.18103 ESTs 3.8
130881 AA809875 Hs.25933 ESTs 4.2
130954 AB014544 Hs.21572 KIAA0644 gene product 4.7
131095 AI399653 Hs.22917 ESTs 4.3
131153 H09048 Hs.23606 ESTs 3.8
131253 R71802 Hs.24853 ESTs 3.5
131372 AW293399 Hs.144904 nuclear receptor co-repressor 1 3.6
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133142 
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133976 
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134710 
134731 
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302384 
302385 
302680 
302830 
302857 



AI826268 Hs.27769 ESTs, Weakly similar to MCAT_HUMAN MITOC 3.2
AI695549 Hs.183868 glucuronidase, beta 3.1
AF017986 Hs.31386 secreted frizzled-related protein 2 3.2
BE501849 Hs.32317 high-mobility group 20B 3.2
D86960 Hs.3610 KIAA0205 gene product 3.6
NM_002314 Hs.36566 LIM domain kinase 1 3.2
AA400091 Hs.39421 ESTs 3.2
AA426202 Hs.40403 Cbp/p300-interacting transactivator, wit 3
D76435 Hs.41154 Zic family member 1 (odd-paired Drosophi 3.2
AA192669 Hs.45032 ESTs 3.5
AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 9.2
AV660345 Hs.238126 CGI-49 protein 8.2
BE568452 Hs.5101 protein regulator of cytokinesis 1 3.4
AA326108 Hs.33829 bHLH protein DEC2 3.2
AA319233 Hs.5521 ESTs 4.8
NM_006276 Hs.184167 splicing factor, arginine/serine-rich 7 3.6
W73311 Hs.169407 SAC2 (suppressor of actin mutations 2, y 3.2
T48195 Hs.58189 eukaryotic translation initiation factor 3.5
Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 4.4
AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 4.8
AI128606 Hs.6557 zinc finger protein 161 3.3
AW952412 Hs.65874 ESTs, Weakly similar to A40348 Ebv/Sex- 3.5
AW162840 Hs.6641 kinesin family member 5C 4.5
AW600291 Hs.6823 hypothetical protein FLJ10430 3.3
AA085191 Hs.6949 hypothetical protein MGC11275 3
293241 Hs.239934 CGI-96 protein 4.5
AW797437 Hs.69771 B-factor, properdin 4.1
BE618768 Hs.7232 acetyl-Coenzyme A carboxylase alpha 5.1
AW675064 Hs.73875 fumarylacetoacetate hydrolase (fumarylac 3
AW162919 Hs.170160 RAB2, member RAS oncogene family-like 3.4
BE274552 Hs.76578 protein inhibitor of activated STAT3 3.9
AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 6.2
BE391929 Hs.8752 transmembrane protein 4 3.1
AI433797 Hs.8889 serine hydroxymethyltransferase 1 (solub 3
D89377 Hs.89404 msh (Drosophila) homeo box homolog 2 5.8
J05582 Hs.89603 mucin 1, transmembrane 4
AF064804 Hs.96757 suppressor of Ty (S.cerevisiae) 3 homolo 3.2
R61253 Hs.98265 KIAA1877 protein 3.3
X78592 Hs.99915 androgen receptor (dihydrotestosterone r 4.8
L10333 Hs.99947 reticulon 1 3.8
AI199738 Hs.208275 ESTs, Weakly similar to ALUA_HUMAN UK 3.8
AW614220 Hs.189402 ESTs 4.2
AW183618 Hs.55610 solute carrier family 30 (zinc transport 9.9
AW591433 Hs.298241 Transmembrane protease, serine 3 4.9
Z45270 Hs.235873 hypothetical protein FLJ22672 3.4
AA572949 Hs.207566 ESTs 3.5
R10799 Hs.191990 ESTs 3.8
AA887801 Hs.208229 G protein-coupled receptor 13.9
AI091631 Hs.203845 two pore potassium channel KT3.3 4.4
AA312082 Hs.105445 GDNF family receptor alpha 1 5.7
079745 Hs.114924 solute carrier family 16 (monocarboxylic 8.6
T97905 gb:ye54d 0.r1 Soares fetal liver spleen 3.9
AB020711 Hs.278346 KIAA0904 protein 7.7
BE542706 Hs.222399 CEGP1 protein 7.3
AW749321 Hs.6786 ESTs 3.3
AL049670 Hs.137576 ribosomal protein L34 pseudogene 1 4.2
NM_003613 Hs.151407 cartilage intermediate layer protein, nu 7.9
AL049987 Hs.166361 Homo sapiens mRNA; cDNA DKFZp564F112 (fr 5.6
AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 5.4
AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 34.1
AL117406 Hs.200102 ATP-binding cassette transporter MRP8 6.7
AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 4
AI678059 Hs.202676 synaptonemal complex protein 2 4.3
AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.8
AW192334 Hs.38218 ESTs 9.6
AI038997 Hs.132921 ESTs 5
AF282265 Hs.44836 inner centromere protein antigens (135kD 3.4
302892 AW176909 Hs.42346 calcineurin-binding protein calsarcin-1 3.4

302970 W05608 Hs.312679 ESTs, Weakly similar to A49019 dynein he 5.1
303271 AA652687 Hs.96151 Human DNA sequence from clone RP5-1103G7 3.7
303289 AL121460 Hs.272673 hypothetical protein FLJ20508 4.1
303357 AW006352 Hs.159643 ESTs, Weakly similar to T32554 hypotheti 4.2
303540 AA355607 Hs.309490 ESTs, Weakly similar to putative WHSC1 p 4.3
303563 AA367699 Hs.10082 potassium intermediate/small conductance 3.3
303642 AW299459 gb:xs50d08.x1 NCI_CGAP_Kid11 Homo sapien 4.2
303780 AI424014 Hs.18995 KIAA1304 protein 3.6
303797 AW629759 gb:hh70e05.y1 NCI_CGAP_GU1 Homo sapiens 4.9
303852 R53434 Hs.90207 hypothetical protein MGC11138 3.7
304328 AA149951 Hs.62112 zinc finger protein 207 3
304782 AA582081 gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 4.1
305913 AA876109 gb:nx24h01.s1 NCI_CGAP_GC3 Homo sapiens 3
305917 AA876469 gb:oe48b04.s1 NCI_CGAP_Pr25 Homo sapiens 3.1
307010 AI140014 gb:qa68f09.x1 Soares_fetal_heart_NbHH19W 3.5
307041 AI144243 gb:qb85b12.x1 Soares_fetal_heart_NbHH19W 3.9
308106 AI476803 gb:tj77e12.x1 Soares_NSF_F8_9W_OT_PA_P_S 4.3
308307 AI581398 Hs.172928 collagen, type I, alpha 1 4.6
308615 AK000142 Hs.101774 hypothetical protein FLJ23045 4.4
309177 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 173
309328 AW024348 Hs.233191 EST, Weakly similar to A27217 glucose tr 3.2
309574 AW168083 gb:xg59g04.x1 NCI_CGAP_Ut4 Homo sapiens 3.1
309583 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 57.6
310064 AI199712 Hs.148486 ESTs, Weakly similar to 1917210A Pro/Arg 4.6
310098 AI685841 Hs.161354 ESTs 3.6
310438 AW022192 Hs.200197 ESTs 4.6
310683 AI939456 Hs.160870 ESTs 3.2
310727 AK000703 Hs.323822 Homo sapiens mRNA for KIAA1551 protein, 3.6
310781 AJ380797 Hs.158992 ESTs 10.2
310895 AI955121 Hs.165724 N-acetylgalactosamine-4-O-sulfotransfera 3.4
310955 AI476732 Hs.263912 ESTs 105
311117 AI671439 Hs.196029 Homo sapiens mRNA for KIAA1657 protein, 3.1
311166 AJ821005 Hs.118599 ESTs 10.8
311237 AA641098 Hs.208809 ESTs, Moderately similar to ALU1_HUMAN A 43
311465 AJ758660 Hs.206132 ESTs 4.4
311587 AI828254 Hs.271019 ESTs, Weakly similar to A47582 B-cell gr 5.1
311598 AW023595 Hs.232048 ESTs 5.8
311774 AA700870 Hs.14304 ESTs 33
311785 AI056769 Hs.133512 ESTs 3.9
311872 R12375 Hs.194600 ESTs 33
311889 AA767342 Hs.122483 ESTs, Weakly similar to PSF_HUMAN PTB-AS 3
311913 AI358522 Hs.270188 ESTs 3
311923 T60843 Hs.189679 ESTs 5.6
311935 AA216387 gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens 5.2
312019 AA373630 Hs.188750 ESTs 3
312021 AA759263 Hs.14041 ESTs 3.4
312067 T78968 Hs.14411 ESTs 3.5
312090 T80177 Hs.118064 similar to rat nuclear ubiquitous casein 3.8
312147 AI633744 Hs.195648 ESTs, Weakly similar to I38022 hypotheti 4.4
312153 BE261944 Hs.118625 hexokinase 1 5.2
312168 T92251 Hs.198882 ESTs 3.3
312182 T94344 Hs.326263 ESTs 3.3
312187 AA700439 Hs.188490 ESTs 3.4
312199 AW438602 Hs.191179 ESTs 3.9
312219 H73505 Hs.117874 ESTs 4
312226 AA315703 Hs.199993 ESTs, Weakly similar to ALUB_HUMAN !!!! 4.9
312299 AA972712 Hs.269737 ESTs 5.7
312544 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 63
312638 AW439195 Hs.256880 ESTs, Weakly similar to S65657 alpha-1C- 4.9
312826 AW291545 Hs.185018 ESTs 4.9
312837 AW292286 Hs.255058 ESTs 4.4
312980 AA497043 Hs.115685 ESTs 3.1
313070 AI422023 Hs.161338 ESTs 4.3
313079 N76497 Hs.1787 proteolipid protein 1 (Pelizaeus-Merzbac 3.3
313089 AF026944 Hs.293797 ESTs 5.8
313096 AW073310 Hs.163533 Homo sapiens cDNA FLJ14142 fis, clone MA 4.5
313126 AA746503 Hs.283313 ESTs 10

313166 AI801098 Hs.151500 ESTs 3.5
313197 AW979008 Hs.222487 ESTs 3.3
313280 AW960454 Hs.222830 ESTs 4.7
313325 AI420611 Hs.127832 ESTs 3.4
313328 AW449211 Hs.105445 GDNF family receptor alpha 1 12.4
313352 AW150945 Hs.144758 ESTs 4.1
313385 AI032087 Hs.269819 ESTs 3
313393 AI674685 Hs.200141 ESTs S2
313417 AA741151 Hs.137323 ESTs 3.5
313434 W92070 gb:zh48g05.r1 Soares_fetal_liver_spleen_ 3.7
313569 AI273419 Hs.135146 hypothetical protein FLJ13984 3
313591 AA046309 gb:zf12f01.s1 Soares_fetal_heart_NbHH19W 5.6
313615 AI540978 Hs.301997 hypothetical protein FLJ13033 3.2
313915 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 26.3
313975 AW175896 Hs.65114 keratin 18 3
313979 AI535895 Hs.221024 ESTs 4.9
313997 AV657317 Hs.288649 hypothetical protein MGC3077 3.9
314043 AA827082 Hs.291872 ESTs 3.1
314078 AW129357 Hs.329700 ESTs 8.3
314097 AA648744 Hs.269493 ESTs 6.6
314121 AI732083 Hs.187619 ESTs 6.2
314129 AA228366 Hs.115122 ESTs 4
314138 AA740616 gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens 5.9
314236 AA743396 Hs.189023 ESTs 3.1
314244 AL036450 Hs.103238 ESTs 4
314305 AI280112 Hs.125232 Homo sapiens cDNA FLJ13266 fis, clone OV 8
314306 AI697901 Hs.192425 ESTs 3.7
314322 AA907153 Hs.190060 ESTs 3.3
314394 AW961597 Hs.130816 ESTs, Moderately similar to I38022 hypot 42
314401 AI660412 Hs.234557 ESTs 3.3
314465 AA602917 Hs.156974 ESTs 4.7
314506 AA833655 Hs.206868 Homo sapiens cDNA FLJ14056 fis, clone HE 8.5
314510 AI204418 Hs.190080 ESTs 4
314546 AW007211 Hs.16131 hypothetical protein FLJ12876 3.4
314547 AA399272 Hs.144341 ESTs 6.7
314558 AI873274 Hs.190721 ESTs 27.4
314627 AA425310 Hs.155766 ESTs, Weakly similar to A47582 B-cell gr 4.4
314648 AW979268 gb:EST391378 MAGE resequences, MAGP Homo 4.6
314691 AW207206 Hs.136319 ESTs 20.7
314729 AA457367 Hs.191638 ESTs 3.6
314754 AW026761 Hs.134374 ESTs 3.6
314814 BE350122 Hs.157367 ESTs, Weakly similar to I78885 serine/th 4.9
314864 AW971198 Hs.294088 ESTs 4.3
314881 AI095087 Hs.152299 ESTs, Moderately similar to S65657 alpha 3.7
314882 AA828032 Hs.169076 ESTs 3.1
314981 AW972359 Hs.293334 ESTs 3
315006 AI538613 Hs.298241 Transmembrane protease, serine 3 10.9
315021 AA533447 Hs.312989 ESTs 5.3
315051 AW292425 Hs.163484 ESTs 12.9
315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUC_HUMAN I 5.8
315073 AW452948 Hs.257631 ESTs 4.2
315080 AA744550 Hs.136345 ESTs 3.7
315175 AI025842 Hs.152530 ESTs 6
315183 AW136134 Hs.220277 ESTs 3.9
315193 AI241331 Hs.131765 ESTs, Moderately similar to I38937 DNA/R 4.4
315196 AI367347 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.2
315198 AI741506 Hs.165900 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.6
315240 R38772 Hs.172619 myelin transcription factor 3.4
315263 AW510994 Hs.220740 ESTs 3.4
315282 AI222165 Hs.144923 ESTs 4.9
315296 AA876905 Hs.125286 ESTs 4
315368 AB037745 Hs.104696 KIAA1324 protein 4.7
315397 AA218940 Hs.137516 fidgetin-like 1 3.1
315489 AI378817 Hs.191847 ESTs 3.1
315498 AA628539 Hs.116252 ESTs, Moderately similar to ALU1_HUMAN A 3.2
315526 AI193043 Hs.128685 ESTs, Weakly similar to T17226 hypotheti 4.1
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315530 AW015415 Hs.127780 ESTs 8.9 

315562 AA737415 Hs.152826 ESTs 5.5
315634 AA837085 Hs.220585 ESTs 6.3
315647 AA648983 Hs.212911 ESTs 3.6
315707 AI418055 Hs.161160 ESTs 5.1
315772 AW515373 Hs.271249 Homo sapiens cDNA FLJ13580 fis, clone PL 3.1
315850 AW270550 Hs.116957 ESTs 3.8
315858 AA737345 Hs.294041 ESTs 5
315878 AA683336 Hs.189046 ESTs 3.1
315977 AW865916 Hs.151206 ESTs 4.7
315978 AA830893 Hs.119769 ESTs 4.1
315995 AI217477 Hs.194591 ESTs 4.1
316012 AA764950 Hs.119898 ESTs 7
316042 AI469960 Hs.170698 ESTs 4.9
316052 AI962796 Hs.136754 ESTs 4.1
316072 AW517524 Hs.135201 NOD2 protein 3.2
316074 AW975114 Hs.293273 ESTs 3.8
316100 AW203986 Hs.213003 ESTs 3.2
316133 AI187742 Hs.125562 ESTs 3.7
316177 AI904982 Hs.293102 ESTs, Moderately similar to ALU1_HUMAN A 30.7
316186 AI433540 gb:ti69g05.x1 NCI_CGAP_Kid11 Homo sapien 3.1
316244 AI640761 Hs.224988 ESTs 3.5
316303 AA740994 Hs.209609 ESTs 3.8
316313 AA741300 Hs.202599 ESTs, Weakly similar to I38022 hypotheti 4.4
316364 AA747807 Hs.149500 ESTs 3.2
316580 AA938198 Hs.146123 poly(A) polymerase gamma 9.4
316697 AW293174 Hs.252627 ESTs 4.4
316715 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 3
316868 AI660898 Hs.195602 ESTs 3.2
316869 AI954880 Hs.134604 ESTs 3.2
316886 AA836331 Hs.134981 ESTs 4.4
316897 AA838114 Hs.221612 ESTs 3.7
316943 AW014875 Hs.137007 ESTs 4.6
317069 AI732892 Hs.190489 ESTs 5.9
317194 AW445167 Hs.126036 ESTs 4.1
317360 AI125252 Hs.126419 ESTs 3.5
317404 AI806867 Hs.126594 ESTs 5.1
317452 AA972965 Hs.135568 ESTs 6.9
317501 AI822034 Hs.137097 ESTs 4.6
317674 AW294909 Hs.132208 ESTs 4.3
317803 AW664964 Hs.128899 ESTs 6.1
317834 X56348 Hs.287270 ret proto-oncogene (multiple endocrine n 3.1
317850 AI681545 Hs.152982 hypothetical protein FLJ13117 3.4
317881 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 9.6
317902 AW102941 Hs.211265 ESTs 4.1
317916 AI565071 Hs.159983 ESTs 103
318042 AW294522 Hs.149991 ESTs 3.1
318223 AI077540 Hs.134090 ESTs 3.9
318327 AW294013 Hs.200942 ESTs 3
318332 AI093930 Hs.163440 Homo sapiens cDNA: FLJ21000 fis, clone C 4.4
318418 AF107493 Hs.118498 Homo sapiens LUCA-15 protein mRNA, splic 5.4
318558 AW402677 Hs.146381 RNA binding motif protein, X chromosome 4.4
318625 AA526235 Hs.193162 Homo sapiens cDNA FLJ11983 fis, clone HE 5.9
318634 T49598 Hs.156832 ESTs 4
318740 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 7.3
318744 AI793124 Hs.144479 ESTs 17.8
318781 F11802 Hs.6818 ESTs 3
319191 NM_012391 Hs.79414 prostate epithelium-specific Ets transcr 3.6
319476 AI524124 Hs.270307 ESTs 4.6
319510 W88532 Hs.254562 ESTs 3.3
319551 AA761668 gb:nz24c08.s1 NCI_CGAP_GCB1 Homo sapiens 3.2
319745 T79366 Hs.108258 actin binding protein; macrophin (microf 3.3
319834 AA071267 gb:zm61g01.r1 Stratagene fibroblast (937 6.2
319840 C19035 Hs.164259 ESTs 3.3
319977 AA534222 gb:nj21d02.s1 NCI_CGAP_AA1 Homo sapiens 43
320074 AA321166 Hs.278233 ESTs 3.4
320167 AA984373 Hs.90790 Homo sapiens cDNA: FLJ22930 fis, clone K 4.1
320187 799949 Hs.303428 Homo sapiens cDNA FLJ14832 fis, clone OV 5.3

320211 AL039402 Hs.125783 DEME-6 protein 9.2 

320416 AI026984 Hs.293662 ESTs 3.1

320588 U78082 Hs.167738 RNA polymerase II transcriptional regula 3.1

320635 N50617 Hs.80506 small nuclear ribonucleoprotein polypept 6.1

320654 AI160015 Hs.118112 ESTs 3.5 

320742 AI601188 Hs.120910 ESTs 3 

320832 AA214584 Hs.290167 ESTs 3.7 

320915 AI359144 Hs.143688 Homo sapiens cONA: FU23031 fis, done L 3.1 

321016 BE144167 Hs.49994 hypothetical protein similar to RNA-bind 3.3

321107 AI732643 Hs.144151 ESTs 12.3 

321171 AI769410 Hs.221461 ESTs 3.3 

321253 AA610649 Hs.333239 ESTs 3 

321318 AB033041 Hs.137507 vang (van gogh, Drosophila)-like 2 3.9

321642 AI432199 Hs.247084 ESTs 3 

321644 AW975944 Hs.237396 ESTs 117 

321683 AI471598 Hs.197531 ESTs 3.8 

321756 U29112 Hs.196151 ESTs 4.4 

321811 D80630 gb:HUM091D02B Human fetal brain (TFuJiwa 3.2 

321828 R59890 Hs.83623 nuclear receptor subfamily 1, group I, m 3.1

321910 H67065 Hs.271530 ESTs, Weakly similar to ALU7_HUMAN ALU S 4.7 

321937 AL049351 Hs.302058 Homo sapiens mRNA; cDNA DKFZp566C093 (fr 3.5 

321978 N77342 Hs.21851 Homo sapiens cDNA FU12900 fis, clone NT 5 

322035 AL137517 Hs.334473 hypothetical protein DKFZp56401278 19 

322136 AF075083 gb:Homo sapiens full length insert cDNA 3.6 

322258 BE265745 Hs.194359 ESTs, Weakly similar to ALUC_HUMAN !!!! 3

322296 W76326 gb:zd60d04.r1 SoaresJetaUiearLNbHH19W 4.4 

322303 AI357412 Hs.157601 ESTs 11.5 

322476 AW963372 Hs.46677 PRO2000 protein 3 

322520 T55958 gb:yb35f05.r1 Stratagene fetal spleen (9 3 

322521 AF147347 gb:Homo sapiens full length insert cDNA 4.2 
322567 AF155103 Hs.256150 Homo sapiens, Similar to RIKEN cDNA 2810 4
322595 W92147 Hs.118394 ESTs 5.4
322675 AA017656 gb:ze39h01.r1 Soares retina N2b4HR Homo 3.1 
322766 AW068805 Hs.288467 Homo sapiens cDNA FU12280 fis, done MA 5.2 
322818 AW043782 Hs.293616 ESTs 7.6 
322882 AW248508 Hs.279727 Homo sapiens cDNA FU14035 fis, done HE 5.9 
322975 C16391 gb:C16391 Ciontech human aorta polyA mRN 16.5 
323091 AI902456 Hs.210761 ESTs, WeaWy similar to I38022 hypothet! 4 
323131 AK002088 Hs.270124 Homo sapiens cDNA FU11226 fis, clone PL 3.3 
323168 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 6.3 
323244 AW675572 Hs.193620 ESTs 4.6 
323262 AL133990 Hs.190642 ESTs 10.5 

323332 AI829520 gbwl19c06jc1 NCLCGAP.Utl Homo sapiens 6.2 

323333 AV651680 Hs.208558 ESTs 4.3 
323335 AI655499 Hs.161712 ESTs 9.2 
323645 AW445014 Hs.197746 ESTs 3.1 
323663 BE081058 Hs.243023 ESTs 4 
323693 AA317962 Hs.249721 ESTs, Moderately similar to PC4259 ferri 3 
323782 AW961560 Hs.97600 ESTs 3.2 
323817 AA410943 BMP-R1B 8.4 
323930 AL043683 Hs.8173 hypothetical protein FU10803 3.3 
323974 AI825204 Hs.211408 ESTs 4.5 
324001 AL044949 Hs.1 16298 ESTs 4.5 
324036 AW72078 Hs.303662 ESTs 8.4 
324261 BE069341 gb:QV3-BT0381-27010(W73-c08 BT0381 Homo 49.4 
324285 AA431159 Hs.122954 ESTs 3 
324296 AI524039 Hs.192524 ESTs 3 
324305 AA642007 Hs.1 16369 ESTs . 3.3 
324432 AA464510 Hs.152812 ESTs 16.5 
324585 AI823969 Hs.132678 ESTs 3.3 
324598 AW972227 Hs.163986 Homo sapiens cDNA: FU22765 fis, clone K 5 
324603 AW993522 Hs.292934 ESTs 10.4 
324631 AA937116 Hs.293683 ESTs, Weakly similar to 154374 gene NF2 3.3 
324716 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.2 
324748 AW974941 Hs.292385 ESTs, Weakiy similar to I78885 serine/th 3 
324771 AA631739 Hs.335440 EST 3 
324774 AI031771 Hs.132586 ESTs 4.2

324823 AW516704 Hs.208726 ESTs 3.4 

324824 AI826999 Hs.224624 ESTs 3.1 
324826 AA704806 Hs.143842 ESTs, Weakly similar to 2004399A chromos 4.4
324961 AA613792 gb:no97h03.s1 NCLCGAP_Pr2 Homo sapiens 3.9 
324987 AI375572 Hs.172634 ESTs 18.8 
324994 AI805416 Hs.213897 ESTs 3.3 
325146 AI064690 Hs.171176 ESTs 4.2 
325372 Phase 2 & 3 Exons 4.4 
325544 Phase 2 & 3 Exons 57 
327075 Phase 2 &3 Exons 3.8 
332798 C220000075i|12314195|emb|CAB99338.1 1 (A 4.3 
334223 NM_005080*:Homo sapiens X-box binding pr 26.2 
334447 NM_012429*:Homo sapiens SEC14 (S. cerevi 3.9
335809 NM_014509*:Homo sapiens kraken-like (BK1 10.1
335824 ENSP00000249072*:DJ222E13.1(N-TERMINAL 20
338255 NM_014323*:Homo sapiens zinc finger prot 9
409430 R21945 Hs.166975 splicing factor, arginine/serine-rich 5 4
428046 AW812795 Hs.155381 ESTs, Moderately similar to I38022 hypot 4.6 
432558 R97268 Hs.177269 ESTs 3.2 
436808 AA731602 . Hs.120266 ESTs 3.9 
448569 BE382657 Hs.21486 signal transducer and activator of trans 4.1 
453542 AW836724 Hs.339660 Homo sapiens mRNA expressed only in plac 3.7 

M97935 AFFX control: STAT1 3.2 

M97935 AFFX control- STAT1 3 

M55150 fumarytacetoacetatB 3 

M13755 interferon stimulated protein; 15 kDa 4.5 

AI052Q47 ESTs 6.7 

AA252033 ESTs; Weakly similar to HI! ALU SUBFAMILY J 3.2 

AA401739 ESTs 3.3 

H 18459 hepatocellular carcinoma associated protein; 3 

R48744 ESTs 4.2 

M31 682 inhibin; beta B (activin AB beta polypeptide) 3 

AA416873 ESTs 3 

D80240 HUM5G1 1 A Human fetal brain (TFujlwara) Homo 4 

R49590 ESTs 3.2 

CH2?_FGENES.678J5 16.8 

CH2i.FGENES.619J 12.9 

CH22JGENES.619J2 11-3 

CH22_EM:AC005500.GENSCAN.127,9 9.2 

CH22_EM:AC005500.GENSCAN.304 2 8.5 

CH2i.FGENES.271J 8.4 

CH22JGENES.619J3 8 

CH22JGENES.271J 7.3 

CH22JGENES.617J 7.2 

CH.07 hsgi|6004473 7.1 

CH2i.FGENES.264 J 6.8 
X03363 HER2 receptor tyrosine kinase (c erbB 2; ERBB2; neu) 6.6 

. ■ CH2i.FGENES.617J 6.5 

CH.07 Jsgi|5868264 5.8 . 

CH.19 hsgi|5867439 5.7 

CH22JGENES.6 3 5.3 

CH.17Jsgi|5867230 5.1 

CR20Jsgi|6552458 5.1 

CH22JM:AC005500.GENSCAN.148 22 4.7 

CH22 FGENES.669J0 4.6 

AA034918 KIAA1028 protein 4.6 

CH22JGENES.48J2 4.5 

CH22.FGEhES.118J 4.5 

AF049569 ESTs 4.4 

M13955 multiple UniGene matches 4.3 

CH2i.FGENES.619J 4.3 

CH22JGENES.137 4.3 

HG4126 HT4396 Zinc Finger Protein Hzf4 

CH22LFGENES.360J 4.3 

CH22JGENES.706J 4.3 

CH.21Jis gl[S531965 " 42 
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HG2614HT2710 
X83535 



10 



HG4716HT5158 



15 



HG4677HT5102 



20 



AA714311 



25 



HG2465 HT4871 



30 



AA707750 



35 



40 



AA976074 



45 



CH.17_hsgl|5867215 
CH22_FGENES.669J 

CH22.FGENES.48J8 

matrix metalbproteinase 14 (membrane Inserted) 
CH22J=GENES.27L6 
CH22_FGENES.617_3 
CH^FGENES^J 

CH2a.FGENES.13 5 
CH22_FGENES.132 
CH.14_hsg!|6682474 
CH.02Jisgi[5867750 
CH22J r GENES.617J 

CH22_DJ32I10.GENSCAN.23 39 
CH22_FGENES.543J0 
CH2^EM:ACQ05500.GENSCAN.96 1 
CH22.FGENES.204J 
CH22.FGENES.619 J 
CH.16Jisgi|5867087 
EST cluster (not In UniGene) 
CH22_EM:AC005500.GENSCAN.149 9 
CH22_EMAC005500.GENSCAN.421 5 
CH22.FGENES.134 
CH.07_hsgf|6004478 
CH22J : GENES.360_1 

CH22__FGENES.6_2 

CH22_C20H12.GENSCAN.162 

CH22_C65E1.GENSCAN.81 . 

ESTs; Weakly similar to cts Golgi matrix 

CH22^FGENES.307J 

CH22_EM AC005500.GENSCAN.248 14 

CH.06JisgI|5902482 

CH22_FGENES.669_5 

CH22_D032I10.GENSCAN.19 8 

CH22_FGENES.527_6 

CH22_FGENES.330J0 

CH22.FGENES.142 

ESTs 

CH22_FGENES.2267 
CH22J=GENES.133 
CH22.EMAC005500.GENSCAN.209 12 
CH22.FGENES.271.3 



4.1 " 
4.1 

Collagen, Type Viii, Alpha 1 4.1 

4.1 

4 

3.9 
3.9 
3.8 

Guanoslne 5' Monophosphate Synthase 3.8 

3.8 

3.8 

3.8 

3.8 

3.7 

Oncogene Ret/Ptc2, Fusion Activated 3.7 

3.7 

3.7 

3.7 

3.5 

3.5 

3.5 

3.4 

3.4 

3.4 

3.3 

3.3 

3.3 

Ona Binding Protein Ap 2, Art Splice 3 3.3 

3.3 

3.2 

3.2 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3.1 

3 

3 

3 

3 

3 
TABLE 17A

Table 17A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 17.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number 

CAT number: Gene cluster number
Accession: Genbank accession numbers 



Pkey    CAT number      Accession

116845  393481_1        AA649530 AA659316 H64973
103207  30635_-4        X72790
126257  182217_1        N99638 AW973750 AA328271 H90994 AA558020 AA234435 N59599 R94815
102791  37186_1         AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI680833
                        AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI583718 AI672574
                        N25695 AW665466 AI818326 AA126128 AI480345 AW013827 AA248636 AI214968 AA204735 AA207155 AA206262 AA204833
                        AW003247 AW496808 AI080480 AI631703 AI651023 AI867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030
                        AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048
                        AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE466611 AI206344 AA574397
                        AA348354 AI493192
126872  142696_1        AW450979 AA136653 AA136656 AW419381 AA984358 AA492073 BE168945 AA809054 AW238038 BE011212 BE011359
                        BE011367 BE011368 BE011362 BE011215 BE011365 BE011363
112631  1746257_1       R82040 R70934
120742  176835_1        AA225084 AA302713
106864  324239_1        AI311928 AA936030 T51931 AA609816 AA487195 AA664207
109700  genbank_F09609  F09609
111532  genbank_R08440  R08440
113938  genbank_W81598  W81598
113947  genbank_W84768  W84768
124357  genbank_N22401  N22401
108733  504187_1        AA121022 AA126422
112303  genbank_R54797  R54797
322136  46802_1         AF075083 H52291 H52528
322296  47334_1         W76326 AF086341 W72300
321811  1527481_1       D80630 D80896 D80895
314648  293660_1        AW979268 AA878419 AA431342 AA431628
322520  38916_1         T55958 T57205 AF147346
322521  38917_1         AF147347 T55426 T55503
322675  86787_1         AA017656 AA017374 AA019761
323332  179142_1        AI829520 AI791832 AA228414 AI791823 AA229211 AA229315
316186  425440_1        AI433540 AA728984 AA804981
322975  1510563_1       C16391 C16413
324261  273265_1        BE069341 AW748403 AL044891 AI908240 AA393080
323817  233566_1        AA410943 AW948953 AA334202 AA332882
301976  128835_1        T97905 AA101672
324961  376239_1        AA613792 AW182329 T05304 AW858385
303642  284260_1        AW299459 AA417112
303797  386364_1        AW629759 AW749955 AA633408 AI651005
319551  357371_1        AA761668 AA573621 R92814 R09670
311935  174129_1        AA216387 T63548 AA228676
319834  112523_1        AA071267 T65940 T64515 AA071334
319977  345248_1        AA534222 AA632632 T81234
314138  179960_1        AA740616 AA654854 AA229923
313591  103087_1        AA046309 AI263500 AA046397
308106 AW76803 

338255 CH22 6856FG_LINieEM:AC00 

335809 CH22J181FGJ17JJ.INKJM 

335824 CH22J197FGJ19J1J.INKJ 

307010 AI140014 ' 

307041 AJ144243 

305913 AA876109 

305917 AA876469 

309574 AW168083 

325372 c12Jis 

325544 c12Jis 

332798 CH22J4FG_A.5_UNKJI4G1.G 
334223 CH22_1507FG_360JJJNK_EM 
327075 c21Jis 

334447 CH22.1746FG.387J_LINK_EM 
304782 AA582081 

313434 441798J W92070 AW019952 W92053 



PCT/US02/02242 
TABLE 17B

Table 17B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 17. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.


Pkey    Ref                Strand  Nt_position

334447  Dunham, I. et al.  Plus    14308764-14308824
335809  Dunham, I. et al.  Plus    26310772-26310909
335824  Dunham, I. et al.  Plus    26376860-26376942
332798  Dunham, I. et al.  Minus   232147-231974
334223  Dunham, I. et al.  Minus   12734365-12734269
338255  Dunham, I. et al.  Minus   15242294-15242231
325372  5866920            Minus   1117061-1117304
325544  6682452            Plus    171228-171286
327075  6531965            Plus    4041318-4041431
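
As an illustration of how the rows of Table 17B might be read programmatically, the following is a minimal sketch in Python. It assumes whitespace-separated rows in the order Pkey, Ref, Strand, Nt_position, and it assumes that both listed nucleotide positions are inclusive; neither assumption is stated in the table itself. The names PredictedExon and parse_row are hypothetical and are introduced only for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PredictedExon:
    pkey: int    # Eos probeset identifier
    ref: str     # "Dunham, I. et al." or a Genbank GI number
    strand: str  # "Plus" or "Minus"
    first: int   # first listed nucleotide position
    last: int    # second listed nucleotide position

    @property
    def length(self) -> int:
        # Assumption: both listed positions are inclusive. abs() is used
        # because some rows list the larger coordinate first.
        return abs(self.last - self.first) + 1


def parse_row(line: str) -> PredictedExon:
    """Parse one row of the form: Pkey, Ref, Strand, Nt_position."""
    fields = line.split()
    if len(fields) < 4:
        raise ValueError(f"expected at least 4 fields, got: {line!r}")
    first_text, last_text = fields[-1].split("-")
    return PredictedExon(
        pkey=int(fields[0]),
        ref=" ".join(fields[1:-2]),  # the reference may contain spaces
        strand=fields[-2],
        first=int(first_text),
        last=int(last_text),
    )


if __name__ == "__main__":
    for row in (
        "332798 Dunham, I. et al. Minus 232147-231974",
        "325544 6682452 Plus 171228-171286",
    ):
        exon = parse_row(row)
        print(exon.pkey, exon.strand, exon.length)
```

Taking the reference from the middle fields, rather than from a fixed column, lets the same function handle both the literature citation and the GI-number form of the Ref column.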
TABLE 18: Table 2 from BRCA 014 P

Table 18 shows genes whose expression in breast tumor tissue is at least five times that
in normal body tissues.



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue



Pkey    ExAccn    UnigeneID  Unigene Title                             R1

101378  BE563085  Hs.833     interferon-stimulated protein, 15 kDa     5.3
101530  M29874    Hs.1360    cytochrome P450, subfamily IIB (phenobar  9
101787  M81057    Hs.180884  carboxypeptidase B1 (tissue)              12
101878  M97815    Hs.183650  cellular retinoic acid-binding protein 2  6.5
103010  X52509    Hs.161640  tyrosine aminotransferase                 12.4
104115  AF183810  Hs.26102                                             7.6
104825  AA035613  Hs.141883  ESTs                                      6.9
107105  AW963419  Hs.155223  stanniocalcin 2                           5.3
108819  AA011449  Hs.271627  ESTs                                      6.1
112287  AB033064  Hs.334806  KIAA1238 protein                          7.3
112561  AI791493  Hs.129873  ESTs, Weakly similar to A36036 cytochrom  8.2
112637  R82331    Hs.164599  ESTs                                      5.4
113206  BE262470  Hs.241471  RNB6                                      6.2
113970  W27249    Hs.8109    hypothetical protein FLJ21080             6.9
114965  AI733881  Hs.72472   BMP-R1B                                   10.1
118925  N92293    Hs.206832  ESTs, Moderately similar to ALU8_HUMAN A  19.7
119905  AW449064  Hs.119571  collagen, type III, alpha 1 (Ehlers-Danl  8.4
121611  M31669    Hs.1735    inhibin, beta B (activin AB beta polypep  5.6
129301  AF182277  Hs.330780  cytochrome P450, subfamily IIB (phenobar  6.2
133976  AI908165  Hs.169946  GATA-binding protein 3 (T-cell receptor   6.2
134731  D89377    Hs.89404   msh (Drosophila) homeo box homolog 2      5.8
300254  AW183618  Hs.55610   solute carrier family 30 (zinc transport  9.9
301884  AA312082  Hs.105445  GDNF family receptor alpha 1              5.7
302001  AB020711  Hs.278346  KIAA0904 protein                          7.7
302067  BE542706  Hs.222399  CEGP1 protein                             7.3
302276  AW057736  Hs.323910  HER2 receptor tyrosine kinase (c-erb-b2,  5.4
302290  AA179949  Hs.175563  Homo sapiens mRNA; cDNA DKFZp564N0763 (f  34.1
302372  AL117406  Hs.200102  ATP-binding cassette transporter MRP8     6.7
302385  AJ224172  Hs.204096  lipophilin B (uteroglobin family member)  13.8
309177  AI951118  Hs.326736  Homo sapiens breast cancer antigen NY-BR  17.3
309583  AW170035  Hs.326736  Homo sapiens breast cancer antigen NY-BR  57.6
310781  AI380797  Hs.158992  ESTs                                      10.2
311166  AI821005  Hs.118599  ESTs                                      10.8
311935  AA216387             gb:nc16b02.s1 NCI_CGAP_Pr1 Homo sapiens   5.2
312153  BE261944  Hs.118625  hexokinase 1                              5.2
313328  AW449211  Hs.105445  GDNF family receptor alpha 1              12.4
313915  C18863    Hs.163443  Homo sapiens cDNA FLJ11576 fis, clone HE  26.3
314097  AA648744  Hs.269493  ESTs                                      6.6
314138  AA740616             gb:ny97f11.s1 NCI_CGAP_GCB1 Homo sapiens  5.9
314506  AA833655  Hs.206868  Homo sapiens cDNA FLJ14056 fis, clone HE  8.5
314558  AI873274  Hs.190721  ESTs                                      27.4
314691  AW207206  Hs.136319  ESTs                                      20.7
315006  AI538613  Hs.298241  Transmembrane protease, serine 3          10.9
315021  AA533447  Hs.312989  ESTs                                      5.3
315051  AW292425  Hs.163434  ESTs                                      12.9
315060  AA551104  Hs.189048  ESTs, Moderately similar to ALUC_HUMAN !  5.8
315196 
315530 
315634 
316012 
316177 
316580 
317603 
317881 
318740 
318744 
320211 
321107 
321644 
321978 
322035 
322766 
322818 
322975 
323262 
323332 
323817 
324261 
324432 
324598 
324603 
324987 
325544 
330388 
334223 
335809 
335824 



AI367347 Hs.44898
AW015415 Hs.127780
AA837085 Hs.220585
AA764950 Hs.119898
AI904982 Hs.293102
AA938198 Hs.146123
AW664984 Hs.128899
AI827248 Hs.224398
NM_002543 Hs.77729
AI793124 Hs.144479
AL039402 Hs.125783
AI732643 Hs.144151
AW975944 Hs.237396
N77342 Hs.21851
AL137517 Hs.334473
AW068805 Hs.288467
AW043782 Hs.293616
C16391
AL133990 Hs.190642
AI829520
AA410943
BE069341
AA464510 Hs.152812
AW972227 Hs.163986
AW993522 Hs.292934
AI375572 Hs.172634

Hs.46 



AI052047 
R72427 



Homo sapiens clone TCCCTA00151 mRNA sequ 8.2
ESTs 8.9
ESTs 6.3
ESTs 7
ESTs, Moderately similar to ALU1_HUMAN A 30.7
poly(A) polymerase gamma 9.4
ESTs 6.1
Homo sapiens cDNA FLJ11469 fis, clone HE 9.6
oxidised low density lipoprotein (lectin 7.3
ESTs 17.8
DEME-6 protein 9.2
ESTs 12.3
ESTs 11.7
Homo sapiens cDNA FLJ12900 fis, clone NT 5
hypothetical protein DKFZp56401278 19
Homo sapiens cDNA FLJ12280 fis, clone MA 5.2
ESTs 7.6
gb:C16391 Clontech human aorta polyA mRN 16.5
ESTs 10.5
gb:wl19c06.x1 NCI_CGAP_Ut1 Homo sapiens 6.2
BMP-R1B 8.4
gb:QV3-BT0381-270100mc08 BT0381 Homo 49.4
ESTs 16.5
Homo sapiens cDNA: FLJ22765 fis, clone K 5
ESTs 10.4
ESTs 18.8
Phase 2 & 3 Exons 5.7
HER2 receptor tyrosine kinase (c-erb-b2, 6.6
NM_005080*:Homo sapiens X-box binding pr 26.2
NM_014509*:Homo sapiens kraken-like (BK1 10.1
ENSP00000249072*:DJ222E13.1 (N-TERMINAL
ESTs; Weakly similar to CYTOCHROME P450 6.7

5.5 
9.2 
8 

6.5 
7,3 
119 
8.4 
11.3 
8.5 
7.1 
7.2 
16.8 



CH22_EM:AC005500.GENSCAN.127 9 
CH22J r GENES.619J3 
CH22 W FGENES.617J 
CH22_FGENES.271_7 
CH22_FGENES.619J 
CH22_FGENES.271 8 
CH22.FGENES.619 12 
CH2Z_EMAC005500.GENSCAN.304 2 
CH.07Jisgi|6004473 
CH22.FGENES.617J 
CH22JGEN£Smj 
CH22_FGENES.678_5 
TABLE 18A

Table 18A shows the accession numbers for those pkeys lacking UnigeneIDs for Table 18.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey    CAT number  Accession

323332  179142_1    AI829520 AI791832 AA228414 AI791823 AA229211 AA229315
322975  1510563_1   C16391 C16413
324261  273265_1    BE069341 AW748403 AL044891 AI908240 AA393080
323817  233566_1    AA410943 AW948953 AA334202 AA332882
311935  174129_1    AA216387 T63548 AA228676
314138  179980_1    AA740616 AA654854 AA229923



335809 CH22_3181FGJ17_6JJN»eEM 

335824 CH2^3197FG_619J1_LINJ^E 

325544 c12_hs 

334223 CH22_1507FGJ60J.UNK.EM 
TABLE 18B

Table 18B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 18. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.



Pkey    Ref                Strand  Nt_position

335809  Dunham, I. et al.  Plus    26310772-26310909
335824  Dunham, I. et al.  Plus    26376860-26376942
334223  Dunham, I. et al.  Minus   12734365-12734269
325544  6682452            Plus    171228-171286
TABLE 19: 1045 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT TISSUES

Table 19 shows 1045 genes up-regulated in breast cancer compared to normal adult tissues.
These were selected from 59680 probesets on the Affymetrix/Eos-Hu03 GeneChip array such
that the ratio of "average" breast cancer to "average" normal adult tissues was greater than or
equal to 2.5. The "average" breast cancer level was set to the 90th percentile value. The
"average" normal adult tissue level was set to the 90th percentile value amongst 144
non-malignant tissues. In order to remove gene-specific background levels of non-specific
hybridization, the 15th percentile value amongst the 144 non-malignant tissues was subtracted
from both the numerator and the denominator before the ratio was evaluated.
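
The ratio described above can be sketched for a single probeset as follows. This is only an illustration under stated assumptions: the percentile method (linear interpolation between ranked values) is not specified in the text, the treatment of a non-positive denominator is a choice made for this example, and the function names percentile, tumor_to_normal_ratio and is_up_regulated are hypothetical.

```python
from typing import Optional, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranked values.

    Assumption: the interpolation method is not given in the text.
    """
    if not values:
        raise ValueError("percentile of an empty sequence is undefined")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def tumor_to_normal_ratio(
    tumor: Sequence[float], normal: Sequence[float]
) -> Optional[float]:
    """Background-subtracted ratio of 90th percentile levels."""
    # The 15th percentile of the non-malignant tissues is the background.
    background = percentile(normal, 15.0)
    numerator = percentile(tumor, 90.0) - background
    denominator = percentile(normal, 90.0) - background
    if denominator <= 0:
        # Assumption: the ratio is treated as undefined in this case.
        return None
    return numerator / denominator


def is_up_regulated(
    tumor: Sequence[float], normal: Sequence[float], threshold: float = 2.5
) -> bool:
    """Apply the selection threshold (ratio greater than or equal to 2.5)."""
    ratio = tumor_to_normal_ratio(tumor, normal)
    return ratio is not None and ratio >= threshold


if __name__ == "__main__":
    # Hypothetical expression levels, not data from the tables.
    normal_levels = [float(v) for v in range(101)]
    tumor_levels = [4.0 * v for v in range(101)]
    print(tumor_to_normal_ratio(tumor_levels, normal_levels))
    print(is_up_regulated(tumor_levels, normal_levels))
```

Subtracting the same background value from numerator and denominator means that a probeset with a high non-specific hybridization floor is not credited with that floor in either tissue class.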



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of tumor to normal body tissue



Pkey ExAccn UnigeneID Unigene Title R1

408591 AF015224 Hs.46452 mammaglobin 1 137.6
4Q6S64 M21305 gb:Human alpha satellite and satellite 3 71.0
400291 AA401369 Hs.190721 ESTs 68.4
407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 54.2
449746 AI668594 Hs.176588 ESTs, Weakly similar to CP4Y_HUMAN CYTOC 46.4
426878 BE069341 gb:QV3-BT0381-27010M73-c08 BT0381 Homo 44.8
400292 AA250737 Hs.72472 BMP-R1B 37.4
427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 32.9
408045 AW138959 Hs.245123 ESTs 31.9
407178 AA195651 Hs.104106 ESTs 30.4
407377 C16391 gb:C16391 Clontech human aorta polyA mRN 27.7
450705 U90304 Hs.25351 Iroquois homeobox protein 5 24.8
407212 AA412108 Hs.269350 ESTs 22.0
428848 NM_000230 Hs.194236 leptin (murine obesity homolog) 21.9
404561 trichorhinophalangeal syndrome I (TRPS1) 21.8
407980 AA046309 gb:zf12ID1.s1 Soares_fetal_heart_NbHH19W 19.8
447350 AI375572 Hs.172634 ESTs 17.3
450375 AA009647 Hs.8850 a disintegrin and metalloproteinase doma 16.6
422109 S73265 Hs.1473 gastrin-releasing peptide 16.5
435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 16.0
453160 AI263307 Hs.239884 H2B histone family, member L 15.8
420813 X51501 Hs.99949 prolactin-induced protein 15.8
415989 AI267700 Hs.317584 ESTs 15.5
422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.8
424399 AI905687 Hs.2533 aldehyde dehydrogenase 9 family, member 14.5
423575 C18863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 13.7
429441 AJ224172 Hs.204096 lipophilin B (uteroglobin family member) 13.6
431474 AL133990 Hs.190642 ESTs 13.5
448595 AB014544 Hs.21572 KIAA0644 gene product 13.0
427217 AA399272 Hs.144341 ESTs 12.8
402578 C1001134:gi|2117372|pir|ll65981fattyac 12.6
422805 AA436989 Hs.121017 H2A histone family, member A 12.2
424634 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 12.0
456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 11.9
424086 AI351010 Hs.102267 lysyl oxidase 11.9
459587 AA031956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 11.5
416208 AW291168 Hs.41295 ESTs, Weakly similar to MUC2_HUMAN MUCIN 11.5

429170 NM_001394 Hs.2359 dual specificity phosphatase 4 11.5

407276 AI951118 Hs.326736 Homo sapiens breast cancer antigen NY-BR 11.4 

434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ11382 fis, clone HE 11.3

448390 AL035414 Hs.21068 hypothetical protein 11.1

452401 NM_007115 Hs.29352 tumor necrosis factor, alpha-induced pro 11.0

421037 AI684808 Hs.197653 programmed cell death 9 (PDCD9) 10.9 

452461 N78223 Hs.108106 transcription factor . 10.7 

443348 AW873596 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 10.6 

10 4211» H87879 Hs.1 02267 lysyl oxidase 105 

' 402606 NM_024626:Homo sapiens hypothetical prot 10.4. 

409269 AA576953 Hs.22972 hypothetical protein FU13352 10.4 

447268 AI37G413 Hs.36563 hypothetical protein FU22418 103 

447033 AI357412 . Hs.157601 ESTs 10.2 

15 400295 W72838 Hs.2533 aldehyde dehydrogenase 9 family, member 10.1 

424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1

432441 AW292425 Hs.163484 ESTs 9.9

427365 AI873274 Hs.190721 ESTs 9.9 

438950 H23789 Hs.144530 EST , 9.8 

422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 9.7

425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 9.7

411869 W20027 Hs.23439 ESTs 9.6 

439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6 

445730 A1624342 Hs.170042 ESTs 9.5 

25 459583 AI907673 gb:IL-8T152-080399-004BT152Homosapien 9.3 

452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 9.2 

432596 AJ224741 Hs.278461 matrilin3 9.1 

400297 AI127076 Hs.334473 hypothetical protein DKFZp56401 278 9.1 

449448 D60730 Hs.57471 ESTs 9.1 

30 423945 M410943 gb:zt32h03.r1 Soares ovary tumor NbHOT H 9.1 

408348 TargetExon 9.0 

424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 9.0 

453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 9.0 

433365 AF026944 Hs.293797 ESTs 8.8 ■ 

35 405654 NA C1 2001521 :gi|751 3934|pir||T31081 cca3pr 8.8 

418601 AA279490 Hs.86368 calmegin 8.8 

451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 8.7 

408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 8.5 

409041 AB033025 Hs.50081 KIAA1 199 protein 8.4 

40 442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A .8.3 

451561 N52812 Hs.177403 ESTs 8.2 

424001 W67883 Hs.137476 paternally expressed 10 8.2 

429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.1 

423887 AL080207 Hs.134585 DKFZP434G232 protein 8.1 

45 405095 NA TaigetExon 8.1 

419296 AA236115 Hs.120785 ESTs 8.0 

447164 AF026941 Hs.17518 Homo sapiens dg5 mRNA, partial sequence 8.0 

413472 BE242870 Hs.75379 solute carrier family 1 (glial high affi 8.0 

416747 AW876523 Hs.15929 hypothetical protein FU1 2910 8.0 

50 415385 R17798 Hs.7535 COBW-like protein 7.9 

434424 AI811202 Hs.325335 Homo sapiens cDNA: FU23523 fis, clone L 7.9 

420931 AF044197 Hs.100431 small Inducible cytokine B subfamily (Cy 7.9 

406687 M31126 Hs.272620 matrix metaltoprotelnase 1 1 (MMP11;stro 7.8 

400285 NA Eos Control 7.7 

55 437207 T27503 Hs.15929 hypothetical protein FU12910 7.6 

427119 AW880562 Hs.114574 ESTs 7.5 

429534 AW976987 Hs.1 63327 ESTs, Weakly similar to 21 09260AB cell -7.5 

433426 H69125 Hs.133525 ESTs 7.5 

411078 AI222020 Hs.182364 CocoaCrfep 7.4 

60 426214 H59846 Hs.128355 ESTs, Moderately similar to AIU7_HU MAN A 7.4 

447475 A1380797 Hs.158992 ESTs 7.3 

415263 AA948033 Hs.130853 ESTs 7.2 

439569 AW602166 Hs.222399 CEGP1 protein 72 

414142 AW368397 Hs.150042 Homo sapiens cDNA FU14438 fis, clone HE 7.1 

65 426261 AW242243 Hs.1 68670 peroxisomal famesylated protein 7,0 

444783 AK001468 Hs.62180 anilfin (Drosophlla Scraps homolog), act 65 

445885 AJ734009 Hs.127699 KIAA1 603 protein 6.9 . 
429432 AI678059 Hs.202676 synaptonemal complex protein 2 6.9

410781 AI375672 Hs.165028 ESTs 6.9 

443788 AI732643 Hs.144151 ESTs 6.9 

421373 AA808229 Hs.167771 ESTs 6.8 

451398 AI793124 Hs.144479 ESTs 6.8 
404253 NM_021058*:Homo sapiens H2B histone fami 6.8

441098 AI015591 Hs.131004 ESTs, Weakly similar to T17227 hypotheti 6.7

426215 AW963419 Hs.155223 stanniocalcin 2 6.6

428227 AA321649 Hs.2248 small Inducible cytokine subfamily B (CX 6.6 

422956 BE545072 Hs.122579 hypothetical protein FU10461 6.6 

411111 AW818127 gb:CM1-ST0277-061299^)59-b07ST0277Homo6.6 

434988 AI418055 Hs.161160 ESTs • 6.6 

442580 AI733682 Hs.1 30239 ESTs 6.6 

449611 AI970394 , Hs.197075 ESTs 6.6 

408000 L11690 Hs.620 bullous pemphigoid antigen 1 (230/240kD) 6.5 

420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 6.5 

431089 BE041395 Hs.283676 ESTs, Weakly similar to unknown protein . 6.5 

400301 X03635 Hs.1657 estrogen receptor 1 6.5

427358 AW023482 Hs.97849 ESTs 6.5 

425704 U79293 Hs.1 59264 Human clone 23948 mRNA sequence 6.4 

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 6.4

424902 NM_003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4

448693 AW004854 Hs.228320 hypothetical protein FLJ23537 6.4

431448 AL137517 Hs.334473 hypothetical protein DKFZp56401278 6.2 

444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 6.1 

422168 AA586894 Hs.1 12408 S1 00 caidum-bmding protein A7 (psorfas 6.1 

453331 AI240665 Hs.8895 ESTs 6.1 

418007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1; inters 6.0

441233 AA972965 Hs.135568 ESTs 6.0 

418092 R45154 Hs.106604 ESTs 6.0 

430044 AA464510 Hs.152812 ESTs 5.9 

432837 AA310693 Hs.87329 HSPC072 protein 5.9 

433285 AW975944 Hs.237396 ESTs 5.9 

450701 H39960 Hs.283467 Homo sapiens cDNA FU12280 fis, clone MA 5.9 

425707 AF1 15402 Hs.1 1713 E74-fike factor 5 (ets domain transcript 5.9 

410785 AW803341 gb: IL2-UM0079-090300-050-D03 UM0079 Homo 5.9 

425398 AL049689 Hs.156369 hypothetical protein similar to tenasctn 5.9 

414812 X72755 Hs.77367 monokine induced by gamma interferon 5.8 

459371 R20991 gb:yg06hO1.r1 Soares infant brain 1NIB H 5.8 

411284 N28519 Hs.135191 ESTs, Weakly similar to unnamed protein 5.8 

453511 AL031224 Hs.33102 transcription factor AP-2 beta (activati 5.8 

451807 W52854 Hs.27099 hypothetical protein RJ23293 similar to 5.7 

430510 AW162916 Hs.241576 hypothetical protein PR02577 5.7 

415539 A1733881 Hs.72472 BMP-RIB 5.6 

438199 AW016531 Hs.1 22147 ESTs 5.6 

417866 AW067903 Hs.82772 collagen, type XI, alpha 1 .5.5 

430019 AA463893 Hs.220933 ESTs 5.5 

439809 R41396 Hs.101774 hypothetical protein FU23045 5.5 

423811 AW299598 Hs.50895 noroeoboxC4 5.4 

434539 AW748078 Hs.214410 ESTs, Weakly similar to MUC2JIUMAN MUCIN 5.4 

439138 AI742605 Hs.193696 ESTs 5.4 

453931 AL121278 Hs.25144 ESTs 5.4 

444078 BE246919 Hs.10290 U5 snRNP-specific 40 kOa protein (hPrp8- 5.4 

447102 BE167434 Hs.98471 ESTs; Weakly similar to T1 8712 hypothefl 5.4 

451621 A1879148 Hs.26770 fatty acid binding protein 7, brain 5.4 

425235 AW067800 Hs.155223 stanniocalcin 2 5.3

421464 AA291553 Hs.190086 ESTs 5.3 

450736 AW970060 gb:EST382140 MAGE resequences, MAGK HomoSJ 

42B085 AA421081 Hs.1 2388 ESTs 5.3 

452838 U65011 Hs.30743 preferentially expressed antigen in mela 5.3 

445424 AB028945 Hs.12696 cortactin SH3 domain-binding protein 5.3 

456938 X52509 Hs.161640 tyrosine aminotransferase 5.3 

422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.2 

438167 R28363 Hs.24286 ESTs 5.2 

433330 AW207084 Hs.132816 hypothetical protein MGC14801 5.2 

449765 N92293 Hs.206832 ESTs, Moderately similar to ALU8_HUMAN A 5.2 

416276 U41060 Hs.79136 LIV-1 protein, estrogen regulated 5.2 

400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 5.2 

418004 U37519 Hs.87539 aldehyde dehydrogenase 3 family, member 5.2 

439840 AW449211 Hs.105445 GDNF family receptor alpha 1 5.2 

428771 AB028992 Hs.193143 KIAA1069 protein 5.2 
455047 AW852530 gb:PM1-CT0243-071099-001-g06 CT0243 Homo 5.2 

419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 5.2 

453197 AI916269 Hs.109057 ESTs, Weakly similar to ALU5_HUMAN ALU S 5.1 

400298 AA032279 Hs.61635 six transmembrane epithelial antigen of 5.1 

431023 AI283133 Hs.297420 ESTs 5.1 

427666 AI791495 Hs.180142 calmodulin-like skin protein 5.1 

427718 AI798680 Hs.25933 ESTs 5.1 

434531 AA642007 Hs.116369 ESTs 5.1 

429220 AW207206 Hs.136319 ESTs 5.1 

405494 NA C2001837*:gi|12697903|dbj|BAB21770.1| (A 5.1 

452930 AW195285 Hs.194097 ESTs, Weakly similar to I38022 hypotheti 5.1 

444910 AI201849 gb:qs76g04.x1 NCI_CGAP_Pr28 Homo sapiens 5.1 

453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.0 

444381 BE387335 Hs.283713 ESTs, Weakly similar to S64054 hypotheti 5.0 

450603 R43646 Hs.12422 ESTs 5.0 

416575 W02414 Hs.38383 ESTs 5.0 

438504 AW665281 Hs.224625 ESTs 5.0 

416209 AA236776 Hs.79078 MAD2 (mitotic arrest deficient, yeast, h 5.0 

428804 AK000713 Hs.193736 hypothetical protein FLJ20706 5.0 

420077 AW512260 Hs.87767 ESTs 4.9 

450480 X82125 Hs.25040 zinc finger protein 239 4.9 

437637 AJ003029 Hs.65792 syntrophin, gamma 2 4.9 

431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 4.9 

418836 AI655499 Hs.161712 ESTs 4.8 

442441 AI820662 Hs.129598 ESTs 4.8 

435635 AF220050 Hs.181385 uncharacterized hematopoietic stem/proge 4.8 

400286 NA C16000922:gi|7499103|pir||T20903 hypothe 4.8 

407506 U71600 gb:Human zinc finger protein zfp31 (zf31 4.8 

420026 AI831190 Hs.166676 ESTs 4.8 

441377 BE218239 Hs.202656 ESTs 4.8 

457726 AI217477 Hs.194591 ESTs 4.8 

412785 AW997556 Hs.78521 KIAA1717 protein 4.8 

428358 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 4.7 

436026 AI349764 Hs.217081 ESTs 4.7 

409110 AA191493 Hs.48778 niban protein 4.7 

400284 NA estrogen receptor 1 4.7 

410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, clone HE 4.7 

407819 R42185 Hs.274803 ESTs 4.7 

430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.7 

422896 AW961489 Hs.154116 ESTs 4.7 

453616 NM_003462 Hs.33846 dynein, axonemal, light intermediate pol 4.7 

427427 AF077345 Hs.177936 ESTs 4.6 

421751 AW813731 Hs.159153 ESTs, Moderately similar to S65657 alpha 4.6 

454074 R63503 Hs.28419 ESTs 4.6 

405718 C4000799*:gi|6330365|dbj|BAA86508.1| (AB 4.6 

444649 AW207523 Hs.197628 ESTs 4.6 

429431 Z40313 Hs.106330 Homo sapiens clone IMAGE:23371, mRNA seq 4.6 

427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 4.6 

447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 4.6 

430345 AK000282 Hs.239681 hypothetical protein FLJ20275 4.6 
454307 AW855717 gb:RC1-CT0279481299-013-b01 CT0279 Homo 4.6 

400303 AA242758 Hs.79136 LIV-1 protein, estrogen regulated 4.6 

438180 AA808189 Hs.272151 ESTs 4.6 
451340 AW936273 gb:QVO^T0020^0200-107-g07 DT0020 Homo 4.6 

458711 AL036877 Hs.282878 ESTs 4.6 

457430 AA514660 Hs.128443 ESTs 4.6 

416030 H15261 Hs.21948 ESTs 4.6 

447233 AW246333 Hs.17901 Homo sapiens, clone IMAGE:3937015, mRNA, 4.6 

445537 AJ245671 Hs.12844 EGF-like-domain, multiple 6 (EGFL6) 4.5 

424590 AW966399 Hs.46821 hypothetical protein FLJ20086 4.5 

432374 W68815 Hs.301885 Homo sapiens cDNA FLJ11346 fis, clone PL 4.5 

423833 AW503329 gb:UI-HF-8N0-akx^-02-(WJl.r1 NIH_MGC_50 4.5 

406747 AI925153 Hs.217493 annexin A2 4.5 

412102 H56435 gb:yq98e09.f1 Soares fetal liver spleen 4.5 

431716 D89053 Hs.268012 fatty-acid-Coenzyme A ligase, long-chain 4.5 

411050 AW814902 gb:MR1-ST0206-120400-022-108 ST0206 Homo 4.5 

401418 NA C14000338*:gi|7459502|pir||S74665 outer 4.5 

436194 AK001074 Hs.333435 Homo sapiens cDNA FLJ10212 fis, clone HE 4.5 

436211 AK001581 Hs.334828 hypothetical protein FLJ10719; KIAA1794 4.4 

414080 AA135257 Hs.47783 B aggressive lymphoma gene 4.4 

424115 AA335497 Hs.293965 ESTs, Weakly similar to I38022 hypotheti 4.4 

415786 AW419198 Hs.257924 hypothetical protein FLJ13782 4.4 

442117 AW664964 Hs.128899 ESTs 4.4 

452784 BE463857 Hs.151258 hypothetical protein FLJ21052 4.4 

432731 R31178 Hs.287820 fibronectin 1 4.4 
410534 AW905138 gb:QV0-NN1071 -280400-207^07 NN1071 Homo4.4 

405196 NA C2000662*:gi|7512792|pir||T12482 hypothe 4.4 

430217 N47863 Hs.336901 ribosomal protein S24 4.4 

401793 C17001545:gi|5360127|gb|AAD42882.1|AF155 4.4 

415747 AA381209 gb:EST94257 Activated T-cells I Homo sap 4.4 

423679 AB007975 Hs.131454 KIAA0506 protein 4.4 

400238 NA , C19000274*:gi|12741327|ref|XP„008833^| 4.4 

425627 AF019612 Hs.297007 membrane-bound transcription factor prot 4.4 

400608 C10001899:gi|7508633|pir||T25392 hypothe 4.4 

458634 AV657310 Hs.282898 ESTs 4.3 

407771 AL138272 Hs.62713 ESTs 4.3 

405906 NA Target Exon 4.3 

405925 NA Target Exon 4.3 

439382 BE247684 Hs.103070 ESTs 4.3 

445263 H57646 Hs.42586 KIAA1560 protein 4.3 

407162 N63855 Hs.142634 zinc finger protein 4.3 

419536 AA603305 gb:np12d11.s1 NCI_CGAP_Pr3 Homo sapiens 4.3 

454359 N71277 gb:za36e03.s1 Soares fetal liver spleen 4.3 

411558 AA102670 Hs.70725 gamma-aminobutyric acid (GABA) A recepto 4.2 

450715 AI266484 Hs.31570 ESTs, Weakly similar to KIAA1324 protein 4.2 

421451 AA291377 Hs.50831 ESTs 4.2 

452864 AA033714 Hs.287629 hypothetical protein FLM260 4.2 

409757 NM_001898 Hs.123114 cystatin SN 4.2 

413043 BE158766 gb:IL2-HT0397-071299-024-F02 HT0397 Homo 4.2 

413499 BE144884 gb:CM0W01 82-041 099-065*11 HT0182 Homo4.2 

444619 BE538082 Hs.8172 ESTs, Moderately similar to A46010 X-lin 4.2 

408380 AF123050 Hs.44532 diubiquitin 4.2 

406992 S82472 gb:beta -pol=DNA polymerase beta {exon a 4.2 

404285 NA C6001909:gi|704441|dbj|BAA18909.1|P298 4.2 

425247 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stro 4.2 

428046 AW812795 Hs.155381 ESTs, Moderately similar to I38022 hypot 4.2 

446163 AA026880 Hs.25252 prolactin receptor 4.2 

421147 AW592167 Hs.293299 ESTs 4.2 

426451 AI908165 Hs.169946 GATA-binding protein 3 (T-cell receptor 4.2 

415227 AW821113 Hs.72402 ESTs 4.2 

452176 AA024538 Hs.282990 Human DNA sequence from clone RP1^28H20 4.2 

452862 AW378065 Hs.8687 ESTs 4.2 

443646 AI085198 Hs.164226 ESTs 4.2 

425523 AB007948 Hs.158244 KIAA0479 protein 4.1 

424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 4.1 

430009 AA894564 Hs.22242 ESTs 4.1 

434469 AA634806 gb:ab28c02.r1 Stratagene hmg (937210) H 4.1 

451381 BE241831 Hs.172330 hypothetical protein MGC2705 4.1 

450229 R18717 Hs.8929 hypothetical protein FLJ11362 4.1 
455700 BE068115 gb:CM1-BT0368-061299-060^07BT0368Homo4.1 

431924 AK000850 Hs.272203 Homo sapiens cDNA FLJ20843 fis, clone AD 4.1 

438885 AI886558 Hs.184987 ESTs 4.1 

401451 NM_004496*:Homo sapiens hepatocyte nucle 4.1 

431676 AI685464 gb^t88f04a1 NCLCGAP_Pr28 Homo sapiens 4.1 

409092 AI735283 Hs.172608 ESTs 4.1 

429270 W60379 Hs.57773 ESTs 4.1 

443903 AI220547 Hs.135223 ESTs 4.1 

427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 4.1 

410275 U85858 Hs.61796 transcription factor AP-2 gamma (activat 4.1 

432912 BE007371 Hs.200313 ESTs 4.1 

403585 Target Exon 4.1 

438295 AI394151 Hs.37932 ESTs 4.1 

420380 AA640891 Hs.102406 ESTs 4.1 

431118 BE264901 Hs.250502 carbonic anhydrase VIII 4.1 

416182 NM_004354 Hs.79069 cyclin G2 4.1 

418994 AA296520 Hs.89546 selectin E (endothelial adhesion molecul 4.1 

400555 Target Exon 4.1 

410079 U94362 Hs.58589 glycogenin 2 4.0 

427674 NM_003528 Hs.2178 H2B histone family, member Q 4.0 

427131 AA448460 Hs.112017 GE36 gene 4.0 

439759 AL359055 Hs.67709 Homo sapiens mRNA full length insert cDN 4.0 

429353 AL117408 Hs.200102 ATP-binding cassette transporter MRP8 4.0 

421296 NM_002666 Hs.103253 perilipin 4.0 

418819 AA228776 Hs.191721 ESTs 4.0 

424188 AW954552 Hs.142634 zinc finger protein 4.0 

455431 AW938484 gb:CM0-DT0057-290200-253-d06 DT0057 Homo 4.0 

404142 NA Target Exon 4.0 

441143 AI027604 Hs.159650 ESTs 4.0 

444540 AI693927 Hs.265165 ESTs 4.0 

415579 AA165232 Hs.222069 ESTs 4.0 



452891 N75582 Hs.212875 ESTs, Weakly similar to DYH9_HUMAN CIUA 4.0 
414605 BE390440 gb:601283601F1 NIHJMGCjW Homo sapiens c 4.0 



452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 4.0 

417801 AA417383 Hs.82582 integrin, beta-like 1 (with EGF-like rep 4.0 

446232 AI281848 Hs.194691 retinoic acid induced 3 4.0 

447377 X77343 Hs.334334 transcription factor AP-2 alpha (activat 4.0 

437854 AL119723 gb:DKFZp761A2124_r1 761 (synonym: hamy2) 4.0 

446140 AA356170 Hs.26750 hypothetical protein FLJ21908 4.0 

452240 AI591147 Hs.61232 ESTs 4.0 

459574 AI741122 Hs.101810 Homo sapiens cDNA FLJ14232 fis, clone NT 4.0 

458673 N99626 gb:za39d11.r1 Soares fetal liver spleen 4.0 

444858 AI199738 Hs.208275 ESTs, Weakly similar to ALU/LHUMAN 111! 4.0 

452166 AI948607 Hs.264680 ESTs 4.0 

452681 AF153330 Hs.30246 solute carrier family 19 (thiamine trans 3.9 

450192 AA263143 Hs.24596 RAD51-interacting protein 3.9 

405554 NA Target Exon 3.9 

416259 AA573006 Hs.19173 ESTs 3.9 

445813 Z42023 Hs.106576 alanine-glyoxylate aminotransferase 2-li 3.9 

451024 AA442176 gb:zw63b08.r1 Soares_total_fetus_Nb2HF8_ 3.9 

413930 M86153 Hs.75618 RAB11A, member RAS oncogene family 3.9 

401781 Target Exon 3.9 

415296 F05086 Hs.328142 ESTs 3.9 

452564 AA026777 gb:ze93c11.r1 Soares_fetal_heart_NbHH19W 3.9 

442500 AI819068 Hs.209122 ESTs 3.9 

419759 Z21336 Hs.135411 actin related protein 3.9 

424638 AI472106 Hs.49303 Homo sapiens cDNA FLJ11663 fis, clone HE 3.9 

439699 AF086534 Hs.187561 ESTs, Moderately similar to ALU1_HUMAN A 3.9 

428042 AA419529 Hs.76391 myxovirus (influenza) resistance 1, homo 3.9 

452501 AB037791 Hs.29716 hypothetical protein FLJ10980 3.9 

453049 BE537217 Hs.30343 ESTs 3.9 

443213 BE568414 Hs.145497 Homo sapiens cDNA: FLJ22097 fis, clone H 3.9 

443489 AI073512 Hs.133916 ESTs 3.9 

455092 BE152428 gb:CM0-HT0323-151299-126-b04 HT0323 Homo 3.9 

401785 NM_002275*:Homo sapiens keratin 15 (KRT1 3.9 

426427 M86699 Hs.169840 TTK protein kinase 3.9 

446009 AI989885 Hs.231926 ESTs 3.9 

436033 H75391 Hs.255748 ESTs 3.9 

451067 BE172186 gb:MRO-KT0559-11030O^05-h11 HT0559 Homo3.8 

419348 AA236645 Hs.98274 ESTs 3.8 

444635 AI184268 Hs.339665 ESTs 3.8 

412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 3.8 

403593 NA Target Exon 3.8 

442323 AW016669 Hs.29190 ESTs 3.8 

419854 AW664873 Hs.87836 Homo sapiens PAC clone RP5-1087M19 from 3.8 

433871 W02410 Hs.205555 ESTs 3.8 

445253 AI217928 Hs.144762 ESTs 3.8 

409542 AA503020 Hs.36563 hypothetical protein FLJ22418 3.8 

443162 T49951 Hs.9029 DKFZP434G032 protein 3.8 

458194 AW383618 Hs.265459 ESTs, Moderately similar to ALU2_HUMAN A 3.8 

422475 AL359938 Hs.117313 Meis (mouse) homolog 3 3.8 

440705 AA904244 Hs.153205 ESTs 3.8 

447290 AI476732 Hs.263912 ESTs 3.8 

403426 Target Exon 3.8 

427821 AA470158 Hs.98202 ESTs 3.8 

454288 BE222648 Hs.279458 ESTs, Highly similar to c38QA1.1bIH.sap 3.8 

443801 AW206942 Hs.253594 ESTs 3.8 

410658 AW105231 Hs.192035 ESTs 3.8 

410672 AW794600 gb:RC64JM0014-17030M22-C05 UM0014 Homo 3.8 

428579 NM_005756 Hs.184942 G protein-coupled receptor 64 3.8 

445495 BE622641 Hs.38489 ESTs, Weakly similar to I38022 hypotheti 3.8 

447995 AI742618 Hs.181733 ESTs, Weakly similar to nitrilase homolo 3.7 

401747 Homo sapiens keratin 17 (KRT17) 3.7 

420633 NM_014581 Hs.274480 odorant-binding protein 2A 3.7 

423545 AP000692 Hs.129781 chromosome 21 open reading frame 5 3.7 

433138 AB029496 Hs.59729 semaphorin sem2 3.7 

434715 BE005346 Hs.116410 ESTs 3.7 

428664 AK001666 Hs.189095 similar to SALL1 (sal (Drosophila)-like 3.7 

450951 AA018534 Hs.103334 ESTs 3.7 

402696 NA C3002523:gi|6686211|sp|Q27533|YH2IVLCAEEL 3.7 

446868 AV660737 Hs.135100 ESTs 3.7 

458154 AW816379 Hs.335018 ESTs 3.7 

422026 U80736 Hs.110826 trinucleotide repeat containing 9 3.7 

419440 AB020689 Hs.90419 KIAA0882 protein 3.7 

421524 AA312082 Hs.105445 GDNF family receptor alpha 1 3.7 

417283 N62840 Hs.48648 ESTs 3.7 

401508 NA NM_024817:Homo sapiens hypothetical prot 3.7 

410303 AA324597 Hs.21851 Homo sapiens cDNA FLJ12900 fis, clone NT 3.7 

420362 U79734 Hs.97206 huntingtin interacting protein 1 3.7 

433384 AI021992 Hs.124244 ESTs 3.7 

434302 AA629065 Hs.116301 ESTs 3.7 

443938 R55373 Hs.20864 ESTs 3.7 

448420 BE623004 gb:601441282F1 NIH_MGC_72 Homo sapiens c 3.7 

458712 AI347502 Hs.107872 hypothetical protein FLJ20761 3.7 

433404 T32982 Hs.102720 ESTs 3.7 

405232 NM_015832:Homo sapiens methyl-CpG bindin 3.7 

430491 AL109791 Hs.241559 Homo sapiens mRNA full length insert cDN 3.7 

455609 BE011668 gb:CM3-BN0223-100500-177-a04 BN0223 Homo 3.7 

450164 AI239923 Hs.30098 ESTs 3.7 

453948 AI970797 Hs.64859 ESTs 3.7 

436061 AI248584 Hs.190745 Homo sapiens cDNA: FLJ21326 fis, clone C 3.7 

401049 NA Target Exon 3.6 

418867 D31771 Hs.89404 msh (Drosophila) homeo box homolog 2 3.6 

420179 N74530 Hs.21168 ESTs 3.6 

458663 AV658444 Hs.280776 tankyrase, TRF1-interacting ankyrin-rela 3.6 

437259 AI377755 Hs.120695 ESTs 3.6 

428309 M97815 Hs.183650 cellular retinoic acid-binding protein 2 3.6 

450522 AI698839 gb:wd31f02.x1 Soares_NFL_T_GBC_S1 Homo s 3.6 

451952 AL120173 Hs.301663 ESTs 3.6 

412209 AW901456 gb:RC0-NN1012-270300-031-c07 NN1012 Homo 3.6 

425201 AA352111 gb:EST60061 Activated T-cells XX Homo sa 3.6 

443830 AI142095 Hs.143273 ESTs 3.6 

439255 BE164500 gb:RC4-HT0469-230300-014-e10 HT0469 Homo 3.6 

414869 AA157291 Hs.21479 ubinuclein 1 3.6 

409064 AA062954 Hs.141883 ESTs 3.6 

407721 Y12735 Hs.38018 dual-specificity tyrosine-(Y)-phosphoryl 3.6 

445135 AK000054 Hs.12347 hypothetical protein FLJ20047 3.6 

404091 NA Target Exon 3.6 

409731 AA125985 Hs.56145 thymosin, beta, identified in neuroblast 3.6 

405153 Target Exon 3.6 

423248 AA380177 Hs.125845 ribulose-5-phosphate-3-epimerase 3.6 

403639 NA ENSP00000233023*:CDNA FLJ12662 fis, clon 3.6 

404360 C7001385:gi|12082809|gb|AAG48618.1|AF315 3.6 

422352 AA766296 Hs.99200 ESTs 3.6 

423338 AB007961 Hs.127338 KIAA0492 protein 3.6 

424202 BE350295 Hs.15032 RAN binding protein 17 3.6 

431750 AA514986 Hs.283705 ESTs 3.6 

439907 AA853978 Hs.124577 ESTs 3.6 

453596 AA441838 Hs.62905 hypothetical protein FLJ14834 3.6 

406446 NA Target Exon 3.6 

418454 AA315308 Hs.195870 hypothetical protein FLJ14991 3.6 

434360 AW015415 Hs.127780 ESTs 3.6 

409079 W87707 Hs.82065 interleukin 6 signal transducer (gp130, 3.6 

440132 AI697121 Hs.202466 ESTs, Weakly similar to S65824 reverse t 3.6 

448706 AW291095 Hs.21814 interleukin 20 receptor, alpha 3.6 

440671 AW297920 Hs.130054 ESTs 3.6 

407647 AW860158 gb:RC0-CT0379-290100-032-b04 CT0379 Homo 3.5 

459023 AW968226 Hs.60798 ESTs 3.5 

402820 NM_017646*:Homo sapiens tRNA isopentenyl 3.5 

417009 AA191719 Hs.314714 ESTs 3.5 

447048 AW393080 Hs.228320 hypothetical protein FLJ23537 3.5 

449978 AI806335 Hs.200829 ESTs, Weakly similar to T30171 ninein - 3.5 

426062 AA420683 Hs.98321 hypothetical protein FLJ14103 3.5 

452909 NM_015368 Hs.30985 pannexin 1 3.5 

400610 NA Target Exon 3.5 

417843 W07361 Hs.22545 Homo sapiens cDNA FLJ12935 fis, clone NT 3.5 

419335 AW960146 Hs.284137 hypothetical protein FLJ12888 3.5 

451592 AI805416 Hs.213897 ESTs 3.5 

443270 NM_004272 Hs.337737 Homer, neuronal immediate early gene, 1B 3.5 

423948 AW392342 Hs.283077 centrosomal P4.1-associated protein; unc 3.5 

449424 AW448937 Hs.197030 ESTs 3.5 

423841 AW753967 gb:RC2-CT0304-080100-O1 14i12 CT0304 Homo 3.5 

416806 NM_000288 Hs.79993 peroxisomal biogenesis factor 7 3.5 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 3.5 

433104 AL043002 Hs.128246 ESTs, Moderately similar to unnamed prot 3.5 

415778 H84847 Hs.49391 hypothetical protein LOC54149 3.5 

413054 AW316843 Hs.66309 hypothetical protein MGC11061 3.5 

416636 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 3.5 

424639 AI917494 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HE 3.5 

424827 AI057094 Hs.96867 Homo sapiens cDNA: FLJ23155 fis, clone L 3.5 

437762 AI370876 Hs.79090 exportin 1 (CRM1, yeast, homolog) 3.5 

411514 AW850178 gb:IL3<;T0219-27109^022-H12CT0219Homo 3.5 

413783 AA314337 Hs.301547 ribosomal protein S7 3.5 

421106 AA877124 Hs.172844 ESTs 3.5 

431291 N25521 Hs.25275 Kruppel-type zinc finger protein 3.5 

440623 AI935016 Hs.216639 ESTs 3.5 

455838 BE145808 gb:MRO-HT0208-101299-1 03411 HT0208 Homo 3.5 

458771 AW295151 Hs.163612 ESTs 3.5 

442942 AW167087 Hs.131562 ESTs 3.5 

436550 Z50158 Hs.270235 ESTs, Weakly similar to MMHUB1 laminin b 3.5 

418849 AW474547 Hs.53565 Homo sapiens PIG-M mRNA for mannosyltran 3.5 

424420 BE614743 Hs.146688 prostaglandin E synthase 3.5 

430916 AW505021 Hs.88414 BTB and CNC homology 1, basic leucine zi 3.5 

432030 AI908400 Hs.143789 ESTs 3.5 

439405 AF086224 Hs.55238 ESTs 3.5 

405917 NA C17000675:gi|7290703|gb|AAF46150.1| (AE0 3.5 

452727 AW993582 Hs.176220 ESTs 3.5 

426320 W47595 Hs.169300 transforming growth factor, beta 2 3.4 

421070 AA283185 Hs.19327 ESTs 3.4 

424625 AW904466 Hs.321197 PDZ domain protein (Drosophila inaD-like 3.4 

428508 BE252383 Hs.184668 SBBI31 protein 3.4 
455651 BE064962 gb:RC1-BT0313-1304QQ-€1&cO2 BT0313 Homo 3.4 

410555 U92649 Hs.64311 a disintegrin and metalloproteinase doma 3.4 

447754 AW073310 Hs.163533 Homo sapiens cDNA FLJ14142 fis, clone MA 3.4 

418636 AW749855 gb:QV4-BT0534-281299-053-c05 BT0534 Homo 3.4 

404097 NA C5000242*:gi|9369379|gb|AAF87128.1|AC006 3.4 

434205 AF119861 Hs.283032 hypothetical protein PRO2015 3.4 

421072 AI215069 Hs.89113 ESTs 3.4 

402421 NA C1001578*:gi|6759903|gb|AAF28099.1|(AF1 3.4 

405248 NA Target Exon 3.4 

407638 AJ404672 Hs534483 hypothetical protein FLJ23571 3.4 

403000 BE247275 Hs.151787 U5 snRNP-specific protein, 116 kD 3.4 

433393 AF038564 Hs.98074 itchy (mouse homolog) E3 ubiquitin prote 3.4 

432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 3.4 

458747 BE618395 Hs.257391 hypothetical protein DKFZp761J1523 3.4 

442082 R41823 Hs.7413 ESTs; calsyntenin-2 3.4 

417974 AA210765 gb:zr90c06.r1 NCI_CGAP_GCB1 Homo sapiens 3.4 

446002 AI346468 Hs.145789 ESTs 3.4 

448995 AI613276 Hs.5662 guanine nucleotide binding protein (G pr 3.4 

436007 AI247716 Hs.232168 ESTs 3.4 

424698 AA164366 Hs.151973 hypothetical protein FLJ23511 3.4 

435202 AI971313 Hs.170204 KIAA0551 protein 3.4 

410467 AF102546 Hs.63931 dachshund (Drosophila) homolog 3.3 

405460 NA Target Exon 3.3 

441826 AW503603 Hs.129915 phosphotriesterase related 3.3 

453472 AL037925 gb:DKFZp564M037_r1 564 (synonym: htbr2) 3.3 

447078 AW885727 Hs.301570 ESTs 3.3 

441690 R81733 Hs.33106 ESTs 3.3 

420092 AA814043 Hs.88045 ESTs 3.3 

418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.3 

408908 BE296227 Hs.250822 serine/threonine kinase 15 3.3 

414737 AI160386 Hs.125087 ESTs 3.3 

449650 AF055575 Hs.23838 calcium channel, voltage-dependent L ty 3.3 

418912 NM_000685 Hs.89472 angiotensin receptor 1 3.3 

436405 AA160079 Hs.172932 Homo sapiens mRNA for partial 3UTR, seq 3.3 

453911 AW503857 Hs.4007 sarcolemma associated protein 3.3 

409361 NM_005982 Hs.54416 sine oculis homeobox (Drosophila) homolo 3.3 

429548 AW138872 Hs.135288 ESTs 3.3 

420807 AA280627 Hs.57846 ESTs 3.3 

409695 AA296961 gb:EST112514 Adrenal gland tumor Homo sa 3.3 

445189 AI936450 Hs.147482 ESTs 3.3 

402892 NA Target Exon 3.3 

426681 AA994896 Hs.22514 ESTs 3.3 

458722 AA741545 Hs.282832 ESTs, Weakly similar to T24961 hypotheti 3.3 

409430 R21945 Hs.166975 splicing factor, arginine/serine-rich 5 3.3 

443194 AI954968 Hs.279009 matrix Gla protein 3.3 

445432 AV653771 gb:AV653771 GLC Homo sapiens cDNA clone 3.3 

410908 AA121686 Hs.10592 ESTs 3.3 

406151 NA Target Exon 3.3 

436461 AW511956 Hs.293261 ESTs 3.3 

411171 AW820260 gb:QV2-ST0296-150200-040-c10 ST0296 Homo 3.3 

432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 3.3 

439310 AF086120 Hs.102793 ESTs 3.3 

401575 NA Target Exon 3.3 

420900 AL045633 Hs.44269 ESTs 3.3 

445628 AI344166 Hs.155743 ESTs 3.3 

448243 AW369771 Hs.52620 integrin, beta 8 3.3 

445102 AW204610 Hs.22270 ESTs 3.3 

442118 AA976718 Hs.202242 ESTs 3.3 

416783 AA206186 Hs.79889 monocyte to macrophage differentiation-a 3.3 

435039 AW043921 Hs.130526 ESTs 3.3 

451474 T70874 Hs.207636 ESTs 3.2 

442559 T10213 Hs.159993 glycosyltransferase 3.2 

453921 AI824009 Hs.44577 ESTs 3.2 

420036 R60336 Hs.52792 Homo sapiens mRNA; cDNA DKFZp586M 823 (f 3.2 

435627 W88774 Hs.118370 ESTs 3.2 

411598 BE336654 Hs.70937 H3 histone family, member A 3.2 

446733 AA863360 Hs.26040 ESTs, Weakly similar to fatty acid omega 3.2 

410153 BE311926 Hs.15830 hypothetical protein FLJ12691 3.2 

403637 NA C3001106*:gi|10047201|dbj|BAB13394.1| (A 3.2 

405547 NM_018833*:Homo sapiens transporter 2, A 3.2 

427878 C05766 Hs.161022 CGI-07 protein 3.2 

451871 AI821005 Hs.118599 ESTs 3.2 

410313 R10305 Hs.185683 ESTs 3.2 

416856 N27833 Hs.269028 ESTs, Weakly similar to I38022 hypotheti 3.2 

449490 AI652777 Hs.197069 ESTs 3.2 

450506 NM_004460 Hs.418 fibroblast activation protein, alpha 3.2 

440684 AI253123 Hs.127356 ESTs, Highly similar to S21424 nestin [H 3.2 

459055 N23235 Hs.30567 ESTs, Weakly similar to B34087 hypotheti 3.2 

452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 3.2 

430965 AA489732 Hs.154918 ESTs 3.2 

405394 Target Exon 3.2 

424693 BE169810 Hs.47557 ESTs 3.2 

454265 H03556 Hs.300949 ESTs, Weakly similar to thyroid hormone 3.2 

437687 AA765917 Hs.122840 ESTs 3.2 

428372 AK000684 Hs.183887 hypothetical protein FLJ22104 3.2 

414083 AL121282 Hs.257786 ESTs 3.2 

411670 AW856552 gb:RC1-CT0294-080100-012-a04 CT0294 Homo 3.2 

416283 NM_005429 Hs.79141 vascular endothelial growth factor C 3.2 

437488 AA758239 Hs.180330 ESTs 3.2 

428398 AI249368 Hs.98558 ESTs 3.2 

452042 H38857 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HE 3.2 

421477 AI904743 Hs.104650 hypothetical protein FLJ10292 3.2 

438078 AI016377 Hs.131693 ESTs 3.2 

448816 AB033052 Hs.22151 KIAA1226 protein 3.2 

419519 AI198719 Hs.176376 ESTs 3.2 

404580 NM_014112*:Homo sapiens trichorhinophala 3.2 

447046 AA326187 Hs.17170 G protein-coupled receptor 4 3.2 

457473 AW974903 Hs.291231 ESTs 3.1 

429838 AW904907 Hs.?0732 hypothetical protein FLJ13409; KIAA1711 3.1 

459702 AI204995 gb:an03c03.x1 Stratagene schizo brain S1 3.1 

400195 NA NM_007057*:Homo sapiens ZW10 interactor 3.1 

417860 AW408557 Hs.235498 hypothetical protein FLJ14075 3.1 

417995 AW974175 Hs.188751 ESTs, Weakly similar to MAPB_HUMAN MICRO 3.1 

422589 AA312735 Hs.30512 Homo sapiens mRNA for KIAA0556 protein, 3.1 

435870 AA701327 Hs.17949 ESTs 3.1 

440801 AA906366 Hs.190535 ESTs 3.1 

426274 D38122 Hs.2007 tumor necrosis factor (ligand) superfami 3.1 

423728 AW891294 Hs.132136 solute carrier family 4, sodium bicarbon 3.1 

439677 R82331 Hs.164599 ESTs 3.1 

452834 AI638627 Hs.105685 KIAA1688 protein 3.1 

431349 AA503653 Hs.156942 ESTs, Moderately similar to ALU2_HUMAN A 3.1 

417576 AA339449 Hs.82285 phosphoribosylglycinamide formyltransfer 3.1 

430264 AA470519 gb:nc71f10.s1 NCI_CGAP_Pr1 Homo sapiens 3.1 

418827 BE327311 Hs.47166 HT021 3.1 

410835 AW806906 gb:QV4-ST0023-160400-172-d12 ST0023 Homo 3.1 

426269 H15302 Hs.168950 Homo sapiens mRNA; cDNA DKFZp566A1046 (f 3.1 

405336 NA Target Exon 3.1 

437783 AI683150 Hs.201550 ESTs, Weakly similar to ALU1_HUMAN ALU S 3.1 

440931 AI583052 Hs.270058 ESTs 3.1 

455945 BE160636 gb:PM1-HT0422-291299-002-c08 HT0422 Homo 3.1 

430437 AI768801 Hs.169943 Homo sapiens cDNA FLJ13569 fis, clone PL 3.1 

405848 NA Target Exon 3.1 

455685 BE066976 gb:PM0-BT0340-211299-003^2 BT0340 Homo 3.1 

406970 M29994 gb:Human alphas spectrin gene, exon 12. 3.1 

409602 W26713 Hs.256972 ESTs 3.1 

423518 D45027 Hs.129732 R3H domain (binds single-stranded nuclei 3.1 

425653 AI065104 Hs.249718 ESTs, Weakly similar to A46010 X-linked 3.1 

426326 BE165753 Hs.250528 Homo sapiens, clone IMAGE:4098694, mRNA, 3.1 

433805 AA706910 Hs.112742 ESTs 3.1 

437152 AL050027 gb:Homo sapiens mRNA; cDNA DKFZp566C0324 3.1 

448602 AI541305 Hs.48778 niban protein 3.1 

452844 AW407181 Hs.218377 Homo sapiens cDNA FLJ11927 fis, clone HE 3.1 

407366 AF026942 gb:Homo sapiens cig33 mRNA, partial sequ 3.1 

408254 AW807227 gb:MR4-ST0062-180200-001-e10 ST0062 Homo 3.1 

424085 NM_002914 Hs.139226 replication factor C (activator 1) 2 (40 3.1 

416790 R83066 Hs.7043 succinate-CoA ligase, GDP-forming, alpha 3.1 

420020 BE295866 Hs.94362 adenosine kinase 3.1 

426119 W94997 Hs.189917 ESTs 3.1 

426968 U07616 Hs.173034 amphiphysin (Stiff-Mann syndrome with br 3.1 

457421 AL117431 Hs.112165 Homo sapiens cDNA FLJ12198 fis, clone MA 3.1 

453403 BE466639 Hs.61779 Homo sapiens cDNA FLJ13591 fis, clone PL 3.1 

454141 AW138413 Hs.139336 ATP-binding cassette, sub-family C (CFTR 3.1 

426650 AA382814 gb:EST96097 Testis I Homo sapiens cDNA 5 3.1 

450865 AI248013 Hs.106532 ESTs, Weakly similar to I38588 reverse t 3.1 

407993 AW135274 Hs.12433 ESTs 3.1 

446466 H38026 Hs.308 arrestin 3, retinal (X-arrestin) 3.1 

457886 BE219794 Hs.293471 ESTs 3.1 

420058 AK001423 Hs.94694 Homo sapiens cDNA FLJ10561 fis, clone NT 3.0 

409248 AB033035 Hs.51965 KIAA1209 protein 3.0 

452747 BE153855 Hs.61460 Ig superfamily receptor LNIR 3.0 

418926 AA232658 Hs.105794 UDP-glucose:glycoprotein glucosyltransfe 3.0 

419346 AI830417 Hs.44143 polybromo 1 3.0 

429826 N93266 Hs.40747 ESTs 3.0 

435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.0 

420139 NM_005357 Hs.95351 lipase, hormone-sensitive 3.0 

405609 NA ENSP00000241065*:CDNA 3.0 

404274 NM_002944*:Homo sapiens v-ros avian UR2 3.0 

449777 AI971362 Hs.231945 ESTs 3.0 

415459 H07118 Hs.6099 ESTs 3.0 

415245 N59650 Hs.27252 ESTs 3.0 

406291 NA Target Exon 3.0 

414210 BE383592 gb:601297871F1 NIH_MGC_19 Homo sapiens c 3.0 

432055 AW972359 Hs.293334 ESTs 3.0 

442246 AI791988 Hs.129115 ESTs 3.0 

451353 N21043 Hs.42932 ESTs 3.0 

451177 AI969716 Hs.13034 ESTs 3.0 

416026 BE379727 Hs.83213 fatty acid binding protein 4, adipocyte 3.0 

401326 NA C10000447*:gi|1168375|sp|P43467|AGA1_PED 3.0 

409920 BE169746 Hs.12504 likely ortholog of mouse Arkadia 3.0 

432887 AI926047 Hs.162859 ESTs 3.0 

411789 AF245505 Hs.72157 DKFZP564I1922 protein 3.0 

401045 C11001883*:gi|6753278|ref|NP_033938.1| c 3.0 

432162 AA584062 Hs.272798 hypothetical protein FLJ20413 3.0 

434627 AI221894 Hs.39311 ESTs 3.0 

442611 BE077155 Hs.177537 hypothetical protein DKFZp761B1514 3.0 

425477 AW958879 Hs.270535 ESTs 3.0 

457183 H91882 Hs.118569 Dvl-binding protein IDAX (inhibition of 3.0 

433014 NM_014711 Hs.279912 KIAA0419 gene product 3.0 

415542 R13474 Hs.290263 ESTs, Weakly similar to I38022 hypotheti 3.0 

416173 R52782 gb:yg99d09.r1 Soares infant brain 1NIB H 3.0 

408155 AB014528 Hs.43133 KIAA0628 gene product 3.0 

453913 AW004683 Hs.78934 mutS (E. coli) homolog 2 (colon cancer, 3.0 

435495 AI754212 Hs.21951 Homo sapiens Xq pseudoautosomal region; 3.0 

423629 AW021173 Hs.18612 Homo sapiens cDNA: FLJ21909 fis, clone H 3.0 

411836 AW901879 Hs.314453 ESTs 3.0 

415030 D31118 Hs.191735 hypothetical protein MGC10520 3.0 

419606 AW294795 Hs.198529 ESTs 3.0 

440310 AA878939 Hs.125406 ESTs 3.0 

443608 AI375957 Hs.289074 F-box only protein 22 3.0 

414140 AA281279 Hs.23317 hypothetical protein FLJ14681 3.0 

444781 NM_014400 Hs.11950 GPI-anchored metastasis-associated prote 3.0 

435393 AA701259 Hs.189299 ESTs 3.0 

454071 AI041793 Hs.42502 ESTs 3.0 

446922 BE175605 gb:RCWfT0580-100500-022-H07 HT0580 Homo 3.0 

448062 AW295923 Hs.255472 KIAA1843 protein 3.0 

427711 M31659 Hs.180408 solute carrier family 25 (mitochondrial 3.0 

450382 AA397658 Hs.60257 Homo sapiens cDNA FLJ13598 fis, clone PL 3.0 

424866 W01938 Hs.337243 ESTs, Weakly similar to ALU7_HUMAN ALU S 2.9 

433043 W57554 Hs.125019 lymphoid nuclear protein (LAF-4) mRNA 2.9 

420802 U22376 Hs.1334 v-myb avian myeloblastosis viral oncogen 2.9 

445625 BE246743 Hs.288529 hypothetical protein FLJ22635 2.9 

403677 NA C4001462:gi|4887715|gb|AAA79329.2| (L088 2.9 

411093 BE067650 gb:MR4-BT0358-090300-003-e01 BT0358 Homo 2.9 

435255 W87434 Hs.106015 ESTs, Moderately similar to ALU1_HUMAN A 2.9 

443127 BE568102 Hs.180312 mitochondrial ribosomal protein S16 2.9 

448104 AI674818 Hs.316433 Homo sapiens cDNA FLJ11375 fis, clone HE 2.9 

427315 AA179949 Hs.175563 Homo sapiens mRNA; cDNA DKFZp564N0763 (f 2.9 

430414 AW365665 Hs.120388 ESTs 2.9 

423600 AI633559 Hs.310359 ESTs 2.9 

458562 N34128 Hs.145268 ESTs 2.9 

402109 NA Target Exon 2.9 

429629 BE501732 Hs.30622 Homo sapiens cDNA FLJ13010 fis, clone NT 2.9 

442295 
419752 
404721 
445107 
401987 
430566 
444517 
445563 
427691 
456561 
401458 
421039 
459504 
424962 
409617 
416931 
413221 
409732 
433687 
434340 
454529 
421379 
452291 
457402 
449051 
408761 
401093 
435061 
447985 
449340 
426384 
411905 
405953 
420854 
434265 
428365 
442861 
448337 
452554 
412248 
449450 
434757 
409038 
454545 
439842 
428479 
424800 
411086 
400250 
449168 
456482 
426044 
431854 
405873 
440400 
458265 
413708 
423739 
424408 
453095 
421825 
417742 
402765 
444378 
419172 
401497 
402376 



AI827248 Hs.224398 Homo sapiens cONA FU11469 fis, clone HE 2.9 
AA249573 Hs.152618 ESTs, Moderately similar to ZN91.HUMAN Z 2.9 

NM_CQ5596*:Homo sapiens nuclear factor I 2.9 
ESTs, Weakly similar to I38022 hypotheti 29 
NMJ)02737*:Homo sapiens protein kinase C 2.9 
Hs.190149 ESTs £9 
Hs.146883 ESTs 2.9 
Hs.149006 ESTs . .2.9 
Hs.20726 ESTs 2.9 
Hs.246358 ESTs, Weakly similar to T32250 hypotheti 29 
Target Exon 2.9 
Hs.101299 cullin 5 • 2.9 
gb:601315974F1 NIHJAGC J Homo sapiens cD 2.9 
Hs.153954 TRAM-like protein 2.9 
Hs.55209 Homo sapiens mRNA; cONA DKF2p434K0514 (f 2.9 
Hs.80485 adipose most abundant gene transcript 1 2.9 
gb:PMO-HT0425-141299-001-F08 HT0425 Homo2.9 
Hs.56148 NY-REN-58 antigen 2.9 
gb:ny57g01.s1 NCLCGAPJ>r1 8 Homo sapiens 2.9 
Hs.128685 ESTs, WeaWy similar to T17226 hypotheti 2.9 
Hs.270425 ESTs 2.9 
Hs.103982 small inducible cytokine subfamily B (Cy 2.9 
Hs.28853 CDC7(ceH division cycle 7, S.cerevisl 29 
Hs.149342 activation-Induced cytidine deaminase 2.9 
Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 2.9 
Hs.238936 ESTs, WeaWy similar to (define not ava 2.9 
C12000586*:gl|6330167|dbj|BAA86477.1| (A 29 
Hs.163944 ESTs 2.9 
1^3 200949 ESTs 2.9 
Hs!l95359 hypothetical protein MGC10954 2.9 
Hs 303662 ESTs 29 
gb:601 193893F1 NIHJAGCJ Homo sapiens cO 29 
Target Exon 2.8 
gb:UI-H-BWT>aJw>07-(KJI.s1 NCI_CGAP_Su 28 
Hs.130554 Homo sapiens cONA: FU23089 fis, clone L 28 
Hs.183861 Homo sapiens cDNA FU20042 fis, done CO 2.8 
Hs.57787 ESTs 28 
Hs.3782 ESTs 2.8 
Hs.58006 ESTs, Weakly similar to ALU5 JiUMAN ALU S 28 
gb:RC3-HT0585-16030(W)22-c02 HT0585 Homo 28 
Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 28 
Hs.132921 ESTs 2.8 
Hs.50002 small inducible cytokine subfamily A (Cy 28 
gb:QV4^T0023-160400-172-c12 ST0023 Homo 2.8 
Hs.132413 ESTs 2.8 
Hs. 184572 cell division cycle 2, G1 to S and G2 to . 28 
Hs.153203 MyoD family Inhibitor 28 
gbBC3-BT0502-25119M11-c07 BT0502 Homo 28 
Eos Control 2.6 
Hs.23142 colon carcinoma related protein - 28 
gbaa41b12.s1 NCLCGAPJ3CB1 Homosapiens2.8 
Hs.336695 ESTs 28 
Hs.271699 polymerase (DNA directed) lota 28 
Target Exon 28 
Hs.1 25594 ESTs, Weakly similar to T25472 hypotheB 28 
Hs.1 28193 ESTs, Weakly similar to IRX2.HUMAN IROQU 28 
gb:IL2-HT0397-091299-025MD02 HT0397 Homo 28 
Hs.97600 ESTs . 28 

Hs.146428 collagen, type V, alpha 1 28 
Hs.1 1325 ESTs 28 
Hs.183747 ESTs, Moderately similar to CALB_HUMAN C 2.8 
gb:EST22d11 WATM1 Homo sapiens cONA clon 28 
C1003621*:gi)12407405|gb)AAG53491.1|AF22 28 
Hs.12569 ESTs 28 
Hs.22120 ESTs . 2.8 
Target Exon 2.8 
C19000763*:gi|1363912lplr||JC4296 ring f 2.8 
AA481282

AI939339 

AW873606 

AW194426 

AI868634 

NWL003478 

BE514127 

NfVL012288. 

BE003760 

D45371 

BE161151 

NWL016122 

AA743991 

AI193043 

245439 

Y15221 

AF015592 

AW452648 

AW961400 

AA057264 

AI651474 

AI681475 

AW235786 

AI472078 

BE265067 

NA 

AW296927 

AA846811 

AA295331 

AA243837 

AW206453 

AW452434 

BE176480 

AL039852 

AI038997 

T97490 . 

AW806899 

AI910896 

Y00272 

AL035588 

BE070800 

NA 

NMJH6206 

AA485224 

AA502490 

AA383550 

NA 

AA994364 

AI075375 

BE158791 

AA398155 

AI754813 

AW294631 

AA298758 

R64719 

R41339 . 
AW338625 
405041 NA C3001706*:gi|1345652|sp|P15989|CA36_CHIC 2.8
40875B NM_003686 Hs.47504 exonuclease 1 2.8
431917 D16181 Hs.2868 peripheral myelin protein 2 2.8
437583 AA761190 Hs.244627 ESTs 2.8
453737 AA744862 Hs.194293 ESTs, Weakly similar to I54374 gene NF2 2.8
458094 AF086325 gb:Homo sapiens full length insert cDNA 2.8
401283 NA Target Exon 2.8
410784 AW803201 gb:IL2-UM0077-070500^)80^06 UM0077 Homo 2.8
417601 NM_014735 Hs.82292 KIAA0215 gene product 2.8
418236 AW994005 Hs.337534 ESTs 2.8
435532 AW291488 Hs.117305 Homo sapiens, clone IMAGE:3682908, mRNA 2.8
454714 AW815098 gb:GV4-ST0212-091199-023-f10 ST0212 Homo 2.8
418629 BE247550 Hs.86859 growth factor receptor-bound protein 7 2.8
442101 AI651930 Hs.135684 ESTs 2.8
405080 AK000375 Hs.88820 HDCMC28P protein 2.8
414661 T97401 Hs.21929 ESTs 2.8
425589 AI650633 Hs.143688 Homo sapiens cDNA: FLJ23031 fis, clone L 2.8
429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 2.7
428824 W23624 Hs.173059 ESTs 2.7
421566 NM_000399 Hs.1395 early growth response 2 (Krox-20 (Drosop 2.7
414596 BE386870 gb:601275271F1 NIH_MGC_20 Homo sapiens c 2.7
440868 R79707 Hs.263339 ESTs, Moderately similar to I38022 hypot 2.7
452943 BE247449 Hs.31082 hypothetical protein FLJ10525 2.7
443772 AV646449 Hs.282872 ESTs 2.7
432361 AI378562 Hs.159585 ESTs 2.7
430375 AW371048 Hs.93758 H4 histone family, member H 2.7
406504 NA C5000558:gi|4504675|ref|NP_002175.1|lnt 2.7
423279 AW959861 Hs.290943 ESTs 2.7
424871 NM_004525 Hs.153595 low density lipoprotein-related protein 2.7
453619 H87648 Hs.33922 Homo sapiens, clone MGC5084, mRNA, comp 2.7
423961 D13666 Hs.136348 osteoblast specific factor 2 (fasciclin 2.7
422156 N34524 gb:yy56d10.s1 Soares_multiple_sclerosis_ 2.7
415752 BE314524 Hs.78776 putative transmembrane protein 2.7
419987 NM_005014 Hs.94070 osteomodulin 2.7
406182 NA Target Exon 2.7
416495 X69970 Hs.79350 RYK receptor-like tyrosine kinase 2.7
444701 AI916512 Hs.198394 ESTs 2.7
408171 AA301228 Hs.43299 hypothetical protein FLJ12890 2.7
430153 AW968128 Hs.336679 ESTs 2.7
413383 AA128978 Hs.154706 hypothetical protein FLJ14917 2.7
414831 M31158 Hs.77439 protein kinase, cAMP-dependent regulato 2.7
413276 BE563085 Hs.833 interferon-stimulated protein, 15 kDa 2.7
433132 AB026264 Hs.284245 hypothetical protein IMPACT 2.7
437030 AA742577 Hs.303781 EST 2.7
439031 AF075079 gb:Homo sapiens full length insert cDNA 2.7
449532 W74653 Hs.271593 ESTs, Moderately similar to A47582 B-cel 2.7
406153 Target Exon 2.7
406625 Y13647 Hs.119597 stearoyl-CoA desaturase (delta-9-desatur 2.7
444698 AI188139 Hs.147050 ESTs 2.7
432328 AI572739 Hs.195471 6-phosphofructo-2-kinase/fructose-2,6-bi 2.7
429628 H09604 Hs.13268 ESTs 2.7
420149 AA255920 Hs.88095 ESTs 2.7
431207 AA495925 Hs.9394 ESTs 2.7
438394 BE379623 Hs.27693 peptidylprolyl isomerase (cydophiRnH 2.7
443304 AI050073 Hs.135338 ESTs 2.7
427660 AI741320 Hs.114121 Homo sapiens cDNA: FLJ23228 fis, clone C 2.7
408460 AA054726 Hs.285574 ESTs 2.7
416515 N91716 Hs.194140 ESTs, Weakly similar to I38022 hypotheti 2.7
429922 Z97630 Hs.226117 H1 histone family, member 0 2.7
418203 X54942 Hs.83758 CDC28 protein kinase 2 2.7
439509 AF086332 Hs.58314 ESTs 2.7
402184 NA ENSP00000245238*:CDNA FLJ10922 fis, clon 2.7
450496 AW449251 Hs.257131 ESTs 2.7
451963 AI825440 Hs.224952 ESTs 2.7
457938 AI373638 Hs.133900 ESTs 2.7
441541 AA938663 Hs.199828 ESTs 2.7
441111 AI806867 Hs.126594 ESTs 2.7
423020 AA283092 Hs.1608 replication protein A3 (14kD) 2.7
445354 AV653485 Hs.6390 Homo sapiens clone FLB3344 PRO0845 mRNA, 2.7
427961 AW293165 Hs.143134 ESTs 2.7
410869 X91662 Hs.66744 twist (Drosophila) homolog (acrocephalos 2.7
445234 AW137636 Hs.146059 ESTs 2.7
413903 AA496493 Hs.23136 ESTs 2.7
408089 NA Target Exon 2.7
447410 AI470235 Hs.172698 EST 2.7
401256 NA NM_024089*:Homo sapiens hypothetical pro 2.7
415139 AW975942 Hs.48524 ESTs 2.7
420218 AW958037 Hs.286 ribosomal protein L4 2.7
455511 BE144762 gb:CMC^HT0180^1099^65-D04 HT0180 Homo 2.7
438825 BE327427 Hs.79953 ESTs 2.6
409564 AA045857 Hs.54943 fracture callus 1 (rat) homolog 2.6
452837 AL121053 Hs.5534 Homo sapiens cDNA FLJ12961 fis, clone NT 2.6
434876 AF160477 Hs.61460 Ig superfamily receptor LNIR 2.6
421565 AK001122 Hs.105859 hypothetical protein FLJ10260 2.6
453279 AW893940 Hs.59698 ESTs 2.6
430785 Z30201 gb:HHEA22G Atrium cDNA library Human hea 2.6
456386 D38299 Hs.170917 prostaglandin E receptor 3 (subtype EP3) 2.6
433068 NM_006456 Hs.208215 sialyltransferase 2.6
421952 AA300900 Hs.98849 ESTs, Moderately similar to AF16151 1 1 H 2.6
429208 AA447990 Hs.190478 ESTs 2.6
430733 AW975920 Hs.283361 ESTs 2.6
441720 AI346487 Hs.28739 ESTs 2.6
418986 AI123555 Hs.81796 ESTs 2.6
432481 AW451645 Hs.151504 Homo sapiens cDNA FLJ11973 fis, clone HE 2.6
434338 AW754311 gb:CM1-CT0337-141299-068-ID7 CT0337 Homo 2.6
417061 AI675944 Hs.188691 Homo sapiens cDNA FLJ12033 fis, clone HE 2.6
410530 M25809 Hs.64173 ATPase, H+ transporting, lysosomal (vacuo 2.6
456672 AK002016 Hs.114727 Homo sapiens, clone MGC:16327, mRNA, com 2.6
425071 NM_013989 Hs.154424 deiodinase, iodothyronine, type II 2.6
408868 AW292286 Hs.255058 ESTs 2.6
451531 AA018311 Hs.114762 ESTs 2.6
405822 Target Exon 2.6
418301 AW976201 Hs.53913 hypothetical protein FLJ10252 2.6
417315 AI080042 Hs.336901 ribosomal protein S24 2.6
434699 AA643687 Hs.149425 Homo sapiens cDNA FLJ11980 fis, clone HE 2.6
443204 AW205878 Hs.29643 Homo sapiens cDNA FLJ13103 fis, clone NT 2.6
405638 Target Exon 2.6
452542 AW812256 gb:RC0-ST0174-19109*O31-a07 ST0174 Homo 2.6
403943 C5000355:gi|4503225|ref|NP_000765.1| cyt 2.6
404535 Z25884 Hs.121483 chloride channel 1, skeletal muscle (Th 2.6
402800 NA Target Exon 2.6
449144 AI989503 Hs.233405 ESTs 2.6
454934 AW846080 Hs.314324 ESTs 2.6
424717 H03754 Hs.152213 wingless-type MMTV integration site fami 2.6
428303 AW974476 Hs.183601 regulator of G-protein signalling 16 2.6
427970 AA418187 Hs.330515 ESTs 2.6
450638 AK001826 Hs.25245 hypothetical protein FLJ11269 2.6
453034 BE246010 Hs.271468 Homo sapiens mRNA for FLJ00038 protein, 2.6
455097 AW855802 gb:RC1-CT0279-170200423-d08 CT0279 Homo 2.6
427317 AB028955 Hs.175780 KIAA1032 protein 2.6
408875 NM_015434 Hs.48604 DKFZP434B168 protein 2.6
427510 Z47542 Hs.179312 small nuclear RNA activating complex, po 2.6
423201 NM_000163 Hs.125180 growth hormone receptor 2.6
406271 Target Exon 2.6
442696 BE566962 Hs.7063 Homo sapiens cDNA: FLJ20913 fis, clone A 2.6
454018 AW016892 Hs.100855 ESTs 2.6
435420 AI928513 Hs.59203 ESTs 2.6
434398 AA121098 Hs.3838 serum-inducible kinase 2.6
455708 BE069326 gb:QV3-BT0381-170100-060-g03 BT0381 Homo 2.6
439347 W24320 Hs.102941 Homo sapiens cDNA: FLJ21531 fis, clone C 2.6
407523 X64984 gb:H.sapiens mRNA HTPCRX10 for olfactory 2.6
425101 AA830431 Hs.180811 ESTs 2.6
435153 AA668763 Hs.291939 ESTs 2.6
409139 AI681917 Hs.3321 ESTs, Highly similar to IRX1_HUMAN IROQU 2.6
455100
414612 
440283 
423025 
431473 
404440 
403388 
403775 
405037 
407447 
420952 
435447 
440202 
445854 
421247 
414870 
457411 
424676 
404443 
452268 
430832 
444779 
408633 
459089 
447197 
454111 
411465 
406922 
456045 
413111 
423123 
432201 
427032 
445417 
422225 
428330 
410011 
426310 
419589 
437770 
442818 
414251 
428301 
431956 
455732 
458624 
428257 
418054 
458652 
458012 
422996 
410804 
457211 
440029 
448141 
409163 
431385 
414569 
439963 
455935 
425025 
416589 
404826 
422938 
421991 
417404 
448516 



BE160198 
BE274552 
AI732892 
M831267 
AA825686 ' 

NA 
NA 
NA 

AF290544 

AA282067 

AI872932 

AW516211 

AI702885 

BE391727 

N72264 

AW085961 

Y08565 ' 

NMJJ03512 

AI073913 ' 

AH92105 

AW963372 

F13036 

R36075 

AW081681 

NWL000169 

S70284 

H62943 

BE065837 

{WL012247 

AI538613 

AF012023 

AK001058 

BE245652 

L22524 ■ 

AB02Q641 

NNL000909 

AW973708 

AA767881 

AK001741 

AL042306 

AW628666 

AK002032 

BEO80908 

AI362790 

BE394723 

NM.002318 

AW375610 

AI424899 

BE091089 

U64820 

AW972565 

AW089705 

AM71598 

AA065081 

BE178536 

AF109298 

AW247529 

BE1 58687 

AW953168 

AA652687 

NMJJ01809 
NWL014918 
NJVL007350 
AW898595 



gb:QVl4fT0413-O1020(K)59-h03 HT0413 Homo 2.6 

Hs.76578 protein inhibitor of activated STAT3 2.6 

Hs.190489 ESTs 2.6 

Hs.12244 hypothetical protein FU20097 2.6 

Hs.321176 ESTs, Weakly similar to S65824 reverse t 2.6. 

. NM_021048:Homo sapiens melanoma antigen, 2.6 

C300139B*:gi|12248917|dbj|BAB20375.1| (A 2.6 

TaigetExon 26 

NM_021628*:Homo sapiens arachidonate lip 2.6 

gb:Homo sapiens aminopeptidase mRNA, par 2.6 

Hs.88972 ESTs, Moderately similar to A46010 X-8n 2.6 

gb:wm72e03.x1 NCI_CGAP_Ut2 Homo sapiens 2.6 

Hs.1 25300 ring finger protein 21, Interferon-respo 2.6 

Hs.145568 ESTs 26 

Hs.102910 general transcription factor IIH, polype 2.6 

Hs.300670 KIAA1204 protein 2.6 

Hs.130093 ESTs 2.6 

Hs. 151 678 UDP^-acetyJ-alpha-D-galactoam&rcpolyp 2.6 

C8001428*:gi|6572242iemb|CAB62951.1| (Z9 2.6 

Hs.28777 H2Ahistone family, member L 2.6 

Hs.100686 ESTs, Weakly similar to JE0350 Anterior 2.6 

Hsii47170 ESTs 2.6 

Hs.46677 PRO2000 protein 2.6 
Hs.27373 Homo sapiens mRNA; cONADKFZp56401763(f 2.6 

gb:yh88b01 .s1 Soares placenta Nb2HP Homo 2.6 

Hs.269064 ESTs, Weakly similar to T42689 hypothetl 2.6 

Hs.69089 galactoskJase, alpha 2.6 

gbstearoyl-CoA desaturase [human, adipo 2.6 

Hs.154188 ESTs 2.6 

gb:RC2-BT0318-1 1010O012-g12 BT0318 Homo 2.6 

Hs.124027 SELENOPHOSPHATE SYNTHETASE ; Human selen2.6 

Hs.298241 Transmembrane protease, senna 3 2.5 

Hs.173274 integrin cytoplasmic domain-associated p . 2.5 

Hs.12680 Homo sapiens cDNA FU 10196 lis. clone HE 2.5 

Hs. 118281 zinc finger protein 266 2.5 

Hs.2256 matrix metalloprotelnase 7 (MMP7; uterin 2.5 

Hs.57856 PFTAIRE protein kinase 1 2.5 

Hs.169266 neuropepfideY receptor Y1 2.5 

Hs.201925 Homo sapiens cDNA FU13446 fis, clone PL 25 

Hs.122897 ESTs 2.5 

Hs.8739 hypothetical protein FU10879 25 

Hs.97689 VASA protein 2.5 

Hs.98440 ESTs, Weakly similar to 138022 hypotheti 2.5 

Hs.272245 Homo sapiens CDNAFU1 1170 fis, clone PL 2.5 

gb:QV1-BT0631-28020M84-h07BT0631 Homo 2.5 

Hs.278639 KIAA1684 protein; likely homolog of mous 2.5 

Hs.275243 S100 calcium-binding protein A6 (calcycl . 2.5 

Hs.63354 lysyf oxfdase-fike 2 2.5 

Hs.117102 hypothetical protein FU13046 similar to 25 

Hs.188211 ESTs 2.5 

gb:PM4-BT0724-13(MOO<00&<*)7 BT0724 Homo 2.5 

Hs.66521 Machado^seph disease (spinocerebellar 25 

Hs.32399 ESTs, WeaWy similar to S51797 vasodilat 2.5 

Hs.293711 ESTs, Weakly similar to S64329 probable 25 

Hs.197531 ESTs 2.5 

gb:zm13a03.3l Stratagene pancreas (93720 2.5 

Hs.1 1090 membrane-spanning 4-domains, subfamily A 2.5 

Hs.1 18258 prostate cancer associated protein 1 . 2.5 

Hs.6793 platelet-activating factor acetythydrola 2.5 

gb:CM0-HT0395-280100-169-b09 HT0395 Homo2.5 

Hs.12407 ESTs 2.5 

Hs.96151 Human DNA sequence from clone RP5-1 103G7 2.5 

TaigetExon 2.5 

Hs.1 594 centromere protein A (17kD) 2.5 

Hs.1 10488 KIAA0990 protein 25 

Hs.82101 pteckstrfn homology-Jike domain, family 25 

gb:RC1-NN0073-26040(W)11-g09 NN0073 Homo25 
403356
404^83 
418282 
427409 
431806 
443367 
421246 
439217 
400925 
404552 
417168 
418841 
426853 
427738 
457384 
447128 
454693 
434657 
402077 
400289 
.409723 
447020 
455068 
431232 
408938 
411571 
426504 
428248 
408813 
423504 
425441 
443G66 
443556 
428943 
425320 
430388 
423242 
416241 
440244 
409239 
452464 
410718 
408877 
445150 
407756 
407633 
449754 
419316 
429118 
440331 
449344 
459006 
423165 
411337 
438290 
406414 
424498 
443464 
424856 
440304 
409045 
422648 
426819 
412520 
430602 
408031 
403133 



NA 

AA215535 
AW467143 
AF186114 
AW071349 
AW582962 
AF086041 

NA 

AL133117 

NNL002332 

U32974 

NM_000318 

AA501760 

AI271898 

AW813428 

AA641876 

NA 

X07820 

AW885757 

T27308 

A1807894 

AI024353 

AA059013 

AA122393 

AW162919 

A1126772 

AI58Q090 

N80077 

AA449644 

AW297921 

AA256769 

AW086180 

U29344 

AA356923 

ALO39402 

N52639 . 

AI743977 

AA740875 

AW500507 

A1920783 

AA479033 

AI446747 

AA1 16021 

NWL007069 

H00620 

AA236255 

H20669 

AL046412. 

AI640355 

AW298631 

AI937547 

AW837349 

AA843719 

AB033043 

BE548446 

AA347746 

BE159984 

AA635062 

D86983 

AL135623 

AA442324 

D13752 

AA081395 



ENSP000D0251525*:Hypothetical protein Kl 2.5 
ENSP00000252242*:Keratin l type II cytosk 2.5 
Hs.98133 ESTs . 2.5 
Hs.135411 actln related protein 2.5 
Hs.270737 tumor necrosis factor (flgand) superfami 2.5 
Hs.215937 ESTs 2.5 
Hs.102897 CGW7 protein 2.5 
Hs.42975 ESTs 2.5 
Target Exon 2.5 
ENSP00000220888*21NC FINGER TRANSCRIPT125 
Hs.81376 Homo sapiens mRNA; cDNA DKFZp586L1 121 (f 2.5 
Ms.89137 low density llpoprotein-related protein 2.5 
Hs.172777 bacufoviral IAP repeat-containing 4 2.5 
Hs. 180612 peroxisomal membrane protein 3 (35kD,Ze 2.5 
Hs.15806 Homo sapiens mRNA; cONA DKFZp434H2019 (f 2.5 
Hs.164866 cycUnK 2.5 
gb:MR3-ST0192-010200-210-c05 ST0192 Homo 2.5 
Hs.191840 ESTs £5 
Target Exon 2.5 
matrix metafioprotetnase 10 (MMP10; str 2.5 
ESTs 2.5. 
hypothetical protein FU11046 25 



Hs.2258 
Hs.?57862 
Hs.16986 
Hs.47274 
Hs.131755 
Hs.22607 
Hs.70811 
Hs.170160 
Hs.40479 
Hs.48295 
Hs.24792 
Hs.193083 
Hs.255703 
Hs.94949 
Hs.37636 
Hs.83190 
Hs.240770 
Hs.125783 
Hs.32683 
Hs.205144 
Hs.44307 
Hs.192619 
Hs.191435 
Hs.130315 
Hs.338704 
Hs.38260 
Hs.37189 
Hs.30977 
Hs.298419 
Hs.35406 
Hs.202151 
Hs.312691 
Hs.27721 
Hs.124915 

Hs.122341 

Hs.149377 

Hs5167 

Hs.9521 

Hs.125395 

Hs.50094 

Hs.1 18893 

Hs.193914 

Hs.795 

Hs.184927 

H3.42173 



Homo sapiens mRNA; cDNA DKFZp564B176 (fr 2.5 

hypothetical protein FU14298 2.5 

ESTs 2.5 
hypothetical protein FU20516 .2.5 

RAB2, member RAS oncogene femfiyJite 2.5 

ESTs . 2.5 

RNArtelicasefarnfly 2.5 

chromosome 12 open reading frame 5 2.5 

Homo sapiens cDNA FU14201 fis, clone NT 2.5 

ESTs ■ 2.5 

methylmalonyl-CoA epimerase 2.5 

ESTs, Weakly similar to KIAA1 392 protein 23 

fatty acid synthase 2.5 

nuclear cap binding protein subunit 2, 2 2.5 

DEME-6 protein 2.5 

ESTs 2.5 

ESTs 2.5 

ESTs, Moderately similar to I38022 hypot 2.5 

K1AA1600 protein 2.5 

ESTs 2.5 

ESTs, Weakly similar to A47582 &cell gr 2.5 

olfactory receptor, family 7, subfamily 2.5 

ubfquitin specific protease 18 2.5 

similar to rat HREV107 25 

ESTs, Weakly similar to B34087 hypotheti 2.5 

ESTs 25 

ESTs, Highly similar to unnamed protein 2.5 

ESTs 25 

ESTs 2.5 

Wolf-Hirschhom syndrome candidate 1-lik 2.5 

hypothetical protein MGC2601 25 

gb:QmT0038-27O3(K)-106Hrf12LTT0O38Homo 25 

ESTs 2.5 

C5000506*:gl|124941|sp|P18614|ITA1.RATI 25 

hypothetical protein DKFZp761L0424 25 

Homo sapiens mRNA; cDNA DKFZp434F152 (fr 25 

ESTs, Weakly similar to 2N43 JHUMAN ZINC 25 

ESTs 25 
Homo sapiens mRNA; cDNA DKFZp434O0515 (f 25 

Melanoma associated gene 25 

KIAA0575 gene product . 25 

H2A hlstone family, member O 25 

cytochrome P450, subfamily XIB (steroid 25 

Homo sapiens cDNA FU10366 fis, done NT 25 

Target Exon • 2.5 
413189 BE070231 gb:GV4-BT0407-260100-087-f12 BT0407 Homo 2.5
400346 AB041269 Hs.272263 Homo sapiens mRNA for keratin 19, partia 2.5
435509 AI458679 Hs.181915 ESTs 2.5
458145 AI239457 Hs.130794 ESTs 2.5
TABLE 19A

Table 19A shows the accession numbers for those Pkeys lacking UniGene IDs in Table 19.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.



Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
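
The relationship between the three columns defined above can be illustrated with a minimal sketch. It assumes each table row is available as whitespace-separated text; the names `ClusterRow`, `parse_cluster_row` and `index_by_accession` are hypothetical helpers introduced only for illustration and are not part of the disclosure.

```python
from typing import Dict, List, NamedTuple


class ClusterRow(NamedTuple):
    pkey: int              # unique Eos probeset identifier number
    cat_number: str        # gene cluster number, kept as an opaque string
    accessions: List[str]  # Genbank accessions comprising the cluster


def parse_cluster_row(line: str) -> ClusterRow:
    """Split one whitespace-separated row into Pkey, CAT number, accessions."""
    fields = line.split()
    if len(fields) < 3:
        raise ValueError(f"expected Pkey, CAT number and accessions: {line!r}")
    return ClusterRow(int(fields[0]), fields[1], fields[2:])


def index_by_accession(rows: List[ClusterRow]) -> Dict[str, int]:
    """Map every accession to the Pkey of the cluster that contains it."""
    return {acc: row.pkey for row in rows for acc in row.accessions}


rows = [
    parse_cluster_row("411905 1265181_1 BE265067 BE264978 AW875420"),
    parse_cluster_row("412248 1285000_1 BE176480 AW903298 AW903313"),
]
lookup = index_by_accession(rows)
```

Keeping the CAT number as a string avoids making any assumption about its internal format.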



Pkey CAT number Accessions

407647 1007366_1 AW860158 AW862385 AW860159 AW862386 AW862341 AW821869 AW821893 AW062660 AW062656
407980 103087_1 AA046309 AI263500 AA046397
408254 1049346_1 AW807227 AW807576 AW807137 AW807157 AW807495 AW807494 AW807417 AW807083 AW845786 AW845801
AW807130 AW807335 AW807081 AW807349 AW807339 AW807164 AW807341 AW807224 AW845903 AW177424
AW807159 AW807123 BE141576 AW807340 AW807334 AW807520 AW807205 AW807505 BE141574 AW807390
AW807395 AW845789 AW807101 AW807089 AW807519 AW807239 AW807509 AW807356 AW807526 AW807098
AW807307 AW807153 AW807295 AW807313 AW807322 AW807265 AW807513 AW807516 AW807166 AW807501
AW807120 AW807168 AW807121 AW807527 AW807151 AW807203 AW807489 AW807511 AW807158 AW845800
AW807507 AW845793 AW845796 AW807296 AW845781 AW807444 AW845871 AW807512 AW807242 AW807141
AW807522 AW807487 AW807514 AW807142 AW807232 AW807379 AW807114 AW807518 AW807199 AW807211
AW807498 AW807086 AW807492 AW807218 AW807082 AW807525 AW807493 AW807523 AW807087 AW845784
AW807037 AW807128 AW807080 AW807118 AW645807 AW807524 AW845803 AW807249 AW845795 AW807160
AW807343 AW807515 AW807233 AW807289 AW177102 AW807352 AW807394 AW177105 AW807176 AW177103
AW845870 AW177099 AW177101 AW807528 AW807336 AW807038 AW177100 AW807411 AW807088 AW845865
AW807226 AW807517 AW807397 AW807303 AW807177 AW807154 AW807136 AW807146 AW807085 AW807521
AW807488 AW07365 AW807355 AW807223 AW807155
409163 110418_1 AA065081 AA075017 M084791 AA071015 AA081560 AA071459 AA545727 AA083100 AA085366 AA115845 AA075457
AA064704 AA082878 AA075742 AA069162
409695 114876_1 AA296961 AA296889 AA076945 AA077528 AA077497
410534 1207247_1 AW905138 AW753008 R13818 Z43519
410672 1214882_1 AW794600 AW794730
410784 1221005_1 AW803201 BE079700 BE062940
410785 1221055_1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
410835 1223785_1 AW806906 AW808915 AW866460 AW866475 AW866462 AW866448 AW866372 AW866604
411050 1230330_1 AW814902 BE156656 BE156667 BE156590 BE156441 BE156447
411086 1231500_1 BE070800 AW875226 BE149115
411093 1231970_1 BE067650 AW817053
411111 1232669_1 AW818127 AW818161 R09719
411171 1234393_1 AW820260 AW820332 R94406
411337 1239217_1 AW837349 AW837355 AW882717
411514 1248638_1 AW850178 AW850233 AW850445 AW850446
411670 1253680_1 AW856552 AW861101 AW856574 AW861099 AW861100 AW856573 AW856576 AW856562
411905 1265181_1 BE265067 BE264978 AW875420
412102 1277395_1 H56435 H56572 AW892929
412209 1283610_1 AW901456 AW901450 AW901441
412248 1285000_1 BE176480 AW903298 AW903313
413043 1346556_1 BE158766 BE061699 BE147360 BE147362 BE061666 BE061697 BE061647 BE061678
413111 1349546_1 BE065837 BE065805 BE065799 BE065818 BE065839 BE065831 BE065894 BE065789 BE065792
413189 1352723_1 BE070231 BE070229 BE070255
413221 1353887_1 BE161151 BE162495 BE161002 BE072205 BE160989 BE162482
413499 1373910_1 BE144884 H97942
413708 1384140_1 BE158791 BE158806 BE158748 BE158744 BE158740 BE158739 BE158811 BE158700 BE158741 BE158683 BE158685
414210 1426051_1 BE383592 BE261671
414596 1465004_1 BE386870 Z41986 H08501
414605 1465790_-1 BE390440
415747 155189_1 AA381209 AA381245 AA167683
416173 1574973_1 R52782 R17313 H24192 R19876
417742 169628ZJ R54719 Z44680 R12451
417974 171237_1 AA210765 T95700 H94407
418636 177402_1 AW749855 AA225995 AW750208 AW750206
419536 185688_1 AA603305 AA244095 AA244183
420854 197072_1 AW296927 AI684514 AI263168 AA281079
422156 212379_1 N34524 AA305071 AW954803 AA502335 AI433430 AI203597 AW026670 AW265323 AW850787 AA317554 AW993643
AW835572 AW385512 AI334966 W32951 H62656 H53902 R88904 AW835732
422996 223666_1 BE091089 BE091123 AA319959
423833 232451_1 AW503329 N46610 AA331571
423841 232507_1 AW753967 AA370795 AA331630 AW962550
423945 233566_1 AA410943 AW948953 AA334202 AA332882
425201 247933_1 AA352111 AW962247 AA429695
426650 270283_1 AA382814 AA402411 AA412355
426878 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
430264 315008_1 AA470519 BE303010 BE302954 BE384120
430785 323486_1 Z30201 AA486132 T72025
431676 336411_1 AI685464 AW971336 AA513587 AA525142
433687 373061_1 AA743991 AA604852 AW272737
434338 383982_1 AW754311 AA630185 AW803285
434469 387447_1 AA634806 C18732 AA729161 AA729860
435447 406400_1 AI872932 AA682306 BE220163 W88695 T81307 H91447
437152 43386_1 AL050027 BE089051
437854 44418_1 AL119723 AL119874 AI909018 U50537
439031 46798_1 AF075079 H48601 H48795
439255 470321_1 BE164500 AA832198 BE164502
444910 624951_1 AI201849 BE069007 AW946544
445432 63943_1 AV653771 BE089370
446922 69865_1 BE175605 Z43529 F06610 BE175602 AV661027
447197 711623_1 R36075 AI366546 R36167
448420 76273_1 BE623004 AA380669 BE263627 BE246433
448516 766241_1 AW898595 AW898588 AW898590 AW898663 AW898592 AI525093
450522 837264_1 AI698839 AI909260 AI909259
450736 844652_1 AW970060 AI732366 AI792313 AW839644
451024 65565_1 AA442176 AA259181
451067 85759_1 BE172186 AA059279 AA020815 AA013437
451340 86640_1 AW936273 AW340350 AA017208
452542 921410_1 AW812256 AW812257 AI906423 AI906422
452564 92227_1 AA026777 N50065 R09961 N54721
453472 968371_1 AL037925 AL037931 AL037957
454307 1106070_1 AW855717 AW362452 AW362443
454359 1130674_1 N71277 AW390764
454545 1223779_1 AW806899 AW866451 AW866393 AW866297 AW817669
454693 1229132_1 AW813428 AW813444 AW813367 AW613368 AW813429 AW813424
454714 1230493_1 AW815098 BE154643 BE154831
455047 1250536_1 AW852530 AW852527 AW852526
455092 1252971_1 BE152428 AW855572 AW855607
455097 1253130_1 AW855802 AW855794 AW855797 AW855806 AW855796 AW855808 AW855793 AW855807
455100 1253334_1 BE160198 AW935898 T11520 AW935930 AW856073 AW861034
455431 1289854_1 AW938484 BE001245 BE001190
455511 1321229_1 BE144762 AW979091
455609 1337548_1 BE011668 BE011689 BE011627 BE011679 BE011699 BE011678 BE011696 BE011675 BE011622 BE011635
455651 1348732_1 BE064962 BE064979 BE064853 BE064857 BE064856 BE064977 BE064960 BE064860 BE064815 BE064957 BE064804
BE064816 BE064850 BE064806 BE064796 BE064818 BE064975 BE064819 BE064810 BE064668 BE065059
455685 1350393_1 BE066976 BE066928 BE066927
455700 1351264_1 BE068115 BE068104 BE068102 BE068096 BE088103 BE068154 BE068198
455708 1352232_1 BE069326 BE069290 BE069352
455732 1353874_1 BE080908 BE072258 BE072190 BE072236
455838 1374605_1 BE145808 BE145807 BE181883
455935 1384144_1 BE158687 BE158688
455945 1385588_1 BE160636 BE160606 BE160703
456207 165078_-1 AA193450
456482 192289_1 AA485224 AA287308 AA258121
458094 47311_1 AF086325 W72956 W73221 AA219112
458673 679507_1 N99626 AI302701
TABLE 19B

Table 19B shows the genomic positioning for those Pkeys lacking UniGene IDs and
accession numbers in Table 19. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
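
Each nucleotide position entry in this table is a comma-separated list of start-end ranges, one range per predicted exon. A minimal sketch for reading such an entry follows. It assumes the ranges are inclusive at both ends (the table does not state the coordinate convention), and the function names are hypothetical.

```python
from typing import List, Tuple


def parse_exon_positions(entry: str) -> List[Tuple[int, int]]:
    """Parse e.g. '117606-117928,124040-124147' into (start, end) pairs."""
    exons = []
    for part in entry.split(","):
        # A damaged entry with no hyphen fails here with ValueError.
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start exceeds end in range {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum of exon lengths, assuming inclusive coordinates."""
    return sum(end - start + 1 for start, end in exons)


exons = parse_exon_positions("117606-117928,124040-124147")
length = total_exon_length(exons)
```

Entries whose hyphens were lost in reproduction are rejected rather than guessed at, so damaged positions surface as errors instead of silently producing wrong coordinates.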



Pkey Ref Strand Nt_position

400555 9801191 Minus 134694-134817
400608 9887666 Minus 96756-97558
400610 9887671 Minus 117606-117928,124040-124147
400925 7651921 Plus 38183-38391,43900-44086
401045 8117619 Plus 90044-90184,91111-91345
401049 7232177 Plus 149157-150692
401093 8516137 Minus 22335-23166
401256 9796573 Minus 45482-45620
401283 9800093 Minus 47256-47456
401326 9212516 Minus 226246-227505
401418 7452889 Minus 124865-125075
401451 6634068 Minus 119926-121272
401458 9187886 Plus 76485-77597
401497 7381770 Plus 92607-92813
401508 7534110 Minus 110779-110983
401575 7229804 Minus 76253-76364
401747 9789672 Minus 118596-118816,119119-119244,119609-119761,120422-120990,130161-130381,130468-130593,131097-131258,131866-131932,132451-132575,133580-134011
401781 7249190 Minus 83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814
401785 7249190 Minus 165776-165996,166189-166314,166408-166569,167112-167268,167387-167469,168634-168942
401793 7263888 Minus 102945-103083
401987 4406829 Minus 72893-73021,76938-77049
402077 8117414 Plus 66014-66195
402109 8131678 Minus 171722-171859,173197-173303
402184 8576001 Minus 112844-112986,113505-113636
402376 9625329 Minus 21753-22385
402421 9796341 Minus 4660946662,4675^6811,86293-86346,89776-89829,90^^
402578 9884928 Plus 66350-66496
402606 9909429 Minus 81747-82094
402696 732S813 Minus 23600-23731
402765 9367757 Plus 109588-109726
402800 6010175 Plus 43921-44049,46181-46273
402820 6456853 Minus 82274-82443
402892 8086844 Minus 194384-194645
403133 7331427 Plus 38314-38634
403356 8569930 Plus 92839-93036
403388 9438331 Plus 112733-113001,114599-114735
403426 9719529 Minus 157156-158183
403585 8101208 Minus 131266-131769
403593 6862650 Minus 62554-62712,6944W9602
403637 8671936 Minus 142647-142771,145531-145762
403639 8671948 Plus 113234-113326,115186-115287,119649-119786
403677 7331517 Minus 55008-55083,62860-63051
403775 7770580 Minus 102247-102326,103095-103148
403943 7711864 Plus 100742-100904,101322-101503
404091 7684554 Minus 82121-83229
404097 7770701 Plus 55512-55781
404142 9856692 Minus 80316-80459
404253 9367202 Minus 55675-56055
404274 9885189 Plus 104127-104318
404285 2326514 Plus 32282-32416
404360 9858450 Minus 122873-122966,151324-151469,153093-153253
404440 7528051 Plus 8043041581
404443 7579073 Minus 87198-87441
404552 7243881 Plus 19854-20010
404561 9795980 Minus 69039-70100
404580 6539738 Minus 240588-241589
404721 9856648 Minus 173763-174294
404826 6572184 Plus 47726-48046
404983 4432779 Minus 51178-51374,5200032173
405037 7543748 Minus 127374-127578
405041 7547195 Plus 121230-121714
405095 8072599 Plus 138877-139066
405153 9965565 Minus 175317-175500
405196 7230083 Minus 135716-135851
405232 7249042 Plus 125904-126063
405248 7259728 Plus 637-777
405336 6094635 Plus 33267-33563
405394 6624123 Minus 31900-32373
405460 7684569 Minus 5222332389
405494 8050952 Minus 70284-70518
405547 1054740 Plus 124361-124520,124914-125050
405609 5757553 Minus 4281443010,4358343783,4486345033,4642946554,47815480^
52918,55469-55601,5711137307,^16938296,60215-60332,6148241727
405638 6289229 Plus 199260-199372,199826-199929
405654 4895155 Minus 53624-53759
405718 9795467 Plus 113080-113266
405822 6273498 Minus 154660-154974,155203-155379
405848 7651609 Minus 28135-28244
405873 6758747 Minus 32129-32764
405906 7705124 Minus 10835-11059
405917 7712162 Minus 106829-107213
405925 6758795 Plus 129935-130282
405953 7960374 Minus 65101-65574
406069 9117732 Plus 6888069374
406151 7144806 Minus 94087-94285
406153 9929734 Minus 12902-13069
406182 5923650 Minus 28256-28935
406271 7534217 Plus 3617946692
406291 5686274 Plus 9562-9867
406348 9255985 Minus 71754-71944
406414 9256407 Plus 4959349850
406446 9454509 Minus 116424-116527,118721-118859,121187-121364
406504 7711360 Minus 107068-107277
406554 7711566 Plus 106956-107121
TABLE 20: 544 GENES UP-REGULATED IN BREAST CANCER 
COMPARED TO NORMAL ADULT TISSUES THAT ARE 
LIKELY TO ENCODE EXTRACELLULAR OR CELL-SURFACE PROTEINS 

Table 20 shows 544 genes up-regulated in breast cancer compared to normal adult tissues 
that are likely to encode extracellular or cell-surface proteins. These were selected as for 
Table 19, except that the ratio was greater than or equal to 3.0, the "average" normal adult 
tissue level was set to the 85th percentile value amongst 144 non-malignant tissues, the 
96th percentile value amongst the 73 breast cancer specimens was greater than or equal to 100 
units, and the predicted protein contained a structural domain that is indicative of 
extracellular localization (e.g., ig, fn3, egf, 7tm domains, signal sequences, transmembrane 
domains). The predicted protein domains are noted. 
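
The selection rule described above can be sketched in code. This is only an illustrative sketch under stated assumptions, not the procedure actually used to build the table: the function names, the linear-interpolation percentile, and the input layout are hypothetical; the 93rd-percentile numerator is taken from the R1 definition in the legend of this table; and the Table 19 criteria that Table 20 builds on are not modeled because they are not restated here.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile (pct in 0..100) with linear interpolation between ranks.

    The interpolation method is an assumption; the text does not say how
    percentiles were computed.
    """
    ordered = sorted(values)
    if not ordered:
        raise ValueError("percentile() needs at least one value")
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def passes_table20_filter(
    tumor: Sequence[float],          # expression units in breast cancer specimens
    normal: Sequence[float],         # expression units in non-malignant tissues
    has_extracellular_domain: bool,  # e.g. ig, fn3, egf, 7tm, SS or TM predicted
    ratio_pct: float = 93.0,         # tumor percentile used for R1 (from the legend)
    normal_pct: float = 85.0,        # "average" normal adult tissue level
    level_pct: float = 96.0,         # tumor percentile that must reach min_level
    min_ratio: float = 3.0,
    min_level: float = 100.0,
) -> bool:
    """Return True if a probeset meets the Table 20 thresholds quoted in the text."""
    normal_level = percentile(normal, normal_pct)
    if normal_level <= 0:
        # Assumption: a non-positive baseline is treated as "no usable ratio".
        return False
    ratio = percentile(tumor, ratio_pct) / normal_level
    return bool(
        ratio >= min_ratio
        and percentile(tumor, level_pct) >= min_level
        and has_extracellular_domain
    )
```

With 73 tumor values and 144 normal values per probeset, a call such as passes_table20_filter(tumor_units, normal_units, True) would flag the probeset for inclusion; the thresholds are exposed as parameters so the related criteria of other tables could be tried without changing the code.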

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Pred.Prot.Domains: Predicted Protein Domains 
Unigene Title: Unigene gene title 
R1: Ratio of 93rd percentile of tumor to 85th percentile of normal adult tissues 



Pkey	ExAccn	UnigeneID	Pred.Prot.Domains	Unigene Title	R1 
408591	AF015224	Hs.46452	SS,Uteroglobin,SS,Uteroglobin	mammaglobin 1	168.6 
400291	AA401369	Hs.190721	TM	ESTs	73.2 
449746	AI668594	Hs.176588	,SS,p450	ESTs, Weakly similar to CP4Y_HUMAN CYTOC	65.7 
407277	AW170035	Hs.326736	TM	Homo sapiens breast cancer antigen NY-BR	57.6 
400292	AA250737	Hs.72472	death,ZU5,TM,Activin_recp,pkinase,	BMP-R1B	55.9 
424735	U31875	Hs.272499	,SS,TM	short-chain alcohol dehydrogenase family	53.8 
426878	BE069341		TM	gb:QV3^T0381-27010(W)73<fl8BT0381 Homo	50.3 
428848	NM_000230	Hs.194236	SS,Leptin,SS,Leptin,	leptin (murine obesity homolog)	40.8 
407178	AA195651	Hs.104106	,SS,Dihydroorotase,	ESTs	39.3 
408000	L11690	Hs.620	Plectin_repeat,SH3,spectrin,SS,Plectin_r	bullous pemphigoid antigen 1 (230/240kD)	37.3 
427585	D31152	Hs.179729	SS,C1q,Collagen,SS,C1q,Collagen,	collagen, type X, alpha 1 (Schmid metaph	35.2 
429441	AJ224172	Hs.204096	,SS,Uteroglobin,	lipophilin B (uteroglobin family member)	30.0 
450375	AA009647	Hs.8850	,SS,TM,disintegrin,Pep_M12B_propep,Repro	a disintegrin and metalloproteinase doma	25.7 
420931	AF044197	Hs.100431	SS,IL8,SS	small inducible cytokine B subfamily (Cy	25.2 
422109	S73265	Hs.1473	SS,Bombesin,SS	gastrin-releasing peptide	24.6 
445730	AI624342	Hs.170042	,SS,TM,Cation_efflux	ESTs	24.1 
451110	AI955040	Hs.265398	SS	ESTs, Weakly similar to transformation-r	24.0 
400297	AI127076	Hs.334473	TM	hypothetical protein DKFZp564O1278	23.8 
420813	X51501	Hs.99949	SS,SS	prolactin-induced protein	22.8 
452744	AJ267652	Hs.30504	,SS,TM,GNS1_SUR4,cNMP_binding,RIIa	Homo sapiens mRNA; cDNA DKFZp434E082 (fr	22.6 
424634	NM_003613	Hs.151407	ig,tsp_1,SSMA	cartilage intermediate layer protein, nu	21.7 
420757	X78592	Hs.99915	hormone_rec,Androgen_recep,zf-C4,	androgen receptor (dihydrotestosterone r	21.7 
424399	AI905687	Hs.2533	SS	aldehyde dehydrogenase 9 family, member	20.3 
447350	AI375572	Hs.172634	,pkinase,	ESTs	19.5 
456207	AA193450		,SS,TM,p450,p450	gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi	18.3 
431448	AL137517	Hs.334473	TM	hypothetical protein DKFZp564O1278	18.2 
427217	AA399272	Hs.144341	SS	ESTs	18.2 
456938	X52509	Hs.161640	,SS,TM,aminotran_1_2,Cadherin_C_term,cad	tyrosine aminotransferase	18.1 
435496	AW840171	Hs.265398	SS	ESTs, Weakly similar to transformation-r	17.9 
402578			SS,p450,SS,TM,p450	C1001134:gi|2117372|pir||I65981 fatty ac	17.8 
453160	AI263307	Hs.239884	SS	H2B histone family, member L	17.8 
422505	AL120862	Hs.124165	SS	programmed cell death 9 (PDCD9)	17.7 
444342	NM_014398	Hs.10887	Lamp,SS,TM,Lamp,	similar to lysosome-associated membrane	17.5 
449765 N92293 Hs.206832 

428227 AA321649 Hs.2248 

425692 D90041 Hs.155956 

424001 W67883 Hs.137476 

448595 AB014544 Hs.21572 

449448 D60730 Hs.57471 

418007 M13509 Hs.83169 

418994 AA296520 Hs.89546 

453596 AA441838 Hs.62905 
452401 NM_007115 Hs.29352 

446591 H44186 Hs.15456 

419296 AA236115 Hs.120785 

452838 U65011 Hs.30743 

422805 AA436989 Hs.121017 

448390 AL035414 Hs.21068 

447342 AI199268 Hs.19322 

411869 W20027 Hs.23439 

443709 AI082692 Hs.134662 
459587 AA031956 

442580 AT733682 Hs.130239 

400289 X07820 Hs.2258 

411598 BE336654 Hs.70937 

415263 AA948033 Hs.130853 

433805 AA706910 Hs.112742 

407276 AI951118 Hs.326736 

443348 AW873596 Hs.182278 

421037 AI684808 Hs.197653 

424086 AI351010 Hs.102267 

400295 W72838 Hs.2533 

452461 N78223 Hs.108106 

427365 AI873274 Hs.190721 

433365 AF026944 Hs.293797 

409269 AA576953 Hs.22972 

432596 AJ224741 Hs.278461 

408771 AW732573 Hs.47584 

432912 BE007371 Hs.200313 

447033 AI357412 Hs.157601 

421155 H87879 Hs.102267 
424905 NM_002497 Hs.153704 

425398 AL049689 Hs.156369 

438167 R28363 Hs.24286 
459583 AI907673 
423945 AA410943 

439820 AL360204 Hs.283853 
402606 

445263 H57646 Hs.42586 

430217 N47863 Hs.336901 

447164 AF026941 Hs.17518 

431385 BE178536 Hs.11090 

423887 AL080207 Hs.134585 

415385 R17798 Hs.7535 

425704 U79293 Hs.159264 
429859 NM_007050 Hs.225952 

425523 AB007948 Hs.158244 

428368 BE440042 Hs.83326 
418912 NM_000685 Hs.89472 

422026 U80736 Hs.110826 

451952 AL120173 Hs.301663 

438199 AW016531 Hs.122147 
400608 

413472 BE242870 Hs.75379 

432374 W68815 Hs.301885 
402408 NA 

445537 AJ245671 Hs.12844 

451621 AI879148 Hs.26770 
405654 NA 

434988 AJ418055 Hs.161160 



SS ESTs, Moderately similar to ALU8_HUMAN A 

SS,IL8, small inducible cytokine subfamily B (CX 

,SS,Acetyltransf2, N-acetyltransferase 1 (arylamine N-acety 

,pkinase, paternally expressed 10 

LRRCT,LRR,SS,LRRCT,serine_carbpept KIAA0644 gene product 
SS ESTs 

SS,hemopexin,Peptidase_M10,SS,Peptidase_ matrix metalloproteinase 1 (MMP1; inters 
SS,lectin_c,sushi,EGF,SS,EGF,lectin_c,su selectin E (endothelial adhesion molecul 



SS 

,SS,CUB,Xlink, 
PDZ,SS 
SS 

SS,SS 

histone,SS,histone,histone 
SS 

,SS,lipocalin 

,SS,Peptidase_M1, 

,SS,TM,SNF 

,SS,LIM, 

SS 



hypothetical protein FLJ14834 
tumor necrosis factor, alpha-induced pro 
PDZ domain containing 1 
ESTs 

preferentially expressed antigen in mela 
H2A histone family, member A 
hypothetical protein 

Homo sapiens, Similar to RIKEN cDNA 2010 

ESTs 

ESTs 

gb:zk15e04.s1 Soares_pregnant_uterus_NbH 
ESTs 



hemopexin,Peptidase_M10,SS,Peptidase_M10 matrix metalloproteinase 10 (MMP10; str 



histone,SS,histone,histone 
,SS,histone,histone,linker_histone 
,SS,Ribosomal_L7Ae, 
TM 

,SS,DENN 
SS 

,SS,Lysyl_oxidase 
SS 

,SS,G9a,PHD, 
TM 

SS,TPR 

SS,TM,UPF0016,SS,TM,UPF0016 

SS,EGF,vwa,SS,TM,vwa, 

TM,K_tetra,ion_trans,SS,TM,K_tetra,ion_t 

,SS,TM,Folate_carrier 

SS 



H3 histone family, member A 
ESTs 
ESTs 

Homo sapiens breast cancer antigen NY-BR 
calmodulin 2 (phosphorylase kinase, delt 
programmed cell death 9 (PDCD9) 
lysyl oxidase 

aldehyde dehydrogenase 9 family, member 

transcription factor 

ESTs 

ESTs 

hypothetical protein FLJ13352 
matrilin 3 

potassium voltage-gated channel, delayed 

ESTs 

ESTs 



SS,Lysyl_oxidase,Aldose_epim,Epimerase,S lysyl oxidase 



pkinase,SS,TM,pkinase,polyprenyl_synt, 
SS 

,SS,TM,7tm_1,p450,rrm 
,pkinase, 

death,ZU5,TM,Activin_recp,pkinase, 
SS 
SS 

,SS,Acyltransferase, 



NIMA (never in mitosis gene a)-related k 
hypothetical protein similar to tenascin 
ESTs 

gb:tl_-BT152-080399-004 BT152 Homo sapien 
gb:zt32h03.r1 Soares ovary tumor NbHOT H 
Homo sapiens mRNA full length insert cDN 
NM_024626:Homo sapiens hypothetical prot 
KIAA1560 protein 



,SS,RNA^poU^,RNAj)oU^.Ribc^maLS24e,ribosomal protein S24 



,TM,IBR 
,SS,TM 

,SS,TM,BRCT,ank,ABC_tran,ABC_tran 
,SS,Fork_head, 
SS 

,SS,TM,Y_phosphatase,MAM,fn3, 
,SS,laminin_B,laminin_EGF,laminin_Nterm 



Homo sapiens cig5 mRNA, partial sequence 
membrane-spanning 4-domains, subfamily A 
DKFZP434G232 protein 
COBW-like protein 

Human clone 23948 mRNA sequence 
protein tyrosine phosphatase, receptor t 
KIAA0479 protein 



SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 3 (stromelysin 



SS,TM,7tm_1,SS,TM,7tm_1, 
SS 

,SS,pkinase, 
,SS,ArfGap, 
SS,TM,SS,TM 
SS 
SS 

,SS,carb_anhydrase 
,SS,TM,ras 
SS,lipocalin,lipocalin, 
BTB,SS 
SS 



angiotensin receptor 1 
trinucleotide repeat containing 9 
ESTs 
ESTs 

C10001899:gi|7508633|pir||T25392 hypothe 
solute carrier family 1 (glial high affi 
Homo sapiens cDNA FLJ11346 fis, clone PL 
NM_030920*:Homo sapiens hypothetical pro 
EGF-like-domain, multiple 6 (EGFL6) 
fatty acid binding protein 7, brain 
C12001521:gi|7513934|pir||T31081 cca3 pr 
ESTs 



17.3 

17.0 

16.7 

16.5 

16.3 

16.2 

15.7 

15.5 

15.5 

15.0 

14.9 

14.8 

14.7 

14.3 

14.2 

14.2 

13.9 

13.7 

13.7 

13.5 

13.5 

13.3 

13.2 

13.1 

13.1 

13.0 

12.9 

12.8 

12.7 

12.5 

12.4 

12.3 

12.0 

11.9 

11.9 

11.9 

11.8 

11.8 

11.7 

11.6 

11.5 

11.5 

11.4 

11.4 

11.3 

11.2 

11.1 

11.1 

11.1 

10.9 

10.9 

10.7 

10.4 

10.3 

10.3 

10.3 

10.3 

10.3 

10.2 

10.1 

10.0 

9.9 

9.8 

9.7 

9.6 

9.6 

9.6 
416220 N49776 Hs.170994 
431808 M30703 Hs.270833 
414142 AW368397 Hs.150042 
400298 AA032279 Hs.61635 
418601 AA279490 Hs.86368 
415539 AI733881 Hs.72472 
421451 AA291377 Hs.50831 
429432 AI678059 Hs.202676 
442441 AI820662 Hs.129598 
426429 X73114 Hs.169849 
406687 M31126 Hs.272620 
448693 AW004854 Hs.228320 
419948 AB041035 Hs.93847 

426214 H59846 Hs.128355 
427718 AI798680 Hs.25933 
414812 X72755 Hs.77367 
400285 NA 

422330 D30783 Hs.115263 
416182 NM_004354 Hs.79069 
420077 AW512260 Hs.87767 
452281 T93500 Hs.28792 
434531 AA642007 Hs.116369 
408380 AF123050 Hs.44532 
443788 AI732643 Hs.144151 
411078 AI222020 Hs.182364 
445495 BE622641 Hs.38489 
433426 H69125 Hs.133525 
424871 NM_004525 Hs.153595 

426215 AW963419 Hs.155223 
409045 AA635062 Hs.50094 
435525 AI831297 Hs.123310 
409203 AA780473 Hs.687 
424902 NM_003866 Hs.153687 
431725 X65724 Hs.2839 
418092 R45154 Hs.106604 
439840 AW449211 Hs.105445 
427811 M81057 Hs.180884 
420807 AA280627 Hs.57846 
426320 W47595 Hs.169300 
447078 AW885727 Hs.301570 
415786 AW419196 Hs.257924 
410102 AW248508 Hs.279727 
404347 

433687 AA743991 
421373 AA808229 Hs.167771 
422634 NM_016010 Hs.118821 
453310 X70697 Hs.553 
435957 N39015 Hs.190368 
407771 AL138272 Hs.62713 
443646 AI085198 Hs.164226 
446142 AI754693 Hs.145968 
444649 AW207523 Hs.197628 
435147 AL133731 Hs.4774 
439138 AI742605 Hs.193696 
429220 AW207206 Hs.136319 
428804 AK000713 Hs.193736 
453511 AL031224 Hs.33102 
439809 R41396 Hs.101774 
414869 AA157291 Hs.21479 
416276 U41060 Hs.79136 
452862 AW378065 Hs.8687 
452926 AI742170 Hs.31297 
453331 AI240665 Hs.8895 
420802 U22376 Hs.1334 
450603 R43646 Hs.12422 
422867 L32137 Hs.1584 
418004 U37519 Hs.87539 



f SS,TM 

SS,TM,EGF,SS 

.SS.UDPGT 

TM 

SS > TM ( calretlculln,SSJM,calreticulin, 

deathiU5.TM,Activln_recp,pklnase, 

TM 

SS 

ss 

,SS,TM,fh3,ig, 



hypothetical protein MGC10946 
amphiregulin (schwannoma-derived growth 
Homo sapiens cDNA FLJ14438 fis, clone HE 



calmegin 
BMP-R1B 
ESTs 

synaptonemal complex protein X 
ESTs 

myosin-binding protein C, slow-type 



SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 11 (MMP11; stro 



hypothetical protein FLJ23537 
NM_016931:Homo sapiens NADPH oxidase 4 ( 
ESTs, Moderately similar to ALU7_HUMAN A 
ESTs 

monokine induced by gamma interferon 
Eos Control 
epiregulin 
cyclin G2 
ESTs 

Homo sapiens cDNA FLJ11041 fis, clone PL 
ESTs 



SS 

Ferric_reduct,TM,Ferric_reduct 
SS 

,SS,TM,histone,Sec1,histone,sugar_tr 
SS,IL8,SS,IL8 

,TM,ABC_tran,ABC_membrane, 
SS,TM,EGF,SS,TM 
cyclin,SS 
SS 

,SS,TGF-beta,TGFb_prop 
SS 

l SS,TM,ublqultin l 7tmJ,ANF_receptor l sush drubiquitin 
TM ESTs 
SS,SS CocoaCrisp 

SS,SS ( E^m^,l_LWEQ,ENTH,LLWEQ l DN/L^nls_reESTs l Weakly similar to I38022 hypotheti 
,SS,TM ESTs 
SS,EGF,ldl_recept_a,ldl_recept_b,SS,TM,E low density lipoprotein-related protein 
SS stanniocalcin 2 

TM Homo sapiens mRNA; cDNA DKFZp434O0515 (f 

TM ESTs 

SS,p450,SS,p450 cytochrome P450, subfamily IVB, polypept 

SS,SS inositol polyphosphate-4-phosphatase, ty 

SS,Cys_knot,SS Norrie disease (pseudoglioma) 

,death,ZU5,pkinase,Activin_recp, ESTs 
SS GDNF family receptor alpha 1 

SS,Zn_carbOpept,Propep_M14,SS,Propep_M14 carboxypeptidase B1 (tissue) 



SS,cpn10 

SS,TGF-beta,TGFb_propept!de,SS 

,SS,kazal, 

SS 

SS 

SS 

TM 

,SS,IMPDHJUMPDH_N,CBS 
SS 

TM,SNF,SS,TM,SNF, 
,SS,TM 

,TM,cpn60JTCP1,Sema, 
,TSPN.vwctspJ,EGF,thiored, 
JM.cadherin.Cadherln.C.term, 
,SS,rrm, 

JM.SDF.UPAa.LY6, 

TM . 

SS 

.SS.UDPGT 
SS.SS 
SS 
SS 

SSJM.TM 



ESTs 

transforming growth factor, beta 2 
ESTs 

hypothetical protein FLJ13782 

Homo sapiens cDNA FLJ14035 fis, clone HE 

Target Exon 

gb:ny57g01.s1 NCLCGAP_Pr18 Homo sapiens 
ESTs 

CGI-62 protein 

solute carrier family 6 (neurotransmitte 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp761C1712 (f 

ESTs 

ESTs 

hypothetical protein FLJ20706 
transcription factor AP-2 beta (activati 
hypothetical protein FLJ23045 
ubinuclein 1 

LIV-1 protein, estrogen regulated 



,SS,Pep _M12B_propep,Reprolysin,tspJ, ESTs 

,SS,TM duodena! cytochrome b 

,SS l TM l disintegrinPepJ412Bj3ropep.Repro ESTs 

SS,NA,myb_DNA-blnding v-myb avian myeloblastosis viral oncogen 

SS ESTs 

SS,EGF,tsp_3,SS,E2F.JDP, cartilage oligomeric matrix protein (COM 

SS.aldedh.SS.aldedh, aldehyde dehydrogenase 3 family, member 
9.5 
9.5 
9.4 
9.4 
9.4 
9.4 
9.3 
9.1 
9.1 
9.0 
8.4 
8.4 
8.3 
8.3 
8.3 
8.3 
8.1 
8.1 
8.0 
8.0 
8.0 
8.0 
7.9 
7.9 
7.8 
7.8 
7.8 
7.7 
7.6 
7.6 
7.6 
7.6 
7.5 
7.5 
7.5 
7.5 
7.5 
7.4 
7.4 
7.3 
7.3 
7.2 
7.2 
7.2 
426451 
450701 
419519 
410555 
433138 
411558 
409079 
417275 
432731 
442818 
407366 
427427 
410785 
401045 

418986 
442082 
442861 
418836 
422060 
444381 
404091 
417866 
428819 
410275 
425236 
415669 
416319 
412140 
442942 
446163 
443162 
409602 
428479 
400300 
433404 
410079 
401781 
447359 
402230 
427674 
428398 
458098 
419968 
416636 
419703 
424687 
449679 
421296 
442117 
400303 
419440 
444858 
432239 
440705 
400286 
446466 
423201 
433043 
439509 
425247 
409757 
425292 
448045 
452681 
452243 
439310 



AI908165 Hs.169946 

H39960 Hs.288467 

AI198719 Hs.176376 

U92649 Hs.64311 

AB029496 Hs.59729 

AA102670 Hs.70725 

W87707 Hs.82065 

X63578 Hs.295449 

R31178 Hs.287820 

AK001741 Hs.6739 
AF026942 

AF077345 Hs.177936 
AW603341 



C11001883*:gi|6753278|ref|NP_033938.1| c 
AI123555 Hs.81796 .SS.ReproIysln.tspJ, 



SS.GATA. 
,SS,LRR 
SS 

JM.dlsintegrin.Repfolysin, 
SS,lg,Sema,SS,Sema,efhand 
SSJM.SS.TM 
,TM ( fo3, 

SS.efhand.SS.efhand.ras 
,SS,fn3,fo1,fh2,m2,fn1 
WD40.SS 
JM.IBR . 

SS,lectin_cSS 
SS 

ICEj)20 l SS,lCEj)10,ICE_p20 l lC^4>10 l ICE_p 

6.7 



GATA-binding protein 3 (T-cell receptor 
Homo sapiens cDNA FLJ12280 fis, clone MA 
ESTs 

a disintegrin and metalloproteinase doma 
semaphorin sem2 

gamma-aminobutyric acid (GABA) A recepto 
interleukin 6 signal transducer (gp130, 
parvalbumin 
fibronectin 1 

hypothetical protein FLJ10879 
gb:Homo sapiens dg33 mRNA, partial sequ 
ESTs 

gb:IL2-UM0079-09030(W50-O03 UM0079 Homo 



R41823 Hs.7413 
AA243837 Hs.57787 
AI655499 Hs.161712 
R20893 Hs.325823 
BE387335 Hs.283713 
NA 

AW067903 Hs.82772 
AL135623 Hs.193914 
U85658 Hs.61796 
AW067800 Hs.155223 
NM_005025 Hs.78589 
AI815601 Hs.79197 
AA219691 Hs.73625 
AW167087 Hs.131562 
AA026880 Hs.25252 
T49951 Hs.9029 
W26713 Hs.256972 



Y00272 
X03363 
T32982 
U94362 



Hs.184572 

Hs.102720 
Hs.58589 



NM_012093 Hs.18268 
NA 

NM_003528 Hs.2178 
AI249368 Hs.98558 
BE550224 Hs.74170 
X04430 Hs.93913 
N32536 Hs.42645 
AI793257 Hs.128151 
J05070 Hs.151738 
AI823951 Hs.129700 
NM_002666 Hs.103253 
AW664964 Hs.128899 
AA242758 Hs.79136 
AB020689 Hs.90419 
AJ199738 Hs.208275 
X81334 Hs.2936 
AA904244 Hs.153205 
NA 

H38026 Hs.308 
NM_000163 Hs.125180 
W57554 Hs.125019 
AF086332 Hs.58314 
NM_005940 Hs.155324 
NM_001898 Hs.123114 
NM_005824 Hs.155545 
AJ297436 Hs.20166 
AF153330 Hs.30246 
AL355715 Hs.28555 
AF086120 Hs.102793 



JM.EPH Jbd,pkinase,SAM,fn3, 
SS 

,TM,A(rtMnjecp ( pklnase,death,ZU5 t 

.SSJM.CDSe 

,SS,mito_carr 

JM,7trTL3ANF_receptor, 

SS.Collagen.COLFl.TSPN.SS.TSPN 

SS.SS 

,SS,RfoosomaLS4e 
SS 

,SS,serpin, 

SS.TM.ig.SS.TM 

.SS.kinesin, 

.SS.ig.Sema.pkinase, 

,SS,TM,fn3, 

ftlamentSS.fOament.filament 
.SSJM.DAGKa.DAGKc, 
,SS ( pklnase,pkinase 



ESTs 

ESTs; calsyntenin-2 

ESTs 

ESTs 

ESTs, Moderately similar to ALU5_HUMAN A 
ESTs, Weakly similar to S64054 hypotheti . 
Target Exon 

collagen, type XI, alpha 1 
KIAA0575 gene product 
transcription factor AP-2 gamma (activat 
stanniocalcin 2 

serine (or cysteine) proteinase inhibito 
CD83 antigen (activated B lymphocytes, i 
RAB6 interacting, kinesin-like (rabkines 

ESTs 

prolactin receptor 
DKFZP434G032 protein 
ESTs 

cell division cycle 2, G1 to S and G2 to 



SS,TM,pkinase,Recep_L_domain,SH2,PH,Fur HER2 receptor tyrosine kinase (c-erb-b2, 



SS 

GlycoJransLS.SS 

.SS.fflamentPribosyltran.fflamentArmad 

SS.adenytateWnase, 

,SS.TM,p450, 

h!stone,SS,histone, 

.SS.TM 

SS 

SS.IL6.1L6. 
,SS,TM 
,SS,zf-C2H2, 



ESTs 
glycogenin 2 

Target Exon 



Target Exon 
H2B histone family, member Q 
ESTs 

metallothionein 1E (functional) 
interleukin 6 (interferon, beta 2) 
solute carrier family 16 (monoca 
ESTs 



>xyilc 



SS,Peptidase_M10,fn2,hemopexin,SS,TM,Pep matrix metalloproteinase 9 (gelatinase B 
SS tolloid-like 1 

SS perilipin 
,SS,TM ESTs 
,SS,TM LIV-1 protein, estrogen regulated 

SS KIAA0882 protein 

SS ESTs, Weakly similar to ALUA_HUMAN!III 

SS,Peptidase_M10,hemopexin,SS,Peptidase_ matrix metalloproteinase 13 (collagenase 



TM 

SS.TM.ABCJran.ABCjnembrane.SS 

anestin.SS 

SS,TM,m3,SS 

SS 

.SS.TM.Syntaxfo 

SS.PepffdaseJAIO.hemopexto.SS 

.SS.cystatin, 

SS 

,SS,TM 
.SS.TM 
SS 

,SS,TM,UDPGT f caseinJcappa 



ESTs 

C16000922:gi|7499103|pir||T20903 hypothe 

arrestin 3, retinal (X-arrestin) 

growth hormone receptor 

lymphoid nuclear protein (LAF-4) mRNA 

ESTs 

matrix metalloproteinase 11 (MMP11; stro 
cystatin SN 

37 kDa leucine-rich repeat (LRR) protein 

prostate stem cell antigen 

solute carrier family 19 (thiamine trans 

programmed cell death 9 

ESTs 



7.1 

7.1 

7.1 

7.1 

7.0 

7.0 

7.0 

7.0 

6.9 

6.9 

6.8 

6.8 

6.7 



6.7 
6.7 
6.6 
6.6 
6.6 
6.6 
6.6 
6.2 
6.2 
6.1 
5.8 
5.8 
441111 
452355 
427711 
418636 
429353 
441690 
430447 
429698 
425325 

423600 
414737 
403593 
407758 
445234 
411165 
420633 
414117 
416783 
401093 
411096 
457411 
436007 
450506 
417975 
421072 
427032 
447752 
403199 
427122 
445900 
413048 
419563 
442432 
452093 
442323 
450606 
435542 
417576 
446089 
445413 
424420 
432378 
452190 
434674 
419986 
421582 
410361 
426327 
406639 
452834 
427315 
446733 
442118 
421524 
453060 
453403 
444301 
453619 
432656 
426384 
431701 
416931 
420854 
418867 
443514 
447499 



AI806867 Hs.126594 

N54926 Hs.29202 

M31659 Hs.180408 
AW749855 

AL117406 Hs.200102 

R81733 Hs.33106 

W17064 Hs.332848 

AI685086 Hs.26339 

X52730 Hs.1892 



,SS,TM,Phosphodiest 

TM.7tnu1.TM 

SS 

,SS,TM,HECT 

,SS,TM,ABCJran,ABCjnembrane, 

,SS,HECT,zf-UBR1,PABP l 14^-3, 

SS 

,SS,ras, 



ESTs 
G protein-coupled receptor 34 
solute carrier family 25 (mitochondrial 
gb:QV4-BT0534-281299453-c05 BT0534 Homo 
ATP-binding cassette transporter MRP8 
ESTs 

SWI/SNF related, matrix associated, acti 
ESTs, Weakly similar to S21349 probable 



SS,NNMT.PNMT_TEMT I SS,NNMT„PNMT_TEMT,STAR 



phenylethanolamine N-methyltransferase 



AI633559 Hs.310359 
AH60386 Hs.125087 
NA 

D50915 Hs.38365 
AW137636 Hs.146059 
NM_000169 Hs.69089 
NM_014581 Hs.274480 
W88559 Hs.1787 
AA206186 Hs.79889 

U80034 Hs.685?3 

AW085961 Hs.130093 

AI247716 Hs.232168 
NM_004460 Hs.418 

AA641836 Hs.30085 

AI215069 Hs.69113 

AF012023 Hs.173274 

M73700 Hs.105938 
NA 

AW057736 Hs.323910 

AF070526 Hs.13429 

M93221 Hs.75182 

AA526235 Hs.193162 

BE093589 Hs.38178 

AA447453 Hs.27860 

AW016669 Hs.29190 

AI668605 Hs.60380 

AA687376 Hs.269533 

AA339449 Hs.82285 

AI860021 Hs.270651 

AA151342 Hs.12677 

BE614743 Hs.146688 

AI493046 Hs.146133 

H26735 Hs.91668 

AA831879 Hs.136985 

AI345455 Hs.78915 

AI910275 Hs.1406 

BE391804 Hs.62661 

W03242 Hs.44898 
M97711 

AI638627 Hs.105685 

AA179949 Hs.175563 

AA863360 Hs.26040 

AA976718 Hs.202242 

AA312082 Hs.105445 

AW294092 Hs.21594 

BE466639 Hs.61779 

AK000136 Hs.10760 

H87648 Hs.33922 

NM_000246 Hs.3076 
AI472078 Hs.303662 
AW935490 Hs.14658 
D45371 Hs.80485 
AW296927 
D31771 Hs.89404 
BE464288 Hs.141937 
AW262580 Hs.147674 



SS 
SS 

,CIDE-N,pkinase 
SS.SS 
,SS,TM 



5.1 

ESTs 

ESTs 

Target Exon 
KIAA0125 gene product 
ESTs 



SS,Melibiase l BTK,PH,pkinase,SH2,SH3,Ribo galactosidase, alpha 
SS,iipocalln,SSJipocaJin odorant-bindlng protein 2A 

JM,ionJrans,KJetra, proteolipid protein 1 (Pelizaeus-Merzbac 

SS,TM,TM monocyte to macrophage differentiation-a 

TM,LRRCT,TM,LRRCT, C12000586*:gi|6330167|dbj|BAA86477.1| (A 

Peptidase M3, mitochondrial intermediate peptidase 

SS ESTs 
,SS,adluzinc, ESTs 
SS,DPPIV_N - term,Peptidase_S9 l SS,DPPlV_NJbroblast activation protein, alpha 



,SS,trypsin 
SS 

.SS.14-3-3 

SS,transferrin,7tm_1 .transferrin, 
SSJM,Folate_canier t SS,TM,FoIatej»rTie 



hypothetical protein FU23186 
ESTs 

integrin cytoplasmic domain-associated p 
lactotransferrin 

NWL025243*:Homo sapiens solute carrier f 



,SSJM,pkinase,RecepJ^domain,SH2,PH,FurHER2 receptor tyrosine kinase (c-erWt>2, 



,$S,Ca_channel_B, 

SS,TM,lectin_c > fn2 l RiciruB M lectin,SS,TM 

SS 

SS 

,SS ( TM,7tm_1, 
,SS,TM l CBS,voltage_CLC 
.TM.GIycoJiydro.l 
,SS,pkinase,RhoGEF,ig,PH,SH3 i 



Homo sapiens clone 24767 mRNA sequence 
mannose receptor, C type 1 
Homo sapiens cONA FU11983 fis, clone HE 
hypothetical protein FU23468 
Homo sapiens mRNA; cDNA DKFZp586M0723 (f 
ESTs 

ESTs, Moderately similar to ALU6 JiUMAN A 
ESTs 



AIRS,formyLtransf,GARS,SS ( GARS,AiRS,for phosphoribosylglycinamide formyltransfer 

.pkinase ESTs, Moderately similar to A47582 B-cel 

SS,UPF0099,SS,UPF0099, CGI-147 protein 

,SS,TM,MAPEG, ■ prostaglandin E synthase 

,SS,TM,UDPGT ESTs 

,SS,TM,PH,SH2,Furin-like,pklnase,Recep - L Homo sapiens done PP1498 unknown mRNA 

,SS,HisLdeacetyl, ESTs . 

pkinase.OPR, GA-binding protein transcnption factor, 

SS,trefoil,SS t TM,tdLrecepLa,SRCR,tryps trefoil factor 1 (pS2) 



SS,TM,GBP,TM ( GBP 
SS 

SS,SS,ig, 

.SS.DEAD.ForkJiead 
SS 

,SS,TM,p450, 

jg.Sema, 

SS 

,SS,ras,Yj)hosphatase I ras 

,SS,HMG_box,filament 

SS.LRR.SS 

SS 

SS.LRR, 

.SSArfGap, 

.SS.BIR 

SS,C1q,CoiIagen,SS l C1q r 
,SSJM,PeptidaseJVI1, 
SS,homeobox,homeobox, 
.SSJHMiP. 

,SS,TM,cadher1n f cadherin 



guanyiate binding protein 1, interferon- 
Homo sapiens done TCCCTA00151 mRNA sequ 
gb;Human T-ceJI receptor (V beta 18.1, J 
KIAA1688 protein 

Homo sapiens mRNA; cDNA DKFZp564N0763 (f 
ESTs, Weakly similar to fatty add omega 
ESTs 

GDNF family receptor alpha 1 

hypothetical protein MGC15754 

Homo sapiens cDNA FU13591 fis, clone PL ■ 

asporfn(LRRdassl) 

Homo sapiens, done MGCS084, mRNA, comp 

MHC class II transactivator 

ESTs 

Human chromosome 5q13.1 done 5G8 mRNA 
adipose most abundant gene transcript 1 
gb:UI-H-BW0-ajw>O7-O-Ul.s1 NCI_CGAP_Su 
msh (DrosophOa) homeo box homolog 2 
ESTs 

protocadherin beta 16 



5.2 
5.2 
5.2 
5.2 
5.1 
5.1 
5.1 
5.1 



5.1 
5.1 
5.1 
5.0 
5.0 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.9 
4.8 
4.8 
4.8 
4.8 
4.8 
4.7 
4.7 
4.7 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.6 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.5 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.4 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
441560 F13386 Hs.7888 

409064 AA062954 Hs.141883 

422667 H25642 Hs.133471 

454032 W31790 Hs.194293 

432663 AI984317 Hs.122589 
401747 

432882 NM_013257 Hs.279696 

437036 AI571514 Hs.133022 

447754 AW073310 Hs.163533 

443194 AI954968 Hs.279009 

451871 AI821005 Hs.118599 

457211 AW972565 Hs.32399 

421566 NM_000399 Hs.1395 

431657 AI345227 Hs.105448 

427899 AA829286 Hs.332053 

444779 AI192105 Hs.147170 

442295 AI827248 Hs.224398 

436396 AI683487 Hs.152213 

446039 AI150491 Hs.90756 

422938 NM_001809 Hs.1594 

406922 S70284 

439285 AL133916 Hs.172572 

424800 AL035588 Hs.153203 

429922 Z97630 Hs.226117 

447178 AW594641 Hs.192417 

409038 T97490 Hs.50002 

452747 BE153855 Hs.61460 

420139 NM_005357 Hs.95351 

408877 AA479033 Hs.130315 

403329 NA 

439926 AW014875 Hs.137007 

430832 AI073913 Hs.100686 

432481 AW451645 Hs.151504 

452410 AL133619 Hs.29383 

418661 NM_001949 Hs.1189 

431958 X63629 Hs.2877 

425071 NM_013969 Hs.154424 

447197 R36075 

428722 U76456 Hs.190787 

428330 L22524 Hs.2256 

423242 AL039402 Hs.125783 

449048 Z45051 Hs.22920 

414831 M31158 Hs.77439 

413589 AW452631 Hs.313803 

408875 NM_015434 Hs.48604 

418629 BE247550 Hs.86859 

450787 AB006190 Hs.25475 

414870 N72264 Hs.300670 

450325 AI935962 Hs.26289 

407633 NM_007069 Hs.37189 

426172 AA371307 Hs.125056 

442262 BE170651 Hs.8700 

427961 AW293165 Hs.143134 

445563 AW873606 Hs.149006 

403943 

408761 AA057264 Hs.238936 

423279 AW959861 Hs.290943 

420440 NM_002407 Hs.97644 

445107 AI208121 Hs.147313 

428303 AW974476 Hs.183601 

411667 BE160198 

427809 M26380 Hs.180878 

418203 X54942 Hs.83758 

430376 AW292053 Hs.12532 

444190 AI878918 Hs.10526 

433495 AW373784 Hs.71 

429638 AI916662 Hs.211577 



.pkinase, 

.SS.CUB, 

,SS,mFMO-Gke 

,SS,TM 

TM 

.SS.filamentfilament 
pkinase.pkinase_C, 
,SS,TM 
.pkinase, 
,SS,TM 
,SS,GDNF, 
WH1.WH1 
zf-C2H2.SS 
,SS,TM,pkinase 



SS,SA^roteins,ABC jnembrane.ABCJran.serom amyloid A1 



Homo sapiens clone 23736 mRNA sequence 

ESTs 

ESTs 

ESTs, Weakly similar to 154374 gene NF2 
ESTs 

Homo sapiens keratin 17 (KRT17) 
serum/glucocorticoid regulated kinases 
ESTs 

Homo sapiens cDNA FLJ14142 fis, clone MA 

matrix Gla protein 

ESTs 

ESTs, Weakly similar to S51797 vasodilat 
early growth response 2 (Krox-20 (Drosop 
ESTs, Weakly similar to B34087 hypotheti 



SS 

,COLFI,vwc,CoIiagen, 
,wnt 

,TM,GlycoJrydro_1 

,S$JM,thiolase, 

SS,TM,Desaturase,SS 

,SS,ig,pklnase,LRRNT l LRRCT, 

HLH.SS 

,SS,TM t linker_hlstone,7tm_1 
,SS,TM 
SS,IL8,SS,ILS 
,SS,HLH 
.SSJUptiO, 
,SS,TM 
SS.SS 
SS 
SS 

,SS,Collagen,COLF1,TSPN l 
,SS,TM,ras 
SS 



ESTs 

Homo sapiens cDNA FU1 1469 fis, clone HE 
wingless-type MMTV integration site fami 
ESTs 

centromere protein A (17kD) 

gb:stearoyl-CoA desaturase [human, adlpo 

hypothetical protein FU20093 

MyoD family inhibitor 

H1 hlstone family, member 0 

ESTs 

small inducible cytokine subfamily A (Cy 

Ig superfamily receptor LNIH 

lipase, hormone-sensitive 

ESTs. Weakly similar to A47582 B-ceO gr 

Target Exon M - 

ESTs 



ESTs, Weakly similar to JE0350 Anterior 
Homo sapiens cDNA FU1 1973 fis, clone HE 
Homo sapiens mRNA; cDNA DKFZp434E2321 (f 
E2F transcription factor 3 
SSJM,Cadherin_Cjerni,cadherin l SS,TM,cadcadherin3 f type 1, P-cadherin (placenta 
SS,T4_delodInase,T4_delodinase, deiodinase, iodothyronine, type II 

,TM,SDF, gbyh88b01.s1 Scares placenta Nb2HP Homo 

.SS.TIMP, tissue inhibitor of metafloproteinase 4 

SS,PeptidaseJVl10,SS,Peptidase_M10,hemopematrix metalloproteinase 7 (MMP7; uterin 
SS DEME-6 protein 

SS,SS,TM similar to S68401 (cattle) glucose indue 

,SS,cNMP binding,Rlla,HMGJ?ox protein kinase, cAMP-dependent regulato 

p SS,abhydrolase ESTs, Highly similar to AF157833 1 nond 

SS DKFZP434B168 protein 

SS,SH2,PH,SS,TM,PH,SH2,Furin-like,pkinas growth factor receptor-tound protein 7 



SS,TM,MIP,SS,TM,MIP, 
SS 
SS 

TMJM 
.SS.DENN 
.SS.START, 
SS 

,SS,WH1,WH1 
p450,SS,p450 
,SS,TM,7tiTt1, 
SS 

.SS.SRCR.Uteroglobin 
,SS,TM 

SS,RGS,RGS.RGS 
TM 

,SS,lipase,PLAT.Sec7.PH, 

CKS.SS.CKS, 

SS 

SS 

SS,ig,MHCJ,connexin I SCAN ) SS,TM 
SS.TM.SS 



aquaporin 7 
KIAA1204 protein 
ESTs 

similartofatHREV107 
ESTs 

deleted in liver cancer 1 

ESTs 

ESTs 

C5000355:gi|4503225Iref|NP - 000765.1 1 cyt 
ESTs, Weakly similar to (defline not ava 
ESTs 

mammaglobin 2 (MGB2; mammaglobin B; lip 
ESTs, Weakly similar to I38022 hypotheti 
regulator of G-protein signalling 16 
gb:QV1-HT0413-01020f>059-h03 HT0413 Homo 
lipoprotein lipase 
CDC28 protein kinase 2 
chromosome 1 open reading frame 21 
cysteine and glycine-rich protein 2 
alpha-2-glycoprotein 1, zinc 
kinectin 1 (kinesin receptor) 



4.3 
4.3 
4.3 
4.3 
4.3 
4.3 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.1 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
4.0 
3.9 
3.9 
3.9 
3.9 
3.9 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.8 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
3.7 
454071 
451859 
420281 
427691 
428824 
424676 
418026 
457465 
417601 
407999 
425548 
446619 
411213 
406625 
417511 
428769 
407137 
401866 
451195 
426044 
426310 
440029 
408573 
431830 
444781 
431493 
414175 
411789 
418851 
453968 
407104 
449051 
434398 
454042 
459496 
414998 
407756 
442101 
449722 
452554 
421991 
420058 
425776 
407846 
406925 
445873 
416054 
414921 
452268 
428862 
412520 
410530 
401780 
447131 
418334 
415138 
421168 
431473 
421379 
411984 
408101 
405366 
414612 
411393 
435767 
416406 
433068 



AI041793 Hs.42502 
H44491 Hs.252938 
AI623693 Hs.191533 
AW194426 Hs.20726 
W23624 Hs.173059 
Y08565 Hs.151678 
BE379727 Hs.83213 
AW301344 Hs.122908 
NM_014735 Hs.82292 
AI126271 Hs.49433 
AA890023 Hs.1906 
AU076643 Hs.313 
AA676939 Hs.69285 
Y13647 Hs.119597 
AL049176 Hs.82223 
AW207175 Hs.106771 
T97307 

U10492 Hs.438 
AA502490 Hs.336695 
NM_000909 Hs.169266 
AW089705 Hs.293711 
AA284775 Hs.43148 
Y16645 Hs.271387 
NM_014400 Hs.11950 
AI791493 Hs.129873 
AI308876 Hs.103849 
AF245505 Hs.72157 
AI417828 Hs.192435 
AA847843 Hs.62711 
S57296 Hs.323910 
AW961400 Hs.333526 
AA121098 Hs.3838 
H22570 Hs.172572 
AA808940 Hs.274450 
NM_002543 Hs.77729 
AA116021 Hs.38260 
AI651930 Hs.135684 
BE280074 Hs.23960 
AW452434 Hs.58006 
NM_014918 Hs.110488 
AK001423 Hs.94694 
U25128 Hs.159499 
AA426202 Hs.40403 
L34041 Hs.9739 
AA250970 Hs.251946 
NM_002318 Hs.83354 
BE390551 Hs.77628 
NM_003512 Hs.28777 
NM_000346 Hs.2316 
AA442324 Hs.795 
M25809 Hs.64173 

NM_004585 Hs.17466 
AA319233 Hs.5521 
C18356 Hs.295944 
AF182277 Hs.330780 
AA825686 Hs.321176 
Y15221 Hs.103982 
NM_005419 Hs.72988 
AW968504 Hs.123073 

BE274552 Hs.76578 
AW797437 Hs.69771 
H73505 Hs.117874 
D86961 Hs.79299 
N!VL006456Hsi88215 



,TM,7tnLl, ESTa 

,SS,TM l EGF,ldLrecepLa ( ldLrecepLb l EGF ESTs, Weakly similar to ALU1JWMAN ALU S 



,SS|AAA} 

,SS,Gtycos_transL2, 
SS 

GlycosJransf^.Ridn^BJeciln.SS.GIycos 

lipocalin.SSJipocannJipocalin.ferriti 

.SS.Pribosyitran.Sulfatase 

PHD.pkinase.SS 

SS 

SS.mfrtf.SS.TIW, 
.SS.TM.efhandJonJrans 



SS,TM,CUB.F5_F8^type_C l MAM,SSJM,CUBJ5_TOUfDpEin 



ESTa 
ESTs 
ESTs 

UDP-N-acetyMpha-D^alactosamlne:polyp 
fatty acid binding protein 4, adipocyte 
DNA replication factor 
KIAA0215 gene product 

ESTs, Weakly simitar to YZ28JHUMAN HYPOT 
prolactin receptor 

secreted phosphoprotein 1 (osteopontin, 



SS,TM,Desaturase,SS 
SS 

,SS,7tm 1.SPRY, 

,SS,TM,GDA1_CD39 

.SS.filament, 

SS l homeobox,Ets l SS,homeobox, 
SS 

SS.WtmJ, 
SS 

,SS,TM,PMP25LCiaudln, 

.SS.TMJLS 
,SS,PH,!adamase_B 
,SS,p450,p450 



stearoyl-CoA desaturase (delta-9-desatur
chordin-like
ESTs

gb:ye53h05.s1 Soares fetal liver spleen
Target Exon

mesenchyme homeo box 1
ESTs

neuropeptide Y receptor Y1
ESTs, Weakly similar to S64329 probable
ESTs

small inducible cytokine subfamily A (Cy
GPI-anchored metastasis-associated prote
ESTs, Weakly similar to A36036 cytochrom
,TM,hemopexin,Peptidase_M10,hemopexin l Pehypothetical protein DKFZp761D112 
ig,LRRCT, DKFZP564I1922 protein 

,SS.TM ESTs 

,SS,HMG_box, Homo sapiens, clone IMAGE:3351295, mRNA 

t SS,TM,SH2 t PH 1 pkinase 1 Recep - .L.domain,Furv-erb-b2 avian erythroblastic leukemia v 
SS HER2 receptor tyrosine kinase (c-erb-b2, 

pWnase,POLOJ)ox,SS,pkinase l POLOJjox, serum-inducible kinase 
.SS.ig.pkinase.LRRNT.LRRCT, hypothetical protein FIJ20093 

,SS,TM.KRAB,SCAN.zf-C2H2,lg EST 

,SS,TM oxidised low density lipoprotein (lectin 

SS.UCH-1 ,UCH-2,SS,TM,G jtu transpept ublqultin specific protease 18 

ESTs 
cyclin B1

ESTs, Weakly similar to ALUS_HUMAN ALU S
KIAA0990 protein

Homo sapiens cDNA FLJ10561 fis, clone NT
parathyroid hormone receptor 2 



SS 

cydln.SSJM.cyclIn, 

SS 

SS 

SS 

SS,TM,7trrL2,SS,TM,7trnJ2 



JM,ABC_membrane,ABC - tran,RibosoniaLS4eCbp/p300-interactir^ transactivatof, wit 
,SS,TM l transporU)rotSWIB,RhoGAP,DAG_PEglycerol-3-phosphatB dehydrogenase 1 (so 
I SS I rrm,PABP,pkinase,14-3-3,rrm poly(A)-bindlng protein, cytoplasmic 1-1 

.SS.TM.mito.carr.LysyLoxidase lysyloxidase4ike2 
SS 1 START I SS,START,NNMTJ ) NMT TEMT, steroidogenic acute regulatory protein r 
SS,histone,CaIc^CGRP„IAPP,lg ( MHCJ,SPRY,H2A histone family, member L 



SS,HMGJbox, 
histone,SS,histone,BolA 
ATP-synLab,SS,7tmJ,ATP-synLflb 
filamentSS,filament,filament 
TM 

,SS,TM,RibosomaLL27e, 

.Kunit^BPTI, 

SS,p450,SS 

SS 

SS.IL8, 

SH2,STAT,SS,STAT 
.pkinase, 

RboGEF,PH,SS,RhoGEF,PH, 

SAP.SS.FG-GAP.vwa 

SS,sushi,trypsin,vwa,rrm,fibrinogeruC,fn 

,SS,Peptidase_S8,P,Peptidase_S8,P 

.SS.TM 

.SS.Pribosyltran, 



SRY (sex determining region Y)-box 9 (ca
H2A histone family, member O
ATPase, H transporting, lysosomal (vacuo
NM_005557*:Homo sapiens keratin 16 (foca
retinoic acid receptor responder (tazaro
ESTs

tissue factor pathway inhibitor 2
cytochrome P450, subfamily IIB (phenobar
ESTs, Weakly similar to S65824 reverse t
small inducible cytokine subfamily B (Cy
signal transducer and activator of trans
CDC2-related protein kinase 7
NM_003371*:Homo sapiens vav 2 oncogene (
protein inhibitor of activated STAT3
B-factor, properdin
ESTs

lipoma HMGIC fusion partner-like 2
sialyltransferase



3.7
3.7
3.7
3.7
3.7
3.7
3.7
3.7
3.7
3.7
3.7
3.7
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445462 AA378776 Hs.288649 
439452 AA918317 Hs.57987
452017 AF109302 Hs.27495
409099 AK000725 Hs.50579
452106 AI141031 Hs.21342
447519 U46258 Hs.339665
426928 AF037082 Hs.172914
438825 BE327427 Hs.79953
414575 H11257 Hs.22968
417837 AL079905 Hs.1103
422128 AW881145
445941 AI267371 Hs.172636
429973 AI423317 Hs.164680
444542 AI161293 Hs.280380
459561 AI547306 Hs.134981
425741 AF052152 Hs.159412
426501 AW043782 Hs.293616
456508 AA502764 Hs.123469
434228 Z42047 Hs.283978
415752 BE314524 Hs.78776
400419 AF084545
439750 AL359053 Hs.57664
423858 AL137326 Hs.133483
428514 AW236861 Hs.193139
428698 AA852773 Hs.334838
448988 Y09763 Hs.22785
432072 N62937 Hs.269109
417433 BE270266 Hs.82128
452194 AI694413 Hs.332649
444051 N48373 Hs.10247
420042 AW015140 Hs.161723
457292 AI921270 Hs.334882
421458 NM_003654 Hs.104576
431104 AW970859 Hs.313503
443767 BE562136 Hs.9736
419589 AW973708 Hs.201925
415447 Z97171 Hs.78454
443464 BE548446 Hs.5167
423431 AA326062
413278 BE563085 Hs.833
458451 AW297181 Hs.195922
440449 AA885430 Hs.201925
413753 U17760 Hs.75517
434876 AF160477 Hs.61460
435575 AF213457 Hs.44234
415773 R21651 Hs.324725
446440 AV658411 Hs.42656
450847 NM_003155 Hs^5590
426075 AW513691 Hs.270149
452110 T47667 Hs.28005
439963 AW247529 Hs.6793
402837 NA
439451 AF086270 Hs.278554
406664 L34041 Hs.9739
417315 AI080042 Hs.336901
413011 AW068115 Hs.821
414987 AA524394 Hs.294022
429197 H24471 Hs.26930
448030 N30714 Hs.325960
407604 AW191962 Hs.249239
419092 J05581 Hs.89603
456672 AK002016 Hs.114727
443171 BE281128 Hs.9030
452256 AK000933 Hs.28661
432201 AI538613 Hs.298241
406642 AJ245210
400903 NA



SS.SS 

SS.SS 

SS 

SS 

SS 

SS 



hypothetical protein MGC3077 

B-cell CLL/lymphoma 11B (zinc finger pro

prostate cancer associated protein 7

hypothetical protein FLJ20718

ESTs 

ESTs 



.SS.adhjhortJGF-beta.TGFbjjropepfide retinol dehydrogenase 5 (11-cis and 9-cl 

,SSJM,hlstone,ANF._receptDr l guanytate_cy ESTs 

.SS.pkinase.lg, Homo sapiens done IMAGE:451939, mRNA se 

SS,TGFb_propepfide,TGF-beta,SS transforming growth factor, beta 1 - 

' SS gb:GV0-OT0033-010400-182-a07 OT0033 Homo 

SS.SS.Bpoxygenase.PLAT ESTs 

,SS,T-box,UDPGT ESTs 

SS,SS,PeptidaseJv11 ,EGF,ig,tecfiiu;,sushl amfnopeptidase 



SS 

.pkinase, 
SS 
SS 

.SS.TM.7tmJ 
TM 

,SS,PeptidaseJrt1, 
,TM,integruv.B,Ricin_BJectln.rrm 
,SS,TM 

.SS,START,NNMTJ 3 NMT_TEMT, 
SS 

SSJM.TM 
.Sema.ig, 

SSJM.LRRCT,LRRm*,LRR,TM l LRRCT, 
,SSJM,7trnJ,ANF_receptQr,sush! 
,SS,ig, 
.SS.CUB, 

SS,TM,SS,TM,G-patch 
SS 

,Sema,fg, 

l SS,PCI 1 RasGEF,hormonejrec,zf-C4, 
,FGF, 

SS,aF,OLF,OLF,RibosomaU.4 
SS,TM,SSF,SS,TM 
,SS,p450,p450 

.SS.TM.ublquitin.lamlniruG.laminlrLEGF.k 
,SS,RibosomaU.14 
»FGF, 

SS,laminin^GF,laminia-Nterm3dh_short,S laminin, beta 3 (niceln (125kD), kalinin 



ESTs 

Homo sapiens clone 24628 mRNA sequence
ESTs

ESTs, Weakly similar to AF208855 1 BM-01
Homo sapiens PRO2751 mRNA, complete cds
putative transmembrane protein
Target

Homo sapiens mRNA full length insert cDN
Homo sapiens mRNA; cDNA DKFZp434B0650 (f
ESTs

KIAA1866 protein

gamma-aminobutyric acid (GABA) A recepto
ESTs

5T4 oncofetal trophoblast glycoprotein
olfactory receptor, family 2, subfamily
activated leucocyte cell adhesion molecu
ESTs

hypothetical protein FLJ14251
carbohydrate (keratan sulfate Gal-6) sul
ESTs

proteasome (prosome, macropain) 26S subu
Homo sapiens cDNA FLJ13446 fis, clone PL
myocilin, trabecular meshwork inducible
Homo sapiens mRNA; cDNA DKFZp434F152 (fr
gb:EST29171 Cerebellum II Homo sapiens c
interferon-stimulated protein, 15 kDa
ESTs

Homo sapiens cDNA FLJ13446 fis, clone PL



,SS,HLH 
SS,ig t SS.TM 

,SS l TM r RbosoniaLS3Ae,G^amma 
SS 

.SS.homeobox, 
.SSjfrtf, 

,SS,TM,Activin_recp,pWnase 

,TM,p450,EtS 

SS 

.SS.Chromo^shacrow.chromo, 



Ig superfamily receptor LNIR 
triggering receptor expressed on myeloid 
ESTs, Moderately similar to A47582 B-cel 
KIAA1681 protein 
stanniocalcin 1

ESTs, Weakly similar to 2109260A B cell
Homo sapiens cDNA FLJ11309 fis, clone PL
platelet-activating factor acetylhydrola
ENSP00000241312*:DJ947L8.1.8 (novel Sush
heterochromatin-like protein 1



,SS,TM,transporLprot,SWIB > RhoGAP I DAG_PEglyceroW-phosphate dehydrogenase 1 (so 
,SS 1 RNAjJoLA l RNAj3oLA2 I RibosomaLS24e,ribosomal protein S24 

SS.LRR,LRRNT,SS,LRRNT,LRR, 
.SS.connexin.hormone.rec^f^.connexin 

,SS,GeIsoiln, 
.SSJM 
.SS.TM.C1q, 
SS.TM.SEA, 
,SS,PK,PK 

SSJMJtrnJ.muSS 
,TM,GDI,7tmJ, 

SSJM.trypsin.SS.TM.trefoll.trypsin.tref 
SS 
SS 



hypothetical protein FLJ14950
ESTs, Weakly similar to T20272 hypothet
membrane-spanning 4-domains, subfamily A
collagen, type VIII, alpha 2
mucin 1, transmembrane
Homo sapiens, clone MGC:16327, mRNA, com
TONDU

Homo sapiens cDNA FLJ10071 fis, clone HE
Transmembrane protease, serine 3
gb:Homo sapiens mRNA for immunoglobulin
Target Exon



3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.3
3.3 
3.3 
3.3 
3.3 
3.3 
3.3 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2 
3.2
3.2
3.2
3.2
3.2
3.2
3.2
3.2
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
3.1 
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434408 AI031771 Hs.132586 .SS.GIycoJiydroJ ESTs 3.1 

452994 AW962597 Hs.31305 SS,WD40,SS,WD40, KIAA1547 protein 3.1
445903 AI347487 Hs.132781 fn3,SS,TM,EF1BD class I cytokine receptor 3.1
424364 AW383226 Hs.201189 SS ESTs, Weakly similar to G01763 atrophin- 3.1
410196 AI936442 Hs.59838 UBACTjepeatSS.UBACT.repeaUhF^ 3.1
419150 T29618 Hs.89640 ,TM,pkinase,fn3, TEK tyrosine kinase, endothelial (venous 3.1
433417 AA587773 Hs.8859 ,SS,SRCR, Homo sapiens, Similar to RIKEN cDNA 5830 3.1
418624 AI734080 Hs.104211 ,Sema,ig, ESTs 3.1
436291 BE568452 Hs.5101 ,SS,abhydrolase, protein regulator of cytokinesis 1 3.1
411000 N40449 Hs.201619 SS ESTs, Weakly similar to S38383 SEB4B pro 3.1
412519 AA196241 Hs.73980 SS,Troponin,Hemagglutinin,SS,TM,C2,Tropo troponin T1, skeletal, slow 3.1
450223 AA418204 Hs.241493 ,SS,pro_isomerase, natural killer-tumor recognition sequenc 3.1
422790 AA809875 Hs.25933 ,TM,histone,Sec1,histone,sugar_tr ESTs 3.1
424269 AW137691 Hs.199754 ,SS,TM,7tm_2,GPS ESTs 3.1
435854 AJ278120 Hs.4996 ,SS,WD40 putative ankyrin-repeat containing prote 3.1
447388 AW630534 Hs.76277 ,SS,TM,rrm,oxidored_q6,oxidored_q6 Homo sapiens, clone MGC:9381, mRNA, comp 3.1
451631 R00866 SS gb:ye79c02.s1 Soares fetal liver spleen 3.0
448105 AW591433 Hs.298241 ,SS,TM,trefoil,trypsin,trefoil Transmembrane protease, serine 3 3.0
438637 BE500941 Hs.126730 ,TM,PH, ESTs, Weakly similar to KIAA1214 protein 3.0
423024 AA593731 Hs.325823 ,SS,TM,CD36 ESTs, Moderately similar to ALU5_HUMAN A 3.0
456592 R91600 ,SS,Ran_BP1,LIM,Ran_BP1,GRIP,TPR,LIM gb^qlOcOZrl Soares fetal liver spleen 3.0
425920 AL049977 Hs.162209 SS,TM,SS,TM,PMP22_Claudin,PMP22_Claudin claudin 8 3.0
444670 H58373 Hs.332938 ,SS,TM hypothetical protein MGC5370 3.0
401197 arf,arf, ENSP00000229263*:HSPC213. 3.0
437755 AW204256 Hs.291887 ,wnt, ESTs 3.0
452560 BE077084 Hs.336432 ,SS,rrm,zf-RanBP,pkinase,C2,pkinase_C,DA ESTs 3.0
410274 AA381807 Hs.61762 SS,SS hypoxia-inducible protein 2 3.0
450098 W27249 Hs.8109 SS hypothetical protein FLJ21080 3.0
404826 ,SS,TM Target Exon 3.0
458389 H70284 Hs.160152 ,SS,RA ESTs, Weakly similar to FPHU alpha-fetop 3.0
408196 AL034548 Hs.43627 HMG_box,pkinase^CHC,SS,TM,HMG_box, SRY (sex determining region Y)-box 22 3.0
433675 AW977653 Hs.75319 ,SS,ribonuc_red_sm, ribonucleotide reductase M2 polypeptide 3.0
418848 AI820961 Hs.193465 ,death,ZU5,pkinase,Activin_recp, ESTs 3.0
422095 AI868872 Hs.282804 SS,Cu-oxidase,SS,Cu-oxidase,Cu-oxidase hypothetical protein FLJ22704 3.0
415992 C05837 Hs.145807 ,SS,TM hypothetical protein FLJ13593 3.0
424631 AA688021 Hs.179808 SS ESTs 3.0
409956 AW103364 Hs.727 SS,TGF-beta,TGFb_propeptide,SS,TGF-beta, inhibin, beta A (activin A, activin AB a 3.0
419667 AU077005 Hs.92208 SS,disintegrin,Reprolysin,Pep_M12B_prope a disintegrin and metalloproteinase doma 3.0
450946 AA374569 Hs.127698 SS ESTs, Moderately similar to 2109260A B c 3.0
447770 AB032417 Hs.19545 Frizzled,Fz,SS,TM,Frizzled,Fz, frizzled (Drosophila) homolog 4 3.0
423826 U20325 Hs.1707 SS,SS cocaine- and amphetamine-regulated trans 3.0
418838 AW385224 Hs.35198 ,SS,TM,Phosphodiest, ectonucleotide pyrophosphatase/phosphodi 3.0
442804 AW300118 Hs.131257 ,SS,TM,G-gamma ESTs 3.0
432284 AA532807 Hs.105822 ,SS,TM,pkinase, ESTs 3.0
TABLE 20A

Table 20A shows the accession numbers for those pkeys lacking UnigeneID's for Table 20.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers

Pkey     CAT number   Accessions
410785   1221055_1    AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
411667   1253334_1    BE160198 AW935898 T11520 AW935930 AW856073 AW861034
418636   177402_1     AW749855 AA225995 AW750208 AW750206
420854   197072_1     AW296927 AI684514 AI263168 AA281079
422128   211994_1     AW881145 AA490718 M85637 AA304575 T06087 AA331991
423431   228162_1     AA326062 AA325758 AW962182
423945   233566_1     AA410943 AW948953 AA334202 AA332882
426878   273265_1     BE069341 AW748403 AL044891 AI908240 AA393080
433687   373061_1     AA743991 AA604852 AW272737
447197   711623_1     R36075 AI366546 R36167
451631   878098_1     R00866 R01523 AI806815
456207   165078_-1    AA193450
456592   202684_1     R91600 T87079 AA291455
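Table 20A pairs each pkey with a gene cluster (CAT) number and the Genbank accessions that make up the cluster. The following is a minimal Python sketch of how such rows could be read into a lookup from accession to pkey. It assumes whitespace-separated rows and a CAT number written as a single token (for example 228162_1); the function names `parse_cluster_row` and `accession_index` are hypothetical and are not part of the original disclosure.

```python
from typing import Dict, Iterable, List, Tuple


def parse_cluster_row(row: str) -> Tuple[str, str, List[str]]:
    """Split one row into (pkey, CAT number, list of accessions).

    Assumes whitespace-separated fields: pkey, CAT number, then one or
    more Genbank accession numbers.
    """
    fields = row.split()
    if len(fields) < 3:
        raise ValueError(f"expected pkey, CAT number and accessions: {row!r}")
    pkey, cat_number, *accessions = fields
    return pkey, cat_number, accessions


def accession_index(rows: Iterable[str]) -> Dict[str, List[str]]:
    """Map each accession to the pkeys whose cluster contains it."""
    index: Dict[str, List[str]] = {}
    for row in rows:
        pkey, _cat_number, accessions = parse_cluster_row(row)
        for accession in accessions:
            index.setdefault(accession, []).append(pkey)
    return index


if __name__ == "__main__":
    sample_rows = [
        "423431 228162_1 AA326062 AA325758 AW962182",
        "451631 878098_1 R00866 R01523 AI806815",
    ]
    print(accession_index(sample_rows)["AW962182"])
```

Keeping a list of pkeys per accession, rather than a single value, allows for the possibility that an accession is listed under more than one probeset.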
TABLE 20B



Table 20B shows the genomic positioning for those pkeys lacking UnigeneID's and
accession numbers in Table 20. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique Eos probeset identifier number
Ref: Sequence source. "Dunham I. et al." refers to the publication
entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey     Ref       Strand   Nt_position
400608   9887666   Minus    9675647558
400903   2911732   Plus     59112-59228
401045   8117619   Plus     90044-90184,91111-91345
401093   8516137   Minus    22335-23166
401197   9719705   Plus     176341-176452
401747   9789672   Minus    11859^118816,119119-119244,119609-119761,120422-120990,130161-130381,130463-130593,131097-1312^ 131932132451-132575133580-134011
401780   7249190   Minus    28397-28617,28920-29045,29135-29296,29411-29567,29705-29787,30224-30573
401781   7249190   Minus    83215-83435,83531-83656,83740-83901,84237-84393,84955-85037,86290-86814
401866   8018106   Plus     73126-73623
402230   9966312   Minus    29782-29932
402408   9796239   Minus    110326-110491
402578   9884928   Plus     66350-66496
402606   9909429   Minus    81747-82094
402837   9369121   Minus    2013-2186,9570-9758,11136-11309,19429-19677,21210-21455,23368-23562,24342-24527,29132-29320
403199   9958183   Minus    58895-59036,66618-66789
403329   8516120   Plus     96450-96598
403593   6862650   Minus    62554-62712,69449-69602
403943   7711864   Plus     100742-100904,101322-101503
404091   7684554   Minus    82121-83229
404347   9838195   Plus     74493-74829
40483    6572184   Plus     47726-48048
405366   2182280   Plus     22478-22632
405654   4895155   Minus    53624-53759
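The nucleotide positions of the predicted exons in Table 20B are given as comma-separated start-end ranges. As an illustration only, the Python sketch below parses one such entry and sums the exon lengths. It assumes inclusive coordinates, which the text does not state, and the function names `parse_exon_positions` and `total_exon_length` are hypothetical.

```python
import re
from typing import List, Tuple

# One range looks like "90044-90184": digits, a hyphen, digits.
_RANGE = re.compile(r"^(\d+)-(\d+)$")


def parse_exon_positions(field: str) -> List[Tuple[int, int]]:
    """Parse a comma-separated list of start-end ranges into integer pairs."""
    exons: List[Tuple[int, int]] = []
    for piece in field.split(","):
        match = _RANGE.match(piece.strip())
        if match is None:
            # Entries that are not clean ranges are rejected, not guessed at.
            raise ValueError(f"unreadable exon range: {piece!r}")
        start, end = int(match.group(1)), int(match.group(2))
        if start > end:
            raise ValueError(f"start after end in exon range: {piece!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


if __name__ == "__main__":
    exons = parse_exon_positions("90044-90184,91111-91345")
    print(exons, total_exon_length(exons))
```

An entry that does not consist of clean ranges raises `ValueError`, so that a damaged position is flagged for manual review instead of being silently repaired.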
TABLE 21: 210 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT TISSUES THAT ARE
LIKELY TO ENCODE EITHER ENZYMES OR PROTEINS
AMENABLE TO MODULATION BY SMALL MOLECULES

Table 21 shows 210 genes up-regulated in breast cancer compared to normal adult tissues
that are likely to encode either enzymes or proteins amenable to modulation by small
molecules. These were selected as for Table 19, except that the ratio was greater than or equal
to 3.0, the "average" normal adult tissue level was set to the 85th percentile value amongst
144 non-malignant tissues, the 96th percentile value amongst 73 breast cancers was
greater than or equal to 80 units, and the predicted protein contained a structural domain that is
indicative of enzymatic function or of being modulatable by small molecules (e.g. pkinase,
peptidase, phosphatase, ATPase, or ion_transporter domains). The predicted protein domains
are noted.
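The selection rule described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the procedure actually used: the interpolation method for percentiles, the keyword list for small-molecule-amenable domains, and the names `percentile`, `has_druggable_domain` and `table21_filter` are all hypothetical. The tumor percentile is left as a parameter (default 96, as in the description) because the column legend for R1 refers to the 93rd percentile.

```python
from typing import Sequence, Tuple

# Hypothetical keyword list, based only on the examples named in the text
# (pkinase, peptidase, phosphatase, ATPase, ion transporter domains).
DRUGGABLE_KEYWORDS = ("pkinase", "peptidase", "phosphatase", "atpase", "ion_trans")


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile (0-100) using linear interpolation between ranks."""
    if not values:
        raise ValueError("percentile of an empty sample")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def has_druggable_domain(domains: str) -> bool:
    """True if any comma-separated domain name contains a listed keyword."""
    names = [name.strip().lower() for name in domains.split(",")]
    return any(key in name for name in names for key in DRUGGABLE_KEYWORDS)


def table21_filter(
    tumor: Sequence[float],
    normal: Sequence[float],
    domains: str,
    tumor_pct: float = 96.0,
    normal_pct: float = 85.0,
    min_ratio: float = 3.0,
    min_tumor_level: float = 80.0,
) -> Tuple[bool, float]:
    """Return (selected, ratio) for one probeset under the stated thresholds."""
    tumor_level = percentile(tumor, tumor_pct)
    normal_level = percentile(normal, normal_pct)
    # Guard against division by zero; the handling of non-positive
    # normal levels is an assumption.
    ratio = tumor_level / normal_level if normal_level > 0 else float("inf")
    selected = (
        ratio >= min_ratio
        and tumor_level >= min_tumor_level
        and has_druggable_domain(domains)
    )
    return selected, ratio


if __name__ == "__main__":
    tumor = [10.0] * 60 + [400.0] * 13   # 73 breast cancer samples
    normal = [20.0] * 144                # 144 non-malignant tissue samples
    print(table21_filter(tumor, normal, "SS,TM,pkinase"))
```

Matching on substrings of the domain names is a deliberately loose choice here, so that names such as Peptidase_M10 or Y_phosphatase are caught by a short keyword list.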






Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of 93rd percentile tumor to 85th percentile of normal body tissue



Pkey ExAccn UnigeneID Predicted Protein Domains



449746 
400292 
424735 
407178 
408045 
450375 
429170 
445730 
424634 
420757 
424399 
447350 
456207 
456938 
402578 
425692 
424001 
418007 
421727 
411869 
400289 
443348 
424086 
400295 
408771 
421155 
424905 
438167 
459583 
423945 
445263 



AI668594 Hs.176588
AA250737 Hs.72472
U31875 Hs.272499
AA195651 Hs.104106
AW138959 Hs.245123
AA009647 Hs.8850
NM_001394 Hs.2359
AI624342 Hs.170042
NM_003613 Hs.151407
X78592 Hs.99915
AI905687 Hs.2533
AI375572 Hs.172634
AA193450

X52509 Hs.161640

D90041 Hs.155956
W67683 Hs.137476
M13509 Hs.83169
Y13153 Hs.107318
W20027 Hs.23439
X07820 Hs.2258

AW873596 Hs.182278
AI351010 Hs.102267
W72838 Hs.2533
AW732573 Hs.47584
H87879 Hs.102267
NM_002497 Hs.153704
R28363 Hs.24286
AI907673
AA410943

H57646 Hs.42586



SS,p450 

death,ZU5,TM,AcfivinjBcp ( pkinase t . 
SS.TM 

SS.Dihydroorotase, 
Phosphodlest,SomatomedIn_B, 

SS,TM,disintegrin p Pep_M12Bj)ropep l Reprol a dislntegrin and metalloprotelnase doma 



Unigene Title

ESTs, Weakly similar to CP4Y_HUMAN CYTOC
BMP-R1B 

short-chain alcohol dehydrogenase family 
ESTs 
ESTs 



DSPc,Rhodanese, 

SSJM,Cation_efflux 

rg.tspJ.SS.AAA 

hormone rec,Androgen_recep,zf-C4, 
SS 



SSJM^SO^SO 



dual specificity phosphatase 4 
ESTs 

cartilage intermediate layer protein, nu 
androgen receptor (dihydrotestosterone r
aldehyde dehydrogenase 9 family, member 
ESTs 

gb:zr40e07.r1 Soares_NhHMPu_S1 Homosapi 



SS,TM,aminotran_1_2,CadherlruCJerm,cadhtyrosine aminotransferase 
SS,p450,SS,TM,p450 C1001134:gi|2117372|pir||l65981 fatty ac 

SSAcetyltransf2, N-acetyltransferase 1 (arylamine N-acety 

pkinase, paternally expressed 10 

SS,hemopexin l Peptee_M10,SS,Peptidase_matrix metaKoprotfilnase 1 (MMP1; Inters 

kynurenine 3-monooxygenase (kynurenine 3 
SS,Peptidase_M1, ESTs 

hemopexln,PeptidaseJWO,SS,PeptidaseJWOmatfc metalloprotelnase 10 (MMP10; str 
SS.DENN calmodulin 2 (phosphoryiase kinase, delt 

SS ( Lysyl_oxidase lysyl oxidase 

SS aldehyde dehydrogenase 9 family, member 

TM l K^tetra,iorLtrare,SS,TM l K_tetra,ton_t potassium voltage-gated channel, delayed 
SS,LysyLoxidaseAIdose_epim,Epimerase,S lysyl oxidase 
pkinase.SSJM.pWnase.pol^renyLsynt 
SS f TM,7tm_1,p450,mn ESTs 

pkinase, gb:IL-BT152-080399-004BT152Homosaplen 
deamZU5JM^dIvln - recp,pWnase l gb:zt32h03.r1 Soares ovary tumor NbHOT H 
SS^cyltransferase, - KIAA1560 protein 



R1 

65.7 
55.9 
53.8 
39.3 
34.9 
25.7 
24.9 
24.1 
21.7 
21.7
20.3 
19.2 
18.3 
18.1 
17.8 
16.7 
16.5 
15.7 
15.3 
13.9 
13.5
13.0
12.8
12.7 
11.9 

m 

11.7 
11.5
11.5
11.4 
11.2 



423887 
429859 
428368 
418912 
451952 
402408 
415539 
406687 
419948 
400285 
408380 
409203 
424902 
431725 
418092 
427811 
423554 
426261 
413374 
432677 
456986 
453331 
407721 
418004 
410555 
443695 
423545 
401045 

442082 
418836 
404091 
450865 
424085 
449650 
432304 
415669 
442942 
428795 
428479 
400300 
447359 
402230 
424687 
432328 
432239 
400286 
425247 
434737 
439310 
441111 
452355 
427711 
429353 
425325 

448706 
403593 
432777 
446232 
411165 
414117 
411096 
450506 
417975 
447752 
427122 



AL080207 Hs.134585 
NMJ)07050Hs.225952 
BE440042 Hs.83326 
NM_000685Hs.89472 
AL120173 Hs.301663 
NA 

AI733881 Hs.72472 
M31126 Hs.272620 
AB041035 Hs.93847 
NA 

AF123050 Hs.44532 
AA780473 Hs.687 
NM_003866Hs.153687 
X65724 Hs.2839 
R45154 Hs.106604 
M81057 Hs.180884 
M90516 Hs.1674 
AW242243 Hs.168670 
NM_001034Hs.75319 
NI^L004482Hs.278611 
D38299 Hs.170917 
AI240665 Hs.8895 
Y12735 Hs.38018 
U37519 Hs.87539 
U92649 Hs.64311 
AW204099 Hs.337720 
AP000692 Hs.129781 



SS > TM,BRCT,ank^BC„tran l ABC_tran DKFZP434G232 protein . 
SS 1 TM,Yjjrwsphatase,MAM < fn3, protein tyrosine phosphatase, receptor t • 

SSPeptidase^M10 l hernopexln l SS,Peptidase - matrix metalloproteinase 3 (stromelysln 
SS,TM,7tnLl.SS l TM l 7tnLl. angiotensin receptor 1 

SS,pidnase, . ESTs 

SS,carb_anhydrase NkLQ30920*:Homo sapiens hypothetical pro. 

death,ZU5JM l Activin.recp ( pkinase, BMP-R1B 
SS,PeptidaseJ/l10,hemopexin,SS,Peptidasejnatrix metalloproteinase 11 (MMP11; stro 
Ferrfc_reductJM,Ferrlcjeduct NMJ1 6931 :Homo sapiens NADPH oxldase4 ( 

TM,ABCJran,ABC_rnembrane, Eos Control 

SS l TM,ubiqu!tln,7(3rrL3 l ANFjeceptor,sushl diubiquitfn 
SS,p450,SS,p450 cytochrome P450, subfamBy IVB, polypept 

SS f SS Inositol polyphosphate-4-phosphatase, ty 

SS.CysJnotSS Nome disease (pseudogKoma) 

death,ZU5,pkinase l Activin_recp > ESTs 
SS,2rucart)OpeptPropep_M14 l SS i Propep_M14carboxypeptldase B1 (tissue) 
GATase_2,SIS, glutamine-fructose-6-phosphate transamin 

SS,TM,WD40,ubiquitin I E1-E2^TPase,Cationperoxisornai famesyiated protein 
SS ribonucleotide reductase M2 polypeptide 

SS,TM,Glycbs - transL2,Ricin3-lectin > DPP UDP-N-acetyl-alpha-D-galactosamlne:polyp 
SS.WtmJ, prostaglandin E receptor 3 (subtype EP3) 

SS l TM,dislntegrin,Pep_M12Bjpropep,Reprol ESTs 

pkinase, duat-speciftcity tyroslne-(Y)-phosphoryl 

SS,aldedh,SS f atdedh. . aldehyde dehydrogenase 3 family, member 

TM,disintegrin,Repro)ysin, a disintegrin and metatioproteinase doma 

ESTs, Weakty similar to AF126780 1 retin 
GAF,PDEase chromosome 21 open reading frame 5 

ICEj>20,SS l ICE_p10,ICE_p20JCEj)10 t lCEj) 



C11001883*gi|6753278|ref|NPJ)33938.1| c 
R41823 Hs.7413 TM < EPH_lbd,pkinase ( SAM ( fn3, 

TM,ActMn_recp,pkinase,death,ZU5 l 



TM,7tm_3 l ANF_receptor ( 
zf-C2H2 

SS^AA,VtraLhelicase1,rrm, 
TM,ionJrans,SS,TM,ionJrans, 

TM,7tmJ. 
SS,serpin, 

SS.tg.Sema.pkinase, 
SSJM 

SS.pkinase.pklnase 



AI655499 Hs.161712 
NA 

AI248013 Hs.106532 
NMJ)02914Hs.139226 
AF055575 Hs.23838 
AA932186. Hs.69297 
NWL005025HS.78589 
AW167087 Hs.131562 
R45503 Hs.97469 
Y00272 Hs.184572 
X03363 

NM.012093HS.18268 
NA 

J05070 Hs.151738 
AI572739 Hs.195471 
X81334 Hs.2936 
NA 

NM_005940Hs.155324 
AA828246 Hs.291884 
AF086120 Hs.102793 
AI806867 Hs.126594 
N54926 Hs.29202 
M31659 Hs.180408 
AL1 17406 Hs.200102 
X52730 Hs.1892 
phenylethanolamine N-methyltransferase
AW291095 Hs.21814 SS.TM.pkmase. 
NA < 

AA564991 Hs.269477 
AI281848 Hs.194691 
NMJ)00169Hs.69089 
W88559 Hs.1787 
U80034 Hs.68583 
NM.004460Hs.418 
AA641836 Hs.30085 
M73700 Hs.105938 
AW057736 Hs.323910 



6.7 

ESTs; calsyntenin-2
ESTs 

Target Exon 

ESTs, Weakly similar to 138588 reverse t 
replication factor C (activator 1) 2 (40 
calcium channel, voltage-dependent I ty 
ESTs 

serine (or cysteine) proteinase inhibito 
ESTs 

ESTs, Highly similar to A39769 N^cetyll 
cell division cycle 2, G1 to S and G2 to 



SS,TM,pkinase,Recep„U - domain ( SH2 ( PH l FuriHER2 receptor tyrosine kinase (c-erb-b2, 
SS.adenylatekinase, adenylate kinase 5 

SS,TM,p450, Target Exon 

SSPeptWase_M10,fn2,hemopexin,SS,TM,Pepmatrix metalloproteinase 9 (gelatinase B 
6PF2K.PGAM, 6-phosphofm(^2-Wnase/rnJCtose-2,6-bl 
SS.PepUdaseJ/IIO.hemopexIn.SS.Peptidase.matnx metalloproteinase 13 (coliagenase 
SS,TM^BC.tran^BC_membrane,SS C16000922:gI|7499103|pir||T20903 hypothe 
SS,Peptidase_M10,hemopexln,SS matrix metalloproteinase 11 (MMP11; stro 

UCH-1 l pklnase l OPR,Rhodanese l AMP-binding l ESTs 
SS t TM,UDPGT,caseirLkappa ESTs 
SS,TM,Phosphodiest, ESTs 
TM,7tm_1 ,TM G protein-coupled receptor 34 

SS solute carrier family 25 (mitochondria) 

SSJM.ABC tran^BC membrane, ATP-binding cassette transporter MRP8 

SS,NN^PNlVn'_TEMT l SS l NNMT_PNMT_TEMT,STAR 

5.1 

interleukin 20 receptor, alpha 
CIDE-N,pkinase Target Exon 

aipha-amylase, . ESTs 

SS,TK7triL3,Rlbo5omaU.13 retinoic acid induced 3 

SS,Piteiibiase,BTK l PH,pkinase,SH2,SH3,Ribo galactosldase, alpha 
TM.ioriJrans.KJetra, proteolipW protein 1 (Pellzaeus-Merzbac 

Peptidase_M3, mitochondrial intermediate peptidase 

SS ) DPPOJerm.Peptidase.S9,SS,DPPiV_N Jbroblast activation protein, alpha 
SS.trypsJn hypothetical protein FU231 86 

SS,transferrin,7tmJ .transferrin. lactotransferrin 
SS,TM ( pkinase,Recep_Ldomain,SH2,PH,FuriHER2 receptor tyrosine kinase (c-erb-b2, 
10.3

10.3

10.3 

9.8 

9.4 

9.1 
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8.8 

8.6 

8.3

8.3

8.3

8.3

8.2 

8.1 

73 

7.6 

7.6 

7.5 

73 

11 

7.2 

7.1 

6.9 
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400181 NA 

452093 AA447453 Hs.27860
435542 AA687376 Hs.269533
417576 AA339449 Hs.82285
446089 AI860021 Hs.270651
424420 BE614743 Hs.146688
452190 H26735 Hs.91668
419986 AI345455 Hs.78915
421582 AI910275 Hs.1406
446733 AA863360 Hs.26040
453080 AW294092 Hs.21594
400205 NA
420854 AW296927
432690 AF181490 Hs.278627
441560 F13386 Hs.7888
416445 AL043004 Hs.79337
439024 R96696 Hs.35598
432882 NM_013257 Hs.279696
447754 AW073310 Hs.163533
453775 NM_002916 Hs.35120
431657 AI345227 Hs.105448
427899 AA829286 Hs.332053
422938 NM_001809 Hs.1594
418478 U38945 Hs.1174
406922 S70284
439285 AL133916 Hs.172572
429922 Z97630 Hs.226117
420139 NM_005357 Hs.95351
425071 NM_013989 Hs.154424
424511 BE300512 Hs.193557
428722 U76456 Hs.190787
428330 L22524 Hs.2256
414831 M31158 Hs.77439
413589 AW452631 Hs.313803
418629 BE247550 Hs.86859
413453 AA129640 Hs.128065
403943
444618 AV653785 Hs.173334
408761 AA057264 Hs.238936
427809 M26380 Hs.160878
418203 X54942 Hs.83758
454071 AI041793 Hs.42502
424676 Y08565 Hs.151678
457465 AW301344 Hs.122908
417601 NM_014735 Hs.82292
446619 AU076643 Hs.313
406625 Y13647 Hs.119597
428769 AW207175 Hs.106771
426310 NM_000909 Hs.169266
417531 NM_003157 Hs.1087
444781 NM_014400 Hs.11950
431493 AI791493 Hs.129873
428966 AF059214 Hs.194687
414175 AI308876 Hs.103849
455325 AW895719
429597 NM_003816 Hs.2442
425320 U29344 Hs.83190
431854 AA383550 Hs.271699
407104 S57296 Hs.323910
449051 AW961400 Hs.333526
434398 AA121098 Hs.3838
454042 H22570 Hs.172572
407756 AA116021 Hs.38260
401464 AF039241 Hs.9028
412970 AB026436 Hs.177534
412049 N53437 Hs.18268
425776 U25128 Hs.159499



SS,TM,3Beta_HSD, ENSP00000171555:CDNA FU10727 fts, clone 

SS,TM,7tm 1 , Homo sapiens mRN A; cDNA DKFZp586M0723 (f 

SS,pkinase 1 RhoGEF 1 lg,PH,SH3, ESTs 

AIRS,forniyUransf,GARS,SS,GARS,AIRS,for phosphoribosylglyctnamlde forrnyftransfer 
pkinase ESTs, Moderately similar to A47582 B-cel 

SS.TM.MAPEG, prostaglandin E synthase 

SS,TM,PH,SH2,Fun7vlike,pWnase l RecepJ_ Homo sapiens clone PP1498 unknown mRNA 
pkinase.OPR, GA-binding protein transcription factor, 

SS,trefoil,SS,TM,ldLrecepLa l SRCR,tryps trefoil factor 1 (pS2) 



SS,TM,p450. 

SS,ras,Y_phosphatase,ras 

SS,TM,Peptidase_M1, 
SS,pyr_redox f SS,RibosomaLL39 
p kinase, 
SS.pkinase, 
SSJM.trypsln.vwd.tg 
pkinase,pkinase_C, 
p kin as e , 

SS,AAA,PI3 PI4_kinase,Pl3Ka,PI3K_rbd,PI3 replication factor C (activator 1)4 (37 
SSJM.pkfoase ESTs, Weakiy similar to B34087 hypotheti 

SS,SAAjJ«)telns^BC_membraneABC_tran l serum amyloid A1 



ESTs, Weakly similar to fatty acid omega 
hypothetical protein MGC15754
NM_006265*:Homo sapiens RAD21 (S. pombe)
gbUI-H-BW0-aiw>07-(WJU1 NCI_CGAP.Su
prenylcysteine lyase

Homo sapiens clone 23736 mRNA sequence
KIAA0135 protein
ESTs

serum/glucocorticoid regulated kinase-li
Homo sapiens cDNA FLJ14142 fis, clone MA



SS.TM.thblase, ■ 
ank.ank 

SS,TM,Desaturase,SS 
SS,ig,pklnase,LRRNT,LRRCT, 

SS,TM,Gnker_histone,7tmJ 
SS,TM,p450, 

SS,T4_deiodinase,T4_deiodinase ( 
SS,Y_phosphatase,Band_41 
SS.TIMP, 



centromere protein A (17kD) 
cyclin-dependent kinase inhibitor 2A (me
gb:stearoyl-CoA desaturase [human, adipo
hypothetical protein FLJ20093
H1 histone family, member 0
lipase, hormone-sensitive
deiodinase, iodothyronine, type II
ESTs, Moderately similar to ALU7_HUMAN A
tissue inhibitor of metalloproteinase 4



SS,Peptidase_M10,SS,Peptidase - M10,hemopematrix metailoproteinase 7 (MMP7; uterin 
SS,cNMP_blnding,Rlla,HMG_box protein kinase, cAMP-dependent, regulato 

SS.abhydrofase * ESTs, Highly similar to AF157833 1 noncl 

SS,SH2.PH,SS JM,PH,SH2,Furin-fike,pklnas growth factor receptor-bound protein 7 



ESTs 

C5000355:gi|4503225|ref|NP_000765.1| cyt
ELL-RELATED RNA POLYMERASE II, ELONGATIO
ESTs, Weakly similar to (defline not ava
lipoprotein lipase
CDC28 protein kinase 2
ESTs

UDP-N-acetyl-alpha-D-galactosamine:polyp
DNA replication factor
KIAA0215 gene product
secreted phosphoprotein 1 (osteopontin,
stearoyl-CoA desaturase (delta-9-desatur
ESTs

neuropeptide Y receptor Y1
serine/threonine kinase 2
GPI-anchored metastasis-associated prote
ESTs, Weakly similar to A36036 cytochrom 



SS.PeptIdase_C1.gpdh 
p450,SS,p450 

SS,TM,7tmJ, 
SSJipase.PLAT.Sec^PH, 
CKS.SS.CKS, 
TM.7tn_1, 

Glycos - transfJ,RiclrLB_lectin,SS,Glycos 
SS.Pribosyltran.Sulfatase 
PHD,pkinase,SS 
SS,TM,efhand,tonJrans 
SS,TM,Desaturase,SS 
SS,7trtLl,SPRY, 
SS.WtmJ, 

SS.pkinase.vwa.vwa.GlycoJransfjB 
SS.PH.IactamaseJB 
SS.p450,p450 

cholesterol 25-hydroxylase 
TM.hemopexin.Peptidase.MIO.hemopexinPephypothetical protein DKFZp761D112 
TM,ion_trans.ietetra, gb:QV4-NN0039-290300-1 54406 NN0039 Homo 

a disintegrin and metailoproteinase doma 
A<^transf,adrL2inc 1 ket<)acykyntpp-bi fatty acid synthase 
IMS.SS polymerase (DNA directed) lota 

SSJM,SH2 # PH J pkinase l Recep_Ldomain,Furiv-erb-b2 avian erythroblastic leukemia v 
SS HER2 receptor tyrosine kinase (oerb-b2, 

ptdnase,POLOj30x,SS,pkinase,POLO_box. serum-inducible kinase 
SS.ig.pklnase.LRRNT.LRRCT, hypothetical protein FU20093 

SS,UCH-1 ,UCH-2,SS,TM,G_gluJranspept ubiquiUn specific protease 18 

histone deacetylase 5 
dual specificity phosphatase 10 
adenylate kinase 5 
parathyroid hormone receptor 2 
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Peptidase_M24, 
DSPcMyosinJaii, 
SS.adenylatekinase, . 
SS,WtmJ.SS,TM,7tm_2 
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407846 AA426202 Hs.40403 

406925 L34041 Hs.9739
445873 AA250970 Hs.251946
418054 NM_002318 Hs.83354
408815 AA833930 Hs.288036
410530 M25809 Hs.64173
407021 U52077
421168 AF182277 Hs.330780
431473 AA825686 Hs.321176
408101 AW968504 Hs.123073
422083 NM_001141 Hs.111256
411393 AW797437 Hs.69771
435767 H73505 Hs.117874
433068 NM_006456 Hs.288215
426928 AF037062 Hs.172914
414575 H11257 Hs.22968
445941 AI267371 Hs.172636
444542 AI161293 Hs.280360
425741 AF052152 Hs.159412
434228 Z42047 Hs.283978
433264 D85782 Hs.3229
400419 AF084545
439750 AL359053 Hs.57664
417757 R19897 Hs.106604
452194 AI694413 Hs.332649
421458 NM_003654 Hs.104576
443767 BE562136 Hs.9736
422648 D86983 Hs.118893
423431 AA326062
451264 AI768235
452110 T47667 Hs.28005
439963 AW247529 Hs.6793
453941 U39817 Hs.36820
406664 L34041 Hs.9739

3 5 phosphate dehydrogenase 1 (so 

453487 R31770 Hs.23540
420911 U77413 Hs.100293
443171 BE281128 Hs.9030
452256 AK000933 Hs.28661
432201 AI538613 Hs.298241
419150 T29618 Hs.89640
444443 AI149286 Hs.55099
426283 NM_003937 Hs.169139
436291 BE568452 Hs.5101
450223 AA418204 Hs.241493
424269 AW137691 Hs.199754
448105 AW591433 Hs.298241
452560 BE077084 Hs.336432



TM^BC_membrane^C_tran,RibosonraLS4eap^(HWntera(^ transactivator, wit 
SS JM 1 transportprot,SWlB,RhoGAP4)AGJ > E-glycerol-3-phosphate dehydrogenase 1 (so 



SS l rrm l PABP,pWnase,14-3-3 l rrm 
SS,TM 1 mto_cajT,Lysyl„oxWase 
SSJPPT, 

ATP-synLab,SS,7tnu1 ATP-synLab 

SS,p450,SS 
SS 



lipoxygenasepLAT, 
SS.sushi.trypsIn.vwa.nrm.fibrinogerLC.fn 
SS,Peptidase_S8,P,Peptidase..S8,P 
SS,Pribosyltran, 

SS ( adh_short t TGF-betaJGFbj3ropeptIde 

SS.pklnaseJg, 

SS,SS,lipoxygenase,PLAT 

SS.SS.PeptldaseJ^I.EGF.tg.tectirucsushi 

pkinase, ' 

SS f TM,7tmJ 



SS,Peptidase_M1, 
TM,integrin_B l RidrLBJeciin t rrm 
deamZU5,pkinase,ActMruBcp, 
SS,TM,7trn_3,ANF._receptor 1 sushi 
SS 

SS,PCI ( RasGEF,hormone_n3C,zf-C4, 
peroxidase,LRRCT ) 
SS.p450.p450 
SS.Trehalase 

SS,TM,Activjn_recp,p)dnase 
TM,p450,Ets 

SS.DEAD.HRDC,helicase.C ( 
SSJM ( transponU>rotSWIB^hoGAP.DAGJ 5 E 
3.1 

TM,7tm_1. 



poly(A)-binding protein, cytoplasmic 1-1
lysyl oxidase-like 2
tRNA isopentenylpyrophosphate transferas
ATPase, H transporting, lysosomal (vacuo
gb:Human mariner1 transposase gene, comp
cytochrome P450, subfamily IIB (phenobar
ESTs, Weakly similar to S65824 reverse t
CDC2-related protein kinase 7
arachidonate 15-lipoxygenase, second typ
B-factor, properdin
ESTs

sialyltransferase

retinol dehydrogenase 5 (11-cis and 9-ci
Homo sapiens clone IMAGE:451939, mRNA se
ESTs

aminopeptidase

Homo sapiens clone 24628 mRNA sequence
Homo sapiens PRO2751 mRNA, complete cds
cysteine dioxygenase, type I
Target

Homo sapiens mRNA full length insert cDN
ESTs

olfactory receptor, family 2, subfamily
carbohydrate (keratan sulfate Gal-6) sul
proteasome (prosome, macropain) 26S subu
Melanoma associated gene
gb:EST29171 Cerebellum II Homo sapiens c
gb:wg82g08.x1 Soares_NSF_F8_9W_OT_PA_P_S
Homo sapiens cDNA FLJ11309 fis, clone PL
platelet-activating factor acetylhydrola
Bloom syndrome



SS,TM,7trnJ,rrm P SS 
TM,GDI,7trn_1, 

SS.TM.tjypsin.SSJM.trefoil.trypsin.tref 
TM.pxinase.fh3, 
SS 

SS,abhydro!ase, 
SS.proJsomerase, 
SS,TM,7trrL2,GPS 
SSJM.trefoii.trypsin.trefbii 
SS l rrm l rJ-RanBP,pkInase l C2 1 pkinase_C,DAGESTs 



ESTs

O-linked N-acetylglucosamine (GlcNAc) tr
TONDU

Homo sapiens cDNA FLJ10071 fis, clone HE
Transmembrane protease, serine 3
TEK tyrosine kinase, endothelial (venous
rab6 GTPase activating protein (GAP and
kynureninase (L-kynurenine hydrolase)
protein regulator of cytokinesis 1
natural killer-tumor recognition sequenc
ESTs

Transmembrane protease, serine 3
3.4
3.4
3.0
TABLE 21A

Table 21A shows the accession numbers for those pkeys lacking Unigene IDs for Table 21.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers



Pkey CAT number Accessions
420854 197072_1 AW296927 AI684514 AI263168 AA281079
423431 228162_1 AA326062 AA325758 AW9B2182
423945 233566_1 AA410943 AW948953 AA334202 AA332882
451264 863988_1 AI768235 R31400 H29082 H23107
455325 1279475_1 AW895719 N31451 N41451
456207 165078_-1 AA193450
TABLE 21B

Table 21B shows the genomic positioning for those pkeys lacking Unigene IDs and
accession numbers in Table 21. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.

Pkey Ref Strand Nt_position
401045 8117619 Plus 90044-90184,91111-91345
402230 9966312 Minus 29782-29932
402408 9796239 Minus 110326-110491
402578 9884928 Plus 66350-66498
403593 6862650 Minus 62554-62712,69449-69602
403943 7711864 Plus 100742-100904,101322-101503
404091 7684554 Minus 82121-83229
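
As an illustration of how an Nt_position entry in Table 21B can be read, the following minimal Python sketch splits one entry into exon ranges. The function names `parse_nt_position` and `exon_lengths` are hypothetical, and treating both coordinates as inclusive is an assumption; the table does not state the coordinate convention.

```python
from typing import List, Tuple


def parse_nt_position(field: str) -> List[Tuple[int, int]]:
    """Split an Nt_position entry such as '90044-90184,91111-91345'
    into a list of (start, end) exon ranges."""
    exons = []
    for part in field.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"start after end in range {part!r}")
        exons.append((start, end))
    return exons


def exon_lengths(exons: List[Tuple[int, int]]) -> List[int]:
    """Length of each exon, assuming inclusive coordinates (an assumption)."""
    return [end - start + 1 for start, end in exons]


# Entry for Pkey 401045 in Table 21B
exons = parse_nt_position("90044-90184,91111-91345")
print(exons, exon_lengths(exons))
```

The strand column is not used here; a caller working with Minus-strand entries would need to decide separately how to orient the ranges.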
TABLE 22: 739 GENES UP-REGULATED IN BREAST CANCER
COMPARED TO NORMAL ADULT BREAST

Table 22 shows 739 genes up-regulated in breast cancer compared to normal adult breast.
These were selected as for Table 19, except that the ratio was greater than or equal to 3.0, the
denominator was the 85th percentile value for 12 non-malignant breast specimens, and the
96th percentile value amongst the 73 breast cancers was greater than or equal to 100 units.

Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of 90th percentile tumor to 85th percentile normal breast tissue
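
The selection rule described for Table 22 can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names are hypothetical, the linear-interpolation percentile is an assumed method (the text does not say how percentiles were computed), and the 90th percentile numerator is taken from the R1 legend rather than from the Table 19 criteria, which are not reproduced here.

```python
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Percentile by linear interpolation between ranks (assumed method)."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def r1_ratio(tumor: Sequence[float], normal: Sequence[float],
             tumor_pct: float = 90.0, normal_pct: float = 85.0) -> float:
    """R1: tumor percentile divided by normal-breast percentile.
    The denominator is assumed to be positive."""
    return percentile(tumor, tumor_pct) / percentile(normal, normal_pct)


def is_upregulated(tumor: Sequence[float], normal: Sequence[float],
                   ratio_cutoff: float = 3.0, floor_pct: float = 96.0,
                   floor_units: float = 100.0) -> bool:
    """Apply both stated criteria: ratio >= 3.0, and the 96th percentile
    of the tumor values >= 100 units."""
    return (r1_ratio(tumor, normal) >= ratio_cutoff
            and percentile(tumor, floor_pct) >= floor_units)


# Hypothetical expression values: 73 tumors and 12 non-malignant specimens
tumor_values = [400.0] * 73
normal_values = [100.0] * 12
print(r1_ratio(tumor_values, normal_values),
      is_upregulated(tumor_values, normal_values))
```

The second criterion acts as an absolute expression floor, so a probeset with a high ratio but low tumor signal would not pass.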



Pkey ExAccn UnigeneID Unigene Title R1

400292 AA250737 Hs.72472 BMP-R1B 51.5
424735 U31875 Hs.272499 short-chain alcohol dehydrogenase family 38.3
400297 AI127076 Hs.334473 hypothetical protein DKFZp564O1278 29.5
431448 AL137517 Hs.334473 hypothetical protein DKFZp564O1278 26.9
451110 AI955040 Hs.265398 ESTs, Weakly similar to transformation-r 25.8
431211 M86849 Hs.323733 gap junction protein, beta 2, 26kD (conn 23.2
418203 X54942 Hs.83758 CDC28 protein kinase 2 22.6
407980 AA046309 gb:zf12f01.s1 Soares_fetal_heart_NbHH19W 19.8
414646 AA353776 Hs.901 CD48 antigen (B-cell membrane protein) 18.5
446921 AB012113 Hs.16530 small inducible cytokine subfamily A (Cy 18.0
409041 AB033025 Hs.50081 KIAA1199 protein 17.6
412140 AA219691 Hs.73625 RAB6 interacting, kinesin-like (rabkines 17.6
407824 AA147884 Hs.9812 Homo sapiens cDNA FLJ14388 fis, clone HE 17.1
453160 AI263307 Hs.239884 H2B histone family, member L 17.0
407137 T97307 gb:ye53h05.s1 Soares fetal liver spleen 16.1
425692 D90041 Hs.155956 N-acetyltransferase 1 (arylamine N-acety 16.1
438533 AI440266 Hs.170673 ESTs, Weakly similar to T24832 hypotheti 16.0
428227 AA321649 Hs.2248 small inducible cytokine subfamily B (CX 15.5
444342 NM_014398 Hs.10887 similar to lysosome-associated membrane 15.1
422505 AL120862 Hs.124165 programmed cell death 9 (PDCD9) 14.9
430515 AA746503 Hs.283313 ESTs 14.7
417308 H60720 Hs.81892 KIAA0101 gene product 14.4
452744 AI267652 Hs.30504 Homo sapiens mRNA; cDNA DKFZp434E082 (fr 14.4
412446 AI768015 Hs.92127 ESTs 14.2
415539 AI733881 Hs.72472 BMP-R1B 14.1
435496 AW840171 Hs.265398 ESTs, Weakly similar to transformation-r 135
438209 AL120659 Hs.6111 aryl-hydrocarbon receptor nuclear transl 13.8
400205 NA NM_006265*:Homo sapiens RAD21 (S. pombe) 13.5
430965 AA489732 Hs.154918 ESTs 13.4
415263 AA948033 Hs.130853 ESTs 135
451952 AL120173 Hs.301663 ESTs 13.2
449722 BE280074 Hs.23960 cyclin B1 13.2
406685 M18728 gb:Human nonspecific crossreacting antig 13.0
406690 M29540 Hs.220529 carcinoembryonic antigen-related cell ad \Z&
429925 NM_000786 Hs.226213 cytochrome P450, 51 (lanosterol 14-alpha 12.8
416498 U33632 Hs.79351 potassium channel, subfamily K, member 1 12.7
432378 AI493046 Hs.146133 ESTs 12.5
441377 BE218239 Hs.202656 ESTs 12.5
456207 AA193450 gb:zr40e07.r1 Soares_NhHMPu_S1 Homo sapi 12.4
422805 AA436989 Hs.121017 H2A histone family, member A 12.2
407811 AW190902 Hs.40098 cysteine knot superfamily 1, BMP antagon 12.2
407178 AA195651 Hs.104106 ESTs 12^
407178 AA195651 Hs.104106 ESTs 12^ 
420931 AF044197 Hs.100431 small inducible cytokine B subfamily (Cy 12.1

421727 Y13153 Hs.107318 kynurenine 3-monooxygenase (kynurenine 3 12.0
434408 AI031771 Hs.132586 ESTs 12.0
446591 H44186 Hs.15456 PDZ domain containing 1 11.9
431385 BE178536 Hs.11090 membrane-spanning 4-domains, subfamily A 11.8
443348 AW873598 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 11.7
416602 NM_006159 Hs.79389 nel (chicken)-like 2 11.7
433365 AF026944 Hs.293797 ESTs 11.6
437866 AA156781 Hs.74170 metallothionein 1E (functional) 11.5
412472 AW975398 Hs.293836 ESTs 11.4
416030 H15261 Hs.21948 ESTs 11.3
439979 AW600291 Hs.6823 hypothetical protein FLJ10430 11.3
420757 X78592 Hs.99915 androgen receptor (dihydrotestosterone r 11.3
411598 BE336654 Hs.70937 H3 histone family, member A 11.2
423600 AI633559 Hs.310359 ESTs 115
430770 AA765694 Hs.123296 ESTs 11.0
421037 AI684808 Hs.197653 programmed cell death 9 (PDCD9) 105
452461 N78223 Hs.108106 transcription factor 10.7
409269 AA576953 Hs.22972 hypothetical protein FLJ13352 10.6
417791 AW965339 Hs.111471 ESTs 10.6
447268 AI370413 Hs.36563 hypothetical protein FLJ22416 10.4
424001 W67883 Hs.137476 paternally expressed 10 10.4
447342 AI199268 Hs.19322 Homo sapiens, Similar to RIKEN cDNA 2010 10.4
424905 NM_002497 Hs.153704 NIMA (never in mitosis gene a)-related k 10.1
453619 H87648 Hs.33922 Homo sapiens, clone MGC:9084, mRNA, comp 10.1
442942 AW167087 Hs.131562 ESTs 10.1
434377 AW137148 Hs.306593 Homo sapiens cDNA FLJ11382 fis, clone HE 10.1
427217 AA399272 Hs.144341 ESTs 10.1
445730 AI624342 Hs.170042 ESTs 10.0
432887 AI926047 Hs.162859 ESTs 10.0
452243 AL355715 Hs.28555 programmed cell death 9 9.9
424590 AW966399 Hs.46821 hypothetical protein FLJ20086 9.9
432169 Y00971 Hs.2910 phosphoribosyl pyrophosphate synthetase 9.9
438950 H23789 Hs.144530 EST 9.9
418836 AI655499 Hs.161712 ESTs 9.8
430291 AV660345 Hs.238126 CGW9 protein 9.8
444665 BE613126 Hs.47783 B aggressive lymphoma gene 9.7
407377 C16391 gb:C16391 Clontech human aorta polyA mRN 9.7
445413 AA151342 Hs.12677 CGI-147 protein 9.7
443462 AI064690 Hs.171176 ESTs 9.7
442145 AID22650 Hs.8117 erbb2-interacting protein ERBIN 9.7
435570 AF212222 Hs.177812 uncharacterized bone marrow protein BM04 9.7
439820 AL360204 Hs.283853 Homo sapiens mRNA full length insert cDN 9.6
428966 AF059214 Hs.194687 cholesterol 25-hydroxylase 9.6
449448 D60730 Hs.57471 ESTs 9.6
433929 AI375499 Hs.27379 ESTs 9.5
432731 R31178 Hs.287820 fibronectin 1 9.3
411815 AA156679 Hs.125790 leucine-rich repeat-containing 2 9.3
415385 R17798 Hs.7535 COBW-like protein 9.3
422026 U80736 Hs.110826 trinucleotide repeat containing 9 9.2
432596 AJ224741 Hs.278461 matrilin 3 9.2
439451 AF086270 Hs.278554 heterochromatin-like protein 1 95
423945 AA410943 gb:zt32h03.r1 Soares ovary tumor NbHOT H 9.1
442432 BE093589 Hs.38178 hypothetical protein FLJ23468 9.1
446715 AI337735 Hs.173919 ESTs, Moderately similar to ZN91_HUMAN Z 9.0
408771 AW732573 Hs.47584 potassium voltage-gated channel, delayed 9.0
437021 AI076089 Hs.292239 ESTs 9.0
428479 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 8.9
428839 AI767756 Hs.82302 Homo sapiens cDNA FLJ14814 fis, clone NT 8.9
402408 NA NM_030920*:Homo sapiens hypothetical pro 8.9
418601 AA279490 Hs.86368 calmegin 8.8
426327 W03242 Hs.44898 Homo sapiens clone TCCCTA00151 mRNA sequ 8.8
419519 AI198719 Hs.176376 ESTs 8.8
440621 AW296024 Hs.150434 ESTs 8.8
446142 AI754693 Hs.145968 ESTs 8.8
418196 AI745649 Hs.26549 KIAA1708 protein 8.7
447178 AW594641 Hs.192417 ESTs 8.7


WO 02/059377 PCT/US02/02242




427585 D31152 Hs.179729 collagen, type X, alpha 1 (Schmid metaph 8.6
415857 AA866115 Hs.127797 Homo sapiens cDNA FLJ11381 fis, clone HE 8.6
435061 AI651474 Hs.163944 ESTs 8.6
431374 BE258532 Hs.251871 CTP synthase 8.4
417866 AW067903 Hs.82772 collagen, type XI, alpha 1 8.4
437211 AA382207 Hs.5509 ecotropic viral integration site 2B 8.3
437751 AA767373 Hs.35669 ESTs, Moderately similar to ALU1_HUMAN A 8.3
423887 AL080207 Hs.134585 DKFZP434G232 protein 8.2
440941 BE268362 Hs.7535 COBW-like protein 8.2
429859 NM_007050 Hs.225952 protein tyrosine phosphatase, receptor t 8.2
410193 AJ132592 Hs.59757 zinc finger protein 281 8.2
431725 X65724 Hs.2839 Norrie disease (pseudoglioma) 8.1
446258 AI283476 Hs.263478 ESTs 8.1
416747 AW876523 Hs.15929 hypothetical protein FLJ12910 8.1
434424 AI811202 Hs.325335 Homo sapiens cDNA: FLJ23523 fis, clone L 8.1
421650 AA781795 Hs.122587 ESTs 8.0
429534 AW976987 Hs.163327 ESTs, Weakly similar to 2109260A B cell 8.0
457465 AW301344 Hs.122908 DNA replication factor 8.0
427961 AW293165 Hs.143134 ESTs 8.0
436481 AA379597 Hs.5199 HSPC150 protein similar to ubiquitin-con 8.0
418216 AA662240 Hs.283099 AF15q14 protein 8.0
418250 U29926 Hs.83918 adenosine monophosphate deaminase (isofo 7.9
400285 NA Eos Control 7.9
401464 AF039241 Hs.9028 histone deacetylase 5 7.9
407242 M18728 gb:Human nonspecific crossreacting antig 7.8
422232 D43945 Hs.113274 transcription factor EC 7.8
454024 AA993527 Hs.293907 hypothetical protein FLJ23403 7.8
444542 AI161293 Hs.280380 aminopeptidase 7.8
436396 AI683487 Hs.152213 wingless-type MMTV integration site fami 7.7
437204 AL110216 Hs.12265 ESTs, Weakly similar to 155214 salivary 7.6
408805 H69912 Hs.48269 vaccinia related kinase 1 7.6
437207 T27503 Hs.15929 hypothetical protein FLJ12910 7.6
442818 AK001741 Hs.8739 hypothetical protein FLJ10879 7.6
426283 NM_003937 Hs.169139 kynureninase (L-kynurenine hydrolase) 7.5
424687 J05070 Hs.151738 matrix metalloproteinase 9 (gelatinase B 7.5
446315 NM_016293 Hs.14770 bridging integrator 2 7.5
433426 H69125 Hs.133525 ESTs 7.5
406639 R&7711 gb:Human T-cell receptor (V beta 18.1, J 7.5
420077 AW512260 Hs.87767 ESTs 7.4
457332 AA961694 Hs.105187 kinesin protein 9 gene 7.4
422938 NM_001809 Hs.1594 centromere protein A (17kD) 7.4
447555 AI391662 Hs.160963 Homo sapiens, clone MGC:12318, mRNA, com 7.4
444618 AV653785 Hs.173334 ELL-RELATED RNA POLYMERASE II, ELONGATIO 7.3
410361 BE391804 Hs.62661 guanylate binding protein 1, interferon- 7.3
400268 NA NM_003292:Homo sapiens translocated prom 7.3
439509 AF086332 Hs.58314 ESTs 7.3
407771 AL138272 Hs.62713 ESTs 7.3
407202 N58172 Hs.109370 ESTs 7.3
433096 AU076803 Hs.282975 carboxylesterase 2 (intestine, liver) 7.2
422094 AF129535 Hs.272027 F-box only protein 5 7.1
430832 AI073913 Hs.100686 ESTs, Weakly similar to JE0350 Anterior 7.1
430287 AW182459 Hs.125759 ESTs, Weakly similar to LEU5_HUMAN LEUKE 7.0
423739 AA398155 Hs.97600 ESTs 7.0
448212 AM75858 gb:tc87d07.x1 NCI_CGAP_CLL1 Homo sapiens 7.0
407277 AW170035 Hs.326736 Homo sapiens breast cancer antigen NY-BR 7.0
454440 BE062906 Hs.28338 KIAA1546 protein 7.0
444783 AK001468 Hs.62180 anillin (Drosophila Scraps homolog), act 7.0
421373 AA606229 Hs.167771 ESTs 6.9
431960 AW241821 Hs.301927 c6.1A 6.9
424704 AI263293 Hs.152095 cytochrome P450, subfamily IIJ (arachido 6.8
449517 AW500106 Hs.23643 serine/threonine protein kinase MASK 6.8
439840 AW449211 Hs.105445 GDNF family receptor alpha 1 6.8
414080 AA135257 Hs.47783 B aggressive lymphoma gene 6.8
441243 AI767056 Hs.193002 ESTs 6.7
408380 AF123050 Hs.44532 diubiquitin 6.7
422956 BE545072 Hs.122579 hypothetical protein FLJ10461 6.7
446651 AA393907 Hs.97179 ESTs 6.7
437740 AA810265 Hs.122915 ESTs 6.7

421582 AI910275 Hs.1406 trefoil factor 1 (pS2) 6.7
427356 AW023482 Hs.97849 ESTs 6.6
429597 NM_003816 Hs.2442 a disintegrin and metalloproteinase doma 6.6
422634 NM_016010 Hs.118821 CGI-62 protein 6.6
421072 AI215069 Hs.89113 ESTs 6.5
427718 AI798680 Hs.25933 ESTs 6.5
411000 N40449 Hs.201619 ESTs, Weakly similar to S38383 SEB4B pro 6.5
449343 AI151418 Hs.272458 protein phosphatase 3 (formerly 2B), cat 6.4
409757 NM_001898 Hs.123114 cystatin SN 6.4
447164 AF026941 Hs.17518 Homo sapiens cig5 mRNA, partial sequence 6.4
456938 X52509 Hs.161640 tyrosine aminotransferase 6.4
418848 AI820961 Hs.193465 ESTs 6.4
424902 NM_003866 Hs.153687 inositol polyphosphate-4-phosphatase, ty 6.4
452838 U65011 Hs.30743 preferentially expressed antigen in mela 6.4
439452 AA918317 Hs.57987 B-cell CLL/lymphoma 11B (zinc finger pro 6.4
407266 AJ235664 gb:Homo sapiens mRNA for immunoglobulin 6.3
411078 AI222020 Hs.182364 CocoaCrisp 6.3
433001 AF217513 Hs.279905 clone HQ0310 PRO0310p1 6.3
434340 AI193043 Hs.128685 ESTs, Weakly similar to T17226 hypotheti 6.2
429503 AA394183 Hs.26873 ESTs 6.2
402578 C1001134:gi|2117372|pir||I65981 fatty ac 6.2
409646 AW161391 Hs.709 deoxycytidine kinase 6.1
430447 W17064 Hs.332848 SWI/SNF related, matrix associated, acti 6.1
432415 T16971 Hs.289014 ESTs, Weakly similar to A43932 mucin 2 p 6.1
443709 AI082692 Hs.134662 ESTs 6.1
420929 AI694143 Hs.296251 programmed cell death 4 6.1
428368 BE440042 Hs.83326 matrix metalloproteinase 3 (stromelysin 6.1
428248 AI126772 Hs.40479 ESTs 6.0
420344 BE463721 Hs.97101 putative G protein-coupled receptor 6.0
453392 U23752 Hs.32964 SRY (sex determining region Y)-box 11 6.0
425397 J04088 Hs.156346 topoisomerase (DNA) II alpha (170kD) 6.0
416007 M13509 Hs.83169 matrix metalloproteinase 1 (MMP1; inters 6.0
428585 AB007863 Hs.165140 KIAA0403 protein 6.0
437608 AA761605 Hs.292308 ESTs, Weakly similar to ALU1_HUMAN ALU S 6.0
427408 AA583206 Hs.2156 RAR-related orphan receptor A 6.0
406687 M31126 Hs.272620 matrix metalloproteinase 11 (MMP11; stro 6.0
418092 R45154 Hs.106604 ESTs 6.0
447051 AW139130 Hs.160951 ESTs, Weakly similar to Con1 [H.sapiens] 6.0
441233 AA972965 Hs.135568 ESTs 6.0
432239 X81334 Hs.2936 matrix metalloproteinase 13 (collagenase 6.0
435106 AA100847 Hs.193380 ESTs, Highly similar to AF174600 1 F-box 5.9
435525 AI831297 Hs.123310 ESTs 5.9
458809 AW972512 Hs.20985 sin3-associated polypeptide, 30kD 5.9
410785 AW803341 gb:IL24JM0079490300-05(M303 UM0079 Homo 5.9
422576 BE548555 Hs.118554 CGI-83 protein 5.9
451398 AI793124 Hs.144479 ESTs 5.9
441881 AW968904 Hs.179566 hypothetical protein FLJ22624 5.8
412022 AIC05043 Hs.24143 Wiskott-Aldrich syndrome protein interac 5.8
416636 N32536 Hs.42645 solute carrier family 16 (monocarboxylic 5.8
447350 AI375572 Hs.172634 ESTs 5.8
434094 AA305599 Hs.238205 hypothetical protein PRO2013 5.8
409151 AA306105 Hs.50785 SEC22, vesicle trafficking protein (S. c 5.8
448807 AI571940 Hs.7549 ESTs 5.8
452281 T93500 Hs.28792 Homo sapiens cDNA FLJ11041 fis, clone PL 5.8
421281 AI299139 Hs.17517 ESTs 5.8
430361 AI033965 Hs.239926 sterol-C4-methyl oxidase-like 5.8
400289 X07820 Hs.2258 matrix metalloproteinase 10 (MMP10; str 5.7
440527 AV657117 Hs.184164 ESTs, Moderately similar to S65657 alpha 5.7
434674 AA831879 Hs.136985 ESTs 5.7
426320 W47595 Hs.169300 transforming growth factor, beta 2 5.7
452401 NM_007115 Hs.29352 tumor necrosis factor, alpha-induced pro 5.7
448663 BE614599 Hs.106823 hypothetical protein MGC14797 5.7
438199 AW016531 Hs.122147 ESTs 5.7
446203 Z47553 Hs.14286 flavin containing monooxygenase 5 5.7
428336 AA503115 Hs.183752 microseminoprotein, beta- 5.6
430379 AF134149 Hs.240395 potassium channel, subfamily K, member 6 5.6
422835 BE218705 Hs.121378 metallothionein-like 5, testis-specific 5.6
444758 AL044878 Hs.11899 3-hydroxy-3-methylglutaryl-Coenzyme A re 5.6
443426 AF098158 Hs.9329 chromosome 20 open reading frame 1 5.6
400301 X03635 Hs.1657 estrogen receptor 1 5.6
447078 AW885727 Hs.301570 ESTs 5.6
432015 AL157504 Hs.159115 Homo sapiens mRNA; cDNA DKFZp586O0724 (f 5.5
438691 AA906288 Hs.212184 ESTs 5.5
439809 R41396 Hs.101774 hypothetical protein FLJ23045 5.5
415786 AW419196 Hs.257924 hypothetical protein FLJ13782 5.5
456373 BE247706 Hs.89751 membrane-spanning 4-domains, subfamily A 5.5
401645 NA C16001440*:gi|12330704|gb|AAG52890.1|AF355
437967 BE277414 Hs.5947 mel transforming oncogene (derived from 5.5
445885 AI734009 Hs.127699 KIAA1603 protein 5.4
439138 AJ742605 Hs.193696 ESTs 5.4
440270 NM_015986 Hs.7120 cytokine receptor-like molecule 9 5.4
437536 X91221 Hs.144465 ESTs 5.4
436167 R28363 Hs.24286 ESTs 5.4
452741 BE392914 Hs.30503 Homo sapiens cDNA FLJ11344 fis, clone PL 5.4
426214 H59846 Hs.128355 ESTs, Moderately similar to ALU7_HUMAN A 5.4
413554 AA319146 Hs.75426 secretogranin II (chromogranin C) 5.4
422867 L32137 Hs.1584 cartilage oligomeric matrix protein (COM 5.4
434263 N34895 Hs.44648 ESTs 5.4
446382 AW205168 Hs.150823 ESTs 5.4
422406 AF025441 Hs.116206 Opa-interacting protein 5 5.3
438321 AA576635 Hs.6153 CGI-48 protein 5.3
418310 AA814100 Hs.86693 ESTs 5.3
419625 U91616 Hs.91640 nuclear factor of kappa light polypeptid 5.3
450701 H39960 Hs.288467 Homo sapiens cDNA FLJ12280 fis, clone MA 5.3
445900 AF070526 Hs.13429 Homo sapiens clone 24787 mRNA sequence 5.2
449051 AW961400 Hs.333526 HER2 receptor tyrosine kinase (c-erb-b2, 5.2
418758 AW959311 Hs.172012 hypothetical protein DKFZp434J037 5.2
431070 AW408164 Hs.249184 transcription factor 19 (SC1) 5.2
417079 U65590 Hs.61134 interleukin 1 receptor antagonist 5.2
421928 AF013758 Hs.109643 polyadenylate binding protein-interactin 5.2
428804 AK000713 Hs.193736 hypothetical protein FLJ20706 5.2
427427 AF077345 Hs.177936 ESTs 5.2
403485 C3001813*:gi|12737279|ref|XP_012163.1|k 5.2
422168 AA586894 Hs.112408 S100 calcium-binding protein A7 (psorias 5.1
421937 AI878857 Hs.109706 hematological and neurological expressed 5.1
426752 X69490 Hs.172004 titin 5.1
453310 X70697 Hs.553 solute carrier family 6 (neurotransmitte 5.1
423198 M81933 Hs.1634 cell division cycle 25A 5.1
412281 AI810054 Hs.14119 ESTs 5.1
447513 AW955776 Hs.313500 ESTs, Moderately similar to ALU7_HUMAN A 5.1
453931 AL121278 Hs.25144 ESTs 5.1
404347 Target Exon 5.1
431808 M30703 Hs.270833 amphiregulin (schwannoma-derived growth 5.1
429113 D28235 Hs.196384 prostaglandin-endoperoxide synthase 2 (p 5.1
436291 BE568452 Hs.5101 protein regulator of cytokinesis 1 5.1
450603 R43646 Hs.12422 ESTs 5.1
434725 AK000796 Hs.4104 hypothetical protein 5.0
435981 H74319 Hs.188620 ESTs 5.0
407376 AA993138 Hs.142287 ESTs, Weakly similar to ALUF_HUMAN !!!! 5.0
431689 AA305688 Hs.267695 UDP-Gal:betaGlcNAc beta 1,3-galactosyltr 5.0
405348 NA C7001664:gi|12698061|dbj|BAB21849.1|(AB 5.0
436196 AK001084 Hs.333498 Homo sapiens cDNA FLJ10222 fis, clone HE 5.0
437065 AL036450 Hs.103238 ESTs 5.0
410196 AI936442 Hs.59838 hypothetical protein FLJ10808 5.0
429412 NM_006235 Hs.2407 POU domain, class 2, associating factor 5.0
446619 AU076643 Hs.313 secreted phosphoprotein 1 (osteopontin, 4.9
403329 NA Target Exon 4.9
442875 BE623003 Hs.23625 Homo sapiens clone TCCCTA00142 mRNA sequ 4.9
442441 AI820662 Hs.129598 ESTs 4.9
430375 AW371048 Hs.93758 H4 histone family, member H 4.9
424128 AW966163 gb:EST378236 MAGE resequences, MAGI Homo 4.9
408873 AL046017 Hs.182278 calmodulin 2 (phosphorylase kinase, delt 4.9
407910 AA650274 Hs.41296 fibronectin leucine rich transmembrane p 4.9

432608 NM_002104 Hs.3066 granzyme K (serine protease, granzyme 3; 4.9
453204 R10799 Hs.191990 ESTs 4.8
452020 AA722012 Hs.255757 ESTs, Weakly similar to AT2A_HUMAN POTEN 4.8
449048 Z45051 Hs.22920 similar to S68401 (cattle) glucose induc 4.8
408369 R38438 Hs.182575 solute carrier family 15 (H?7? transport 4.8
431645 AF078849 Hs.266483 dynein light chain-A 4.8
423575 C1B863 Hs.163443 Homo sapiens cDNA FLJ11576 fis, clone HE 4.8
444246 H93281 Hs.10710 hypothetical protein FLJ20417 4.8
421524 AA312082 Hs.105445 GDNF family receptor alpha 1 4.8
452827 AI571835 Hs.55468 ESTs 4.8
414222 AL135173 Hs.878 sorbitol dehydrogenase 4.8
456086 AL161999 Hs.77324 eukaryotic translation termination facto 4.8
419078 M93119 Hs.89584 insulinoma-associated 1 4.8
418973 AA233056 Hs.191518 ESTs 4.8
447033 AI357412 Hs.157601 ESTs 4.8
451621 AI879148 Hs.26770 fatty acid binding protein 7, brain 4.7
419968 X04430 Hs.93913 interleukin 6 (interferon, beta 2) 4.7
424326 NM_014479 Hs.145296 disintegrin protease 4.7
431585 BE242803 Hs.262823 hypothetical protein FLJ10326 4.7
429294 AA095971 Hs.198793 Homo sapiens cDNA: FLJ22463 fis, clone H 4.7
416814 AW192307 Hs.80042 dolichyl-P-Glc:Man9GlcNAc2-PP-dolichylgl 4.7
439897 NM_015310 Hs.6763 KIAA0942 protein 4.7
429687 AI675749 Hs.211608 nucleoporin 153kD 4.7
422880 AF228704 Hs.121524 glutathione reductase 4.7
405801 NM_000390:Homo sapiens choroideremia (Ra 4.6
432435 BE218886 Hs.282070 ESTs 4.6
439544 W26354 Hs.28891 hypothetical protein FLJ11360; artemis p 4.6
425354 U62027 Hs.155935 complement component 3a receptor 1 4.6
436027 AI864053 Hs.39972 ESTs, Weakly similar to I38588 reverse t 4.6
424623 AW963062 Hs.337404 ESTs 4.6
403366 NA Target Exon 4.6
402542 Target Exon 4.6
450193 AI916071 Hs.15607 Homo sapiens Fanconi anemia complementat 4.6
411678 AI907114 Hs.71465 squalene epoxidase 4.6
456844 AI264155 Hs.152981 CDP-diacylglycerol synthase (phosphatida 4.6
448072 AI459306 Hs.24908 ESTs 4.5
408045 AW138959 Hs.245123 ESTs 4.5
423782 AW72209 Hs.323117 ESTs 4.5
447388 AW630534 Hs.76277 Homo sapiens, clone MGC:9381, mRNA, comp 4.5
448140 AF146761 Hs.20450 BCM-like membrane protein precursor 4.5
452561 AI692181 Hs.49169 KIAA1634 protein 4.5
425331 AW962128 gb:EST374201 MAGE resequences, MAGG Homo 4.5
428801 AW277121 Hs.254881 ESTs 4.5
428500 AI815395 Hs.184641 fatty acid desaturase 2 4.5
426075 AW513691 Hs.270149 ESTs, Weakly similar to 2109260A B cell 4.4
437259 AI377755 Hs.120695 ESTs 4.4
400409 AF153341 Hs.283954 Homo sapiens winged helix/forkhead trans 4.4
412863 AA121673 Hs.59757 zinc finger protein 281 4.4
426989 AI815206 Hs.99395 ESTs 4.4
401866 Target Exon 4.4
418819 AA228776 Hs.191721 ESTs 4.4
406348 Target Exon 4.4
412138 AW895387 gb:QV4^N0038^00300-167<10NN0038 Homo 4.4
428550 AW297880 Hs.98661 ESTs 4.4
411743 AW862214 gb:QV4-CT0361-301299-074-b05 CT0361 Homo 4.4
429966 BE081342 Hs.283037 HSPC039 protein 4.4
423291 NM_004129 Hs.126590 guanylate cyclase 1, soluble, beta 2 4.4
423456 AL110151 Hs.128797 DKFZP586D0824 protein 4.4
452190 H26735 Hs.91668 Homo sapiens clone PP1498 unknown mRNA 4.4
424871 NM_004525 Hs.153595 low density lipoprotein-related protein 4.3
429575 AA706003 Hs.99387 ESTs 4.3
429922 Z97630 Hs.226117 H1 histone family, member 0 4.3
421379 Y15221 Hs.103982 small inducible cytokine subfamily B (Cy 4.3
400300 X03363 HER2 receptor tyrosine kinase (c-erb-b2, 4.3
437258 AL041243 Hs.174104 ESTs 4.3
446595 T57448 Hs.15467 hypothetical protein FLJ20725 4.3
403011 NA ENSP00000215330*:Probable serine/threoni 4.3

419055 AI365384 Hs.11571 Homo sapiens cDNA FLJ11570 fis, clone HE 4.3
418661 NM_001949 Hs.1189 E2F transcription factor 3 4.3
407786 AA687538 Hs.38972 tetraspan 1 4.3
429183 AB014604 Hs.197955 KIAA0704 protein 4.3
442914 AW188551 Hs.99519 hypothetical protein FLJ14007 4.3
441029 AI091795 Hs.179246 ESTs 4.3
452194 AI694413 Hs.332649 olfactory receptor, family 2, subfamily 4.3
414821 M63835 Hs.77424 Fc fragment of IgG, high affinity Ia, re 4.2
410102 AW248508 Hs.279727 Homo sapiens cDNA FLJ14035 fis, clone HE 4.2
452110 T47667 Hs.28005 Homo sapiens cDNA FLJ11309 fis, clone PL 4.2
442007 AA301116 Hs.142838 nucleolar phosphoprotein Nopp34 4.2
417318 AW953937 Hs.12891 ESTs 4.2
431818 AW510444 Hs.191705 ESTs, Weakly similar to T47184 hypotheti 4.2
443646 AI08519S Hs.164226 ESTs 4.2
419169 AW851980 Hs.262346 ESTs, Weakly similar to S72482 hypotheti 4.2
446839 BE091926 Hs.16244 mitotic spindle coiled-coil related prot 4.2
423242 AL039402 Hs.125783 DEME-6 protein 4.2
432116 AA902953 Hs.308538 ESTs 4.2
409038 T97490 Hs.50002 small inducible cytokine subfamily A (Cy 4.2
445625 BE246743 Hs.288529 hypothetical protein FLJ22635 4.2
425139 AW630488 Hs.325820 protease, serine, 23 4.2
447397 BE247676 Hs.18442 E-1 enzyme 4.2
410166 AK001376 Hs.59346 hypothetical protein FLJ10514 4.1
437295 AW779318 Hs.88417 ESTs 4.1
430486 BE062109 Hs.241551 chloride channel, calcium activated, fam 4.1
441790 AW294909 Hs.132208 ESTs 4.1
410129 BE244074 Hs.58831 regulator of Fas-induced apoptosis 4.1
427521 AW973352 Hs.290585 ESTs 4.1
425247 NM_005940 Hs.155324 matrix metalloproteinase 11 (MMP11; stro 4.1
412886 AF041163 Hs.74647 Human T-cell receptor active alpha-chain 4.1
441153 BE562826 gb:601336534F1 NIHJ/IGCJ4 Homo sapiens c 4.1
444301 AK000136 Hs.10760 asporin (LRR class 1) 4.1
426711 AA383471 Hs.180669 conserved gene amplified in osteosarcoma 4.1
405850 NA Target Exon 4.1
440283 AI732892 Hs.190489 ESTs 4.0
432441 AW292425 Hs.163484 ESTs 4.0
400284 NA estrogen receptor 1 4.0
417341 N91453 Hs.102987 ESTs 4.0
429732 U20158 Hs.2488 lymphocyte cytosolic protein 2 (SH2 doma 4.0
411393 AW797437 Hs.69771 B-factor, properdin 4.0
425704 U79293 Hs.159264 Human clone 23948 mRNA sequence 4.0
419594 AA013051 Hs.91417 topoisomerase (DNA) II binding protein 4.0
419092 J05581 Hs.89603 mucin 1, transmembrane 4.0
443147 AI034351 Hs.19030 ESTs 4.0
408633 AW963372 Hs.46677 PRO2000 protein 4.0
433404 T32982 Hs.102720 ESTs 4.0
421506 BE302796 Hs.105097 thymidine kinase 1, soluble 4.0
417900 BE250127 Hs.62906 CDC20 (cell division cycle 20, S. cerevi 3.9
414602 AW630088 Hs.76550 Homo sapiens mRNA; cDNA DKFZp564B1264 (f 3.9
413762 AW411479 Hs.848 FK506-binding protein 4 (59kD) 3.9
404580 NM_014112*:Homo sapiens trichorhinophala 3.9
452046 AB018345 Hs.27657 KIAA0802 protein 3.9
459587 AA031956 gb:zk15e04.s1 Soares_pregnant_uterus_NbH 3.9
416658 U03272 Hs.79432 fibrillin 2 (congenital contracture) ara 3.9
426647 AA243464 Hs.294101 pre-B-cell leukemia transcription factor 3.9
429353 AL117406 Hs.200102 ATP-binding cassette transporter MRP8 3.9
419038 AW134924 Hs.190325 ESTs 3.9
418918 X07871 Hs.69476 CD2 antigen (p50), sheep red blood cell 3.9
421977 W94197 Hs.110165 ribosomal protein L26 homolog 3.9
442567 AI201183 Hs.130251 ESTs 3.9
421168 AF182277 Hs.330780 cytochrome P450, subfamily IIB (phenobar 3.9
431701 AW935490 Hs.14658 Human chromosome 5q13.1 clone 5G8 mRNA 3.9
418526 BE019020 Hs.85838 solute carrier family 16 (monocarboxylic 3.9
414998 NM_002543 Hs.77729 oxidised low density lipoprotein (lectin 3.9
422790 AA809875 Hs.25933 ESTs 3.9
419741 NM_007019 Hs.93002 ubiquitin carrier protein E2-C 3.9
430017 AA263172 Hs.35 protein tyrosine phosphatase, non-recept 3.9

458814 AI498957 Hs.170861 ESTs, Weakly similar to Z195.HUMAN ZINC 3.8 

428514 AW236861 Hs.193139 ESTs 3.8 

434521 NMJ)02267Hs.3886 karyopherin alpha 3 (importin alpha 4) 3.8 

5 409425 U40462 Hs.54452 zinc finger protein, subfamily 1 A, 1 (Ik 3.8 

439560 BE565647 Hs.74899 hypotheticaJ protein FU1282Q 3.8 

424028 AF055084 Hs.153692 Homo sapiens cDNA FU14354 fis, done Y7 3.8 

400021 . AFFX control -HUMISGF3A/M97935J/IA 3.8 

453403 BE466639 Hs.61779 Homo sapiens cDNA FU13591 fe, clone PL 3.8 

10 445941 AI267371 Hs.172636 ESJs 3.8 

434378 AA631739 Hs.335440 EST 3.8 

429220 AW207206 Hs.136319 ESTs 3.8 

439176 AI446444 Hs.190394 ESTs, Weakly similar to B28096 iine-1 pr 3.8 

401045 C11001883*:gf|6753278IrefINP^.033938.1|c 3.8 

15 430178 AW449612 Hs.152475 ESTs 3.8 

423397 NM_001838Hs.1652 chemokine (C-C motif) receptor 7 3.8 

447630 AI660149 Hs.44865 tymphoid enhancer-binding factor 1 3.8 

436391 AJ227892 Hs.146274 ESTs 3.8 

413011 AW068115 Hs.821 biglycan 3.8 

20 422121 AI767949 Hs. 179833 ESTs 3.8 

452268 NM_003512Hs28777 H2A histone family, member L 3.8 

427811 M81057 Hs.180884 carboxypeptidase B1 (tissue) 3.8 

415579 AA165232 Hs.222069 ESTs 3.8 

437330 AL353944 Hs.50115 Homo sapiens mRNA; cDNA DKFZp761 J11 12 (f 3.8 

25 427122 AW057736 Hs.323910 HER2 receptor tyrosine kinase (c-erb-b2, 3.7 

400286 NA C16(K)0922:gl|7499103|plr||T20903hypothe 3.7 

420281 AI623693 Hs.191533 ESTs 3.7 

419926 AW900992 Hs.93796 DKFZP586D2223 protein 3.7 

417541 AI992191 Hs.180040 hypothetical protein FLJ22439 3.7

426172 AA371307 Hs.125056 ESTs 3.7

429638 AI916662 Hs.211577 kinectin 1 (kinesin receptor) 3.7

457001 J03258 Hs.2062 vitamin D (1,25-dihydroxyvitamin D3) re 3.7

424109 AW406878 gb:UWF-BLO-adg-g-06-0-UI.r1 NIH_MGC_37 3.7

417022 NM_014737 Hs.80905 Ras association (RalGDS/AF-6) domain fam 3.7

436222 AI208737 Hs.122810 Homo sapiens cDNA FLJ11489 fis, clone HE 3.7

430448 AI633553 Hs.13303 Homo sapiens cDNA: FLJ21784 fis, clone H 3.7

432729 AK000292 Hs.278732 hypothetical protein FLJ20285 3.7

413916 N49813 Hs.75615 apolipoprotein C-II 3.7

421662 NM_014141 Hs.106552 cell recognition molecule Caspr2 3.7

441633 AW958544 Hs.112242 normal mucosa of esophagus specific 1 3.7

408761 AA057264 Hs.238936 ESTs, Weakly similar to (defline not ava 3.7

406153 Target Exon 3.7 

445563 AW873606 Hs.149006 ESTs 3.7 

453464 AI884911 Hs.32989 receptor (calcitonin) activity modifying 3.7 

448918 AB011152 Hs.22572 KIAA0580 protein 3.7

413936 AF113676 Hs.297681 serine (or cysteine) proteinase inhibito 3.6

448069 U76248 Hs.20191 seven in absentia (Drosophila) homolog 2 3.6

453313 BE005771 Hs.153746 hypothetical protein FLJ22490 3.6

425234 AW152225 Hs.165909 ESTs, Weakly similar to I38022 hypotheti 3.6

419941 X98654 Hs.93837 phosphatidylinositol transfer protein, m 3.6

402397 AF166625 Hs.189507 phospholipase A2, group IID 3.6

430378 Z29572 Hs.2556 tumor necrosis factor receptor superfami 3.6 

448106 AI800470 Hs.171941 ESTs 3.6 

426431 NM_000458 Hs.169853 transcription factor 2, hepatic; LF-B3; 3.6

431843 AA516420 Hs.183526 ESTs, Weakly similar to I38022 hypotheti 3.6

426878 BE069341 gb:QV3-BT0381-2701004)73-c08BT0381 Homo 3.6

434061 AW024973 Hs.283675 NPD009 protein 3.6

445292 AV653264 Hs.13982 Homo sapiens cDNA FLJ14666 fis, clone NT 3.6

452101 T60298 Hs.10844 Homo sapiens cDNA FLJ14476 fis, clone MA 3.6

427581 NM_014788 Hs.179703 KIAA0129 gene product 3.6

409047 AW961434 Hs.31539 ESTs 3.6

416820 NM_000402 Hs.80206 glucose-6-phosphate dehydrogenase 3.6

410386 W26187 Hs.3327 Homo sapiens cDNA: FLJ22219 fis, clone H 3.6

440516 S42303 Hs.161 cadherin 2, type 1, N-cadherin (neuronal 3.6

434360 AW015415 Hs.127780 ESTs 3.6

428970 BE276891 Hs.194691 retinoic acid induced 3 3.6

415079 R43179 Hs^2895 hypothetical protein FLJ23548 3.6

409619 AK001015 Hs.55220 BCL2-associated athanogene 2 3.6

430044 AA464510 Hs.152612 ESTs 3.6

430829 AW451999 Hs.194024 ESTs 3.6 

434224 AA380731 Hs.84 interleukin 2 receptor, gamma (severe co 3.6

439247 AF088020 Hs.46767 EST 3.6 

431542 H63010 Hs.5740 ESTs 3.5 

430713 AA351647 Hs.2642 eukaryotic translation elongation factor 3.5 

434988 AI418055 Hs.161160 ESTs 3.5 

437748 AF234882 Hs.5814 suppression of tumorigenicity 7 3.5 

418322 AA284166 Hs.84113 cyclin-dependent kinase inhibitor 3 (CDK 3.5

439569 AW602166 Hs.222399 CEGP1 protein 3.5

459583 AI907673 gb:IL-BT152^)8039^004BT152Homosapien 3.5

403212 NM_019595:Homo sapiens intersectin 2 (IT 3.5

409099 AK000725 Hs.50579 hypothetical protein FLJ20718 3.5

453968 AA847843 Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 3.5

436338 W92147 Hs.118394 ESTs 3.5

422890 Z43784 Hs.75893 ankyrin 3, node of Ranvier (ankyrin G) 3.5

442295 AI827248 Hs.224398 Homo sapiens cDNA FLJ11469 fis, clone HE 3.5

417975 AA641836 Hs.30085 hypothetical protein FLJ23186 3.5

433730 AK002135 Hs.3542 hypothetical protein FLJ11273 3.5

439926 AW014875 Hs.137007 ESTs 3.5

445873 AA250970 Hs.251946 poly(A)-binding protein, cytoplasmic 1-1 3.4

410153 BE311926 Hs.15830 hypothetical protein FLJ12691 3.4

422128 AW881145 gb:QVTW)T0033^1()400.182^07QT0033Homo 3

414921 BE390551 Hs.77628 steroidogenic acute regulatory protein r 3.4

441134 W29092 Hs.7678 cellular retinoic acid-binding protein 1 3.4

444564 AI167877 Hs.143716 ESTs 3.4

402470 NA Target Exon 3.4 

418120 AA213437 Hs.192249 ESTs 3.4 

422414 AW675237 Hs.13701 ESTs 3.4 

433345 AI681545 Hs.152982 hypothetical protein FLJ13117 3.4

409213 U61412 Hs.51133 PTK6 protein tyrosine kinase 6 3.4

422611 AA158177 Hs.118722 fucosyltransferase 8 (alpha (1,6) fucosy 3.4

423554 M90516 Hs.1674 glutamine-fructose-6-phosphate transamin 3.4

442619 AA447492 Hs.20183 ESTs, Weakly similar to AF164793_1 prote 3.4

402359 NA C19001991*:gi|12656111|gb|AAK00751.1|AF2 3.4

439398 AA284267 Hs.221504 ESTs 3.4

415208 F01020 Hs.172004 titin 3.4

452853 AA812633 Hs.10845 ESTs 3.4

429345 R11141 Hs.199695 hypothetical protein 3.4

449027 AJ271216 Hs.22880 dipeptidylpeptidase III 3.4

412115 AK001763 Hs.73239 hypothetical protein FLJ10901 3.4

432180 Y18418 Hs.272822 RuvB (E coli homolog)-like 1 3.4

428977 AK001404 Hs.194698 cyclin B2 3.4

431611 U58766 Hs.264428 tissue specific transplantation antigen 3.4

418286 AA622528 Hs.319825 Homo sapiens, clone IMAGE:3616574, mRNA, 3.4

436895 AF037335 Hs.5338 carbonic anhydrase XII (tumor antigen H 3.4 

443378 AW392550 Hs.9280 proteasome (prosome, macropain) subunit, 3.4 

428450 NM_014791 Hs.184339 KIAA0175 gene product 3.3

449571 AW016812 Hs.200266 ESTs 3.3

412777 AI335773 Hs.270123 ESTs 3.3

420542 NM_000505 Hs.1321 coagulation factor XII (Hageman factor) 3.3

412754 AW160375 Hs.74565 amyloid beta (A4) precursor-like protein 3.3

418327 U70370 Hs.84136 paired-like homeodomain transcription fa 3.3

449065 AI627393 Hs.258998 ESTs, Weakly similar to high mobility gr 3.3

425999 AW513051 Hs.332981 ESTs, Weakly similar to I38022 hypotheti 3.3

430280 AA361258 Hs.237868 interleukin 7 receptor 3.3

407777 AA161071 Hs.71465 squalene epoxidase 3.3

426516 BE262660 Hs.170197 glutamic-oxaloacetic transaminase 2, mit 3.3

414361 AI086138 Hs.204044 ESTs 3.3

427080 AW068287 Hs.173466 ras-related C3 botulinum toxin substrate 3.3

426429 X73114 Hs.169849 myosin-binding protein C, slow-type 3.3

446163 AA026880 Hs.25252 prolactin receptor 3.3

428566 U41763 Hs.184916 clathrin, heavy polypeptide-like 1 3.3

418641 BE243136 Hs.86947 a disintegrin and metalloproteinase doma 3.3

436293 AI601188 Hs.120910 ESTs 3.3 



411257 AA628967 Hs.115274 ESTs, Highly similar to IHH_HUMAN INDIAN 3.3

430253 AK001514 Hs.236844 hypothetical protein FLJ10652 3.3

430066 AI929659 Hs.237825 signal recognition particle 72kD 3.3

436469 AK001455 Hs.5198 Down syndrome critical region gene 2 3.3

437786 BE142681 Hs.155573 polymerase (DNA directed), eta 3.3

444079 H09048 Hs.23608 ESTs 3.3

457183 H91882 Hs.118569 Dvl-binding protein IDAX (inhibition of 3.3

431215 AA496078 Hs.121554 Human DNA sequence from clone RP11-218C1 3.3

424563 AA446932 Hs.151428 ret finger protein 2 3.3

450828 AW270655 Hs.193804 ESTs 3.3

408652 R43409 Hs.6829 Homo sapiens mRNA for KIAA1644 protein, 3.3

445142 AW978484 Hs.93842 Homo sapiens cDNA: FLJ22554 fis, clone H 3.3

426761 AI015709 Hs.172089 Homo sapiens mRNA; cDNA DKFZp586I2022 (f 3.3

439237 AW408158 Hs.318893 ESTs, Weakly similar to A47582 B-cell gr 3.3

422616 BE300330 Hs.118725 selenophosphate synthetase 2 3.3

443247 BE614387 Hs.333893 c-Myc target JPO1 3.3

406663 U24683 Hs.302063 immunoglobulin heavy constant mu 3.3

434137 AA907734 Hs.124895 ESTs 3.3

408877 AA479033 Hs.130315 ESTs, Weakly similar to A47582 B-cell gr 3.3

439101 C01765 Hs.38750 hypothetical protein FLJ11526 3.3

408221 AA912183 Hs.47447 ESTs 3.3

447519 U46258 Hs.339865 ESTs 3.3

404755 NA Target Exon 3.3

451871 AI821005 Hs.118599 ESTs 3.2

420319 AW406289 Hs.96593 hypothetical protein 3.2

430580 AA806105 Hs.300697 immunoglobulin heavy constant gamma 3 (G 3.2

400202 NA NM_002795*:Homo sapiens proteasome (pros 3.2

400222 NA NM_002082*:Homo sapiens G protein-couple 3.2

425988 BE045897 Hs.274454 ESTs, Weakly similar to I38022 hypotheti 3.2

456098 BE550224 Hs.74170 metallothionein 1E (functional) 3.2

430589 AJ002744 Hs.246315 UDP-N-acetyl-alpha-D-galactosamine:polyp 3.2

431563 AI027643 Hs.120912 ESTs 3.2

442353 BE379594 Hs.49136 ESTs, Moderately similar to ALU7_HUMAN A 3.2

422309 U79745 Hs.114924 solute carrier family 16 (monocarboxylic 3.2

419703 AI793257 Hs.128151 ESTs 3.2

420380 AA640891 Hs.102406 ESTs 3.2

410853 H04588 Hs.30469 ESTs 3.2

454417 AI244459 Hs.110826 trinucleotide repeat containing 9 3.2

432745 AI821926 gb:nt78f05.x5 NCI_CGAP_Pr3 Homo sapiens 3.2

422032 AA476966 Hs.110857 polymerase (RNA) III (DNA directed) poly 3.2

415339 NM_015156 Hs.78398 KIAA0071 protein 3.2

426384 AI472078 Hs.303662 ESTs 3.2

448030 N30714 Hs.325960 membrane-spanning 4-domains, subfamily A 3.2

418739 AA310964 Hs.88012 SHP2 interacting transmembrane adaptor 3.2

442053 R35343 Hs.24968 Human DNA sequence from clone RP1-233G16 3.2

434747 AA837085 Hs.220585 ESTs 3.2

427297 AW292593 Hs.334907 Homo sapiens, clone MGC:17333, mRNA, com 3.2

412228 AW503785 Hs.73792 complement component (3d/Epstein Barr vi 3.2

452304 AA025386 Hs.61311 ESTs, Weakly similar to S10590 cysteine 3.2

453953 AW408337 Hs.36972 CD7 antigen (p41) 3.2

407758 D50915 Hs.38365 KIAA0125 gene product 3.2

451149 AL047586 Hs.10283 RNA binding motif protein 8B 3.2

430015 AW768399 Hs.112157 ESTs 3.2

433313 W20128 Hs.296039 ESTs 3.2

418334 AA319233 H$£521 ESTs 3.2

450223 AA418204 Hs.241493 natural killer-tumor recognition sequenc 3.2

454365 AW966728 Hs.54642 methionine adenosyltransferase II, beta 3.2

451128 AL118668 gb:DKFZp761 10310^1 761 (synonym: hamy2) 3.2

417793 AW405434 Hs.82575 small nuclear ribonucleoprotein polypept 3.2

428027 U22029 Hs.334345 cytochrome P450, subfamily IIA (phenobar 3.2

441197 BE244638 Hs.166 sterol regulatory element binding transc 3.2

424634 NM_003613 Hs.151407 cartilage intermediate layer protein, nu 3.2

419986 AI345455 Hs.78915 GA-binding protein transcription factor, 3.2

416714 AF283770 Hs.79630 CD79A antigen (immunoglobulin-associated 3.2

449465 NM_004380 Hs.23598 CREB binding protein (Rubinstein-Taybi s 3.2

422166 W72424 Hs.112405 S100 calcium-binding protein A9 (calgran 3.2

409079 W87707 Hs.82065 interleukin 6 signal transducer (gp130, 3.2

423551 AA327598 Hs.233785 ESTs 3.2

453553 AA036849 Hs.61629 Homo sapiens cDNA FLJ12763 fis, clone NT 3.2

442580 AI733682 Hs.130239 ESTs 3.2 

458079 AI796870 Hs.54277 DNA segment on chromosome X (unique) 992 3.2

425700 AF076292 Hs.159251 forkhead box H1 3.2

417124 BE122762 Hs.25338 ESTs 3.2

407104 S57296 Hs.323910 v-erb-b2 avian erythroblastic leukemia v 3.2

442215 AI703172 Hs.129005 ESTs, Weakly similar to 2109260A B cell 3.1

430271 T06199 Hs.237506 DnaJ (Hsp40) homolog, subfamily B, membe 3.1

425317 AW205118 Hs.210546 interleukin 21 receptor 3.1

426095 AI278023 Hs.89986 ESTs 3.1

442313 BE388898 Hs.8215 hypothetical protein FLJ11307 3.1

424709 AL137589 Hs.152149 hypothetical protein DKFZp434K0410 3.1

429671 BE379335 Hs.211594 proteasome (prosome, macropain) 26S subu 3.1

432715 AA247152 Hs.200483 ESTs, Weakly similar to KIAA1074 protein 3.1

431574 AW572659 Hs.261373 hypothetical protein dJ434O14.3 3.1

436876 AI124756 Hs.5337 isocitrate dehydrogenase 2 (NADP), mitoc 3.1

405017 NA Target Exon 3.1 

433805 AA706910 Hs.112742 ESTs 3.1

437352 AL353957 Hs.284181 hypothetical protein DKFZp434P0531 3.1

430105 X70297 Hs.2540 cholinergic receptor, nicotinic, alpha p 3.1

422083 NM_001141 Hs.111256 arachidonate 15-lipoxygenase, second typ 3.1

413507 BE145360 Hs.190064 ESTs, Weakly similar to I38022 hypotheti 3.1

415989 AI267700 Hs.317584 ESTs 3.1

422907 AI879263 Hs.6986 Human glucose transporter pseudogene 3.1

425548 AA890023 Hs.1906 prolactin receptor 3.1

422599 BE387202 Hs.116638 non-metastatic cells 1, protein (NM23A) 3.1

439963 AW247529 Hs.6793 platelet-activating factor acetylhydrola 3.1

453883 AI638516 Hs.22630 cofactor required for Sp1 transcriptiona 3.1

458021 AI885190 Hs.156089 ESTs, Weakly similar to repressor protei 3.1

418478 U38945 Hs.1174 cyclin-dependent kinase inhibitor 2A (me 3.1

400814 NA Target Exon 3.1

402327 Target Exon 3.1

416935 AA190712 gb:zp87f09.r1 Stratagene HeLa cell s3 93 3.1

439838 AL355722 Hs.106875 Homo sapiens EST from clone 35214, full 3.1

437036 AI571514 Hs.133022 ESTs 3.1

449523 NM_000579 Hs.54443 chemokine (C-C motif) receptor 5 3.1

406642 AJ245210 gb:Homo sapiens mRNA for immunoglobulin 3.1

406624 AF052762 gb:Homo sapiens clone csneg8-1 immunoglo 3.1

421924 BE514514 Hs.109606 coronin, actin-binding protein, 1A 3.1

414523 AU076633 Hs.76353 serine (or cysteine) proteinase inhibito 3.1

416379 N38857 Hs.203933 ESTs 3.1 

422823 D89974 Hs.121102 vanin2 3.1 

433904 AI399956 Hs.208956 ESTs 3.1 

421904 BE143533 Hs.109309 hypothetical protein FLJ20035 3.1

428834 AW899713 Hs.339315 ESTs 3.1

436043 AW963838 Hs.168830 Homo sapiens cDNA FLJ12136 fis, clone MA 3.1

452823 AB012124 Hs.30696 transcription factor-like 5 (basic helix 3.1

405381 NA Target Exon 3.1

428746 AW503820 Hs.192861 Spi-B transcription factor (Spi-1/PU.1 r 3.1

435147 AL133731 Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (f 3.1

425782 U66468 Hs.159525 cell growth regulatory with EF-hand doma 3.1

423306 W88562 Hs.108198 ESTs 3.1 

419123 AA234276 Hs.88253 ESTs 3.1 

438581 AW977766 Hs.292133 ESTs, Moderately similar to 178885 serin 3.1 

417105 X60992 Hs.81226 CD6 antigen 3.0

428361 NM_015905 Hs.183658 transcriptional intermediary factor 1 3.0

417880 BE241595 Hs.82848 selectin L (lymphocyte adhesion molecule 3.0

402606 NM_024626:Homo sapiens hypothetical prot 3.0

401451 NM_004496*:Homo sapiens hepatocyte nucle 3.0

421878 AA299652 Hs.111496 Homo sapiens cDNA FLJ11643 fis, clone HE 3.0

409518 BE384836 Hs.3454 KIAA1821 protein 3.0

416933 BE561850 Hs.80506 small nuclear ribonucleoprotein polypept 3.0

414324 Y14768 Hs.890 lymphotoxin beta (TNF superfamily, membe 3.0

425081 X74794 Hs.154443 minichromosome maintenance deficient (S. 3.0

401519 C15000476*:gi|12737279|ref|XP_012163.1| 3.0

411704 AI499220 Hs.71573 hypothetical protein FLJ10074 3.0

428819 AL135623 Hs.193914 KIAA0575 gene product 3.0

428423 AU076517 Hs.184276 solute carrier family 9 (sodium/hydrogen 3.0

413835 AI272727 Hs.249163 fatty acid hydroxylase 3.0

412600 L28624 Hs.74101 spleen tyrosine kinase 3.0 

410491 AA465131 Hs.64001 Homo sapiens clone 25218 mRNA sequence 3.0

433658 L03678 Hs.156110 immunoglobulin kappa constant 3.0

427666 AI791495 Hs.180142 calmodulin-like skin protein 3.0

452514 AI904898 gb:RC-BT068-13O399-085BT0B8Homosaplen 3.0

429500 X78565 Hs.289114 hexabrachion (tenascin C, cytotactin) 3.0

432485 N90866 Hs.276770 CDW52 antigen (CAMPATH-1 antigen) 3.0

437400 AB011542 Hs.5599 EGF-like-domain, multiple 5 3.0

452234 AW084176 Hs.223296 ESTs, Weakly similar to I38022 hypotheti 3.0

413269 BE167526 gb:C^mfT0509^80300-107^07HT0509Horrw 3.0

453216 AL137566 Hs.32405 Homo sapiens mRNA; cDNA DKFZp586G0321 (f 3.0

400929 ENSP00000252232*:Sterol regulatory eleme 3.0

445145 AI961702 Hs.147434 ESTs 3.0

432615 AA557191 Hs.55028 ESTs, Weakly similar to I54374 gene NF2 3.0

423279 AW959861 Hs.290943 ESTs 3.0

429392 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 3.0

408548 AA055449 Hs.63187 ESTs, Weakly similar to ALUC_HUMAN Hil 3.0

451346 NM_006338 Hs.26312 glioma amplified on chromosome 1 protein 3.0

413109 AW389845 Hs.110855 ESTs 3.0

401714 NA ENSP00000241802*:CDNA FLJ11007 FIS, CLON 3.0

421462 AF016495 Hs.104624 aquaporin 9 3.0

421750 AK000768 Hs.107872 hypothetical protein FLJ20761 3.0

453293 AA382267 Hs.10653 ESTs 3.0

457085 AA412446 Hs.98138 ESTs 3.0

438930 AW843633 Hs.306163 hypothetical protein AL110115 3.0

TABLE 22A



Table 22A shows the accession numbers for those Pkeys lacking UniGene IDs in Table 22.
For each probeset, we have listed the gene cluster number from which the oligonucleotides
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs
and mRNAs. These sequences were clustered based on sequence similarity using Clustering
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers
for sequences comprising each cluster are listed in the "Accession" column.

Pkey: Unique Eos probeset identifier number
CAT number: Gene cluster number
Accession: Genbank accession numbers
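
The three-column layout described above (Pkey, CAT number, Accession) can be read into a simple lookup keyed by Pkey. The following is a minimal sketch, not part of the original disclosure: the function name `parse_table_22a`, the `Cluster` type, and the assumption that each row is whitespace-separated with the CAT number written in the form `1356961_1` are all illustrative.

```python
from typing import Dict, Iterable, List, NamedTuple


class Cluster(NamedTuple):
    """One Table 22A row: a gene cluster and its member accessions (hypothetical type)."""
    cat_number: str
    accessions: List[str]


def parse_table_22a(lines: Iterable[str]) -> Dict[int, Cluster]:
    """Map each Pkey to its cluster, assuming 'Pkey CAT accession...' rows."""
    table: Dict[int, Cluster] = {}
    for line in lines:
        fields = line.split()
        # Skip the header row, blank lines, and anything without a numeric Pkey.
        if len(fields) < 3 or not fields[0].isdigit():
            continue
        table[int(fields[0])] = Cluster(fields[1], fields[2:])
    return table


# Two illustrative rows in the assumed layout.
rows = [
    "Pkey CAT number Accessions",
    "413269 1356961_1 BE167526 BE167651 BE076401 R24654",
    "416935 163179_1 AA190712 AA190665 AA252564",
]
clusters = parse_table_22a(rows)
```

With such a mapping, the accessions behind a probeset that has no UniGene ID in Table 22 can be retrieved with `clusters[413269].accessions`.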

Pkey CAT number Accessions

407980 103087_1 AA046309 AI263500 AA046397
410785 1221055_1 AW803341 AW803265 AW803403 AW803466 AW803402 AW803413 AW803268 AW803396 AW803334 AW803355
411743 1256098_1 AW862214 AW859811 AW862215
412138 1279172_1 AW895387 AW895547 AW895564 AW895323 AW895405 AW895539 AW895538
413269 1356961_1 BE167526 BE167651 BE076401 R24654
416935 163179_1 AA190712 AA190665 AA252564
422128 211994_1 AW881145 AA490718 M85637 AA304575 T06067 AA331991
423945 233566_1 AA410943 AW948953 AA334202 AA332882
424109 235506_1 AW406878 AW966560 AW986151 AW966496 AA336174 AA335376 AA335537
424128 235728_1 AW966163 AA335983 AA336011 AA335668 AA335973
425331 250199_1 AW962128 AA355353 AA427363
426878 273265_1 BE069341 AW748403 AL044891 AI908240 AA393080
432745 353673_1 AI821926 AA658826 AA564492 AA635129 AI791191
441153 51084_2 BE562828 BE378727
448212 755099_1 AI475858 AW969013
451128 859865_1 AL118668 D78823 AI762176
452514 920172_1 AI904898 AI304849 AI904899
456207 165078_-1 AA193450

TABLE 22B



Table 22B shows the genomic positioning for those Pkeys lacking UniGene IDs and
accession numbers in Table 22. For each predicted exon, we have listed the genomic
sequence source used for prediction. Nucleotide locations of each predicted exon are also
listed.

Pkey: Unique number corresponding to an Eos probeset
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495.
Strand: Indicates DNA strand from which exons were predicted.
Nt_position: Indicates nucleotide positions of predicted exons.
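
The Nt_position column lists one or more predicted exons as comma-separated `start-end` ranges on the referenced genomic sequence. A minimal sketch of how such a field might be read is given below; the function names are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, since the text does not state the convention.

```python
from typing import List, Tuple


def parse_exon_positions(nt_position: str) -> List[Tuple[int, int]]:
    """Split a field such as '72840-72924,74761-74849' into (start, end) pairs."""
    exons: List[Tuple[int, int]] = []
    for part in nt_position.split(","):
        start_text, end_text = part.strip().split("-")
        start, end = int(start_text), int(end_text)
        if start > end:
            raise ValueError(f"exon start exceeds end: {part!r}")
        exons.append((start, end))
    return exons


def total_exon_length(exons: List[Tuple[int, int]]) -> int:
    """Sum exon lengths, assuming inclusive coordinates (an assumption)."""
    return sum(end - start + 1 for start, end in exons)


# Nt_position value listed for Pkey 400814 (GI 8569925, minus strand).
exons_400814 = parse_exon_positions("72840-72924,74761-74849")
length_400814 = total_exon_length(exons_400814)
```

A field that does not split into two integers around a hyphen raises `ValueError`, which is a convenient way to flag entries whose separator was lost in transcription.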



Pkey Ref Strand Nt_position

400814 8569925 Minus 72840-72924,74761-74849
400929 7651921 Minus 122033-122241,123483-124028
401045 8117619 Plus 90044-90184,91111-91345
401451 6634068 Minus 119926-121272
401519 6649315 Plus 157315-167950
401645 7657839 Minus 34986-35133
401714 6715702 Plus 96484-96681
401866 8018106 Plus 73126-73623
402327 7656695 Minus 108675-108770,109801-109910
402359 9211204 Minus 4040341961
402408 9796239 Minus 110326-110491
402470 9797107 Plus 195129-195776
402542 9801558 Minus 67076-67594
402578 9884928 Plus 66350-66496
402606 9909429 Minus 8174742094
403011 6693597 Minus 34684623
403212 7630897 Minus 156037-156210
403329 8516120 Plus 96450-96598
403366 8783692 Minus 4932349652
403485 9966528 Plus 2888-3001,3198-3532,36554117
404347 9838195 Plus 74493-74829
404580 6539738 Minus 240588-241589
404755 7706327 Minus 5372943846
405017 6532084 Plus 35551-35690
405348 2914717 Minus 4331043462
405381 6006920 Minus 76364054
405801 2924321 Plus 6346943694
405850 6164995 Plus 13871-14110
406153 9929734 Minus 12902-13069
406348 9255985 Minus 71754-71944

TABLE 23: 320 GENES DOWN-REGULATED IN BREAST CANCER

COMPARED TO NORMAL ADULT BREAST 






Table 23 shows 320 genes down-regulated in breast cancer compared to normal adult breast.
These were selected as for Table 22, except that the numerator was set to the median value
for 12 non-malignant breast specimens, the denominator was set to the median value amongst
the 73 breast cancers, the 90th percentile value amongst the 12 non-malignant breast
specimens was greater than or equal to 80 units, and the ratio was greater than or equal to 4.0
(i.e., 4-fold down-regulated in tumor vs. normal breast).
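
The criteria stated above can be sketched as a small filter over per-probeset expression values. This is only an illustration under stated assumptions: the function names are hypothetical, the nearest-rank percentile is one of several common definitions (the text does not say which was used), the handling of a non-positive tumor median is an added guard, and any further criteria inherited from Table 22 are not reproduced here.

```python
import math
from statistics import median
from typing import Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Nearest-rank percentile (an assumed definition)."""
    ordered = sorted(values)
    rank = max(1, math.ceil(pct / 100 * len(ordered)))
    return ordered[rank - 1]


def is_down_regulated(
    normal: Sequence[float],
    tumor: Sequence[float],
    min_expression: float = 80.0,  # 90th percentile threshold in normal breast
    min_ratio: float = 4.0,        # median normal / median tumor
) -> bool:
    """Apply the two thresholds stated for Table 23 to one probeset."""
    if percentile(normal, 90) < min_expression:
        return False
    tumor_median = median(tumor)
    if tumor_median <= 0:
        # Guard added for this sketch; the source does not describe this case.
        return False
    return median(normal) / tumor_median >= min_ratio


# Made-up expression units for 12 normal specimens and a few tumors.
normal_units = [100, 120, 90, 110, 95, 130, 105, 115, 98, 102, 125, 108]
low_tumor_units = [20, 25, 30, 22, 28]
high_tumor_units = [50, 60, 55]
selected = is_down_regulated(normal_units, low_tumor_units)
rejected = is_down_regulated(normal_units, high_tumor_units)
```

In this example the first probeset has a median ratio of 106.5 / 25, which exceeds 4.0, while the second has a ratio below 2 and is excluded.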



Pkey: Unique Eos probeset identifier number
ExAccn: Exemplar Accession number, Genbank accession number
UnigeneID: Unigene number
Unigene Title: Unigene gene title
R1: Ratio of 50th percentile normal body tissue to 75th percentile tumor



Pkey ExAccn UnigeneID Unigene Title Ratio

428722 U76455 Hs.190787 tissue inhibitor of metalloproteinase 4 22.4
428848 NM_000230 Hs.194236 leptin (murine obesity homolog) 17.4
445263 H57646 Hs.42586 KIAA1560 protein 15.4
418935 T28499 Hs.89485 carbonic anhydrase IV 15.0
407228 M25079 Hs.155376 hemoglobin, beta 14.6
417511 AL049176 Hs.82223 chordin-like 14.6
439285 AL133916 Hs.172572 hypothetical protein FLJ20093 14.3
412442 AI983730 Hs.26530 serum deprivation response (phosphatidyl 13.6
410544 AI446543 Hs.95511 ESTs 12.6
412047 AA934589 Hs.49696 ESTs 12.2
422657 H25642 Hs.133471 ESTs 12.0
406654 L34041 Hs.9739 glycerol-3-phosphate dehydrogenase 1 (so 12.0
423201 NM_000163 Hs.125180 growth hormone receptor 11.7
422163 AF027208 Hs.112360 prominin (mouse)-like 1 10.8
428769 AW207175 Hs.106771 ESTs 10.6
407049 X72632 NM_021724*:Homo sapiens nuclear receptor 10.1
412295 AW088826 Hs.117176 poly(A)-binding protein, nuclear 1 9.8
425126 N32759 Hs.172944 chorionic gonadotropin, beta polypeptide 9.8
405791 AI220684 Hs.272572 hemoglobin, alpha 2 9.5
447471 AF039843 Hs.18676 sprouty (Drosophila) homolog 2 9.5
451533 NM_004657 Hs.26530 serum deprivation response (phosphatidyl 9.4
419407 AW410377 Hs.41502 hypothetical protein FLJ21276 9.0
411939 AI365585 Hs.146246 ESTs 9.0
410532 T53088 Hs.155376 hemoglobin, beta 8.9
425707 AF115402 Hs.11713 E74-like factor 5 (ets domain transcript 8.8
416585 X54162 Hs.79386 leiomodin 1 (smooth muscle) 8.7
443050 D78874 Hs.8944 procollagen C-endopeptidase enhancer 2 8.6
435265 AA779958 Hs.185932 ESTs 8.5
422511 AU076442 Hs.117938 collagen, type XVII, alpha 1 8.4
433138 AB029496 Hs.59729 semaphorin sem2 8.3
402195 NM_004497*:Homo sapiens hepatocyte nucle 8.1
429350 AI754634 Hs.131987 ESTs 8.1
445107 AI208121 Hs.147313 ESTs, Weakly similar to I38022 hypotheti 8.1
406643 N77976 Hs.272572 hemoglobin, alpha 2 8.0
410199 AW377424 Hs.205126 Homo sapiens cDNA: FLJ22667 fis, clone H 8.0
417225 AA815048 Hs.24078 hypothetical protein FLJ12649 7.8
437569 M760849 Hs.294052 ESTs 7.5
436062 AK000027 Hs.98633 ESTs 7.5
425078 NM_002599 Hs.154437 phosphodiesterase 2A, cGMP-stimulated 7.5
430327 AW973636 Hs.55931 ESTs 7.4

447577 AI393693 Hs.183297 DKFZP566F2124 protein 7.4

446039 AI150491 Hs.90756 ESTs 7.2 

422060 R20893 Hs.325823 ESTs, Moderately similar to ALU5_HUMAN A 7.2

424455 AA452006 Hs.333199 ESTs 7.1

424343 AW956360 Hs.4748 adenylate cyclase activating polypeptide 7.1

442792 AI352340 Hs.131194 ESTs 7.0

406714 AI219304 Hs.283108 hemoglobin, gamma G 6.9

407571 AI446183 Hs.9572 ESTs, Highly similar to CYA5_HUMAN ADENY 6.8

429580 AA346839 Hs.209100 DKFZP434C171 protein 6.7

453500 AI478427 Hs.43125 esophageal cancer related gene 4 protein 6.7

422233 AB002058 Hs.113275 purinergic receptor P2X-like 1, orphan r 6.7

420205 AA256395 Hs.88156 ESTs 6.6

404368 NA ENSP00000241075*:TRRAP PROTEIN. 6.6

447261 NM_006691 Hs.17917 extracellular link domain-containing 1 6.5

417090 AA193282 Hs.85863 ESTs, Weakly similar to B34612 zinc fing 6.5

410677 NM_003278 Hs.65424 tetranectin (plasminogen-binding protein 6.5

427283 AL119796 Hs.174185 ectonucleotide pyrophosphatase/phosphodi 6.5

415011 AW963085 gb:EST375158 MAGE resequences, MAGH Homo 6.4

412068 S72043 Hs.73133 metallothionein 3 (growth inhibitory fac 6.4

416253 BE250659 Hs.15463 Homo sapiens, clone IMAGE:2959994, mRNA 6.4

435885 AA701483 Hs.36341 ESTs 6.3

402779 NA Target Exon 6.3

418138 AA213626 Hs.136204 EST 6.3

439335 AA742697 Hs.62492 ESTs, Weakly similar to B39066 proline-r 6.3

427019 AA001732 Hs.173233 hypothetical protein FLJ10970 6.2

411478 BE143068 gb:MRm0158-030200-003-b09HT0158Homo 6.2

452654 BE004783 gb:MR2-BN0114-27040M04-e11 BN0114 Homo 6.1

447359 NM_012093 Hs.18268 adenylate kinase 5 6.1

414323 NM_014759 Hs.334688 KIAA0273 gene product 6.1

441266 H15968 Hs.293845 Homo sapiens, clone IMAGE:3502329, mRNA, 6.1

417011 F08212 Hs.234898 ESTs, Weakly similar to 2109260A B cell 6.0

400089 NA Eos Control 6.0 

433614 W07475 Hs.277101 cytochrome c oxidase subunit IV isoform 5.9

440439 N92818 Hs.64754 ESTs, Weakly similar to potential COS (H 5.9

454404 BE067414 gb:MR4-BT0355-200100-201-e05 BT0355 Homo 5.9

436704 AA062610 Hs.148050 EST 5.9

406583 NA Target Exon 5.9

433490 AW451023 Hs.65848 hypothetical protein DKFZp7610132 5.9

419313 AA843387 Hs.87279 ESTs 5.9

409196 NM_001874 Hs.334873 carboxypeptidase M 5.8

410882 AW809163 gb:MR4-ST0118-261099412-a03 ST0118 Homo 5.8

453469 AB014533 Hs.33010 KIAA0833 protein 5.8

441899 AI372568 Hs.8022 TU3A protein 5.8

426210 AA372052 Hs.334559 Homo sapiens cDNA FLJ14458 fis, clone HE 5.8

413065 BE063555 gb:CM1-BT0283-081199-033-d09 BT0283 Homo 5.8

454192 AW876813 Hs.3343 phosphoglycerate dehydrogenase 5.7

425187 AW014486 Hs.22509 ESTs 5.7

429757 AW452355 Hs.256037 ESTs 5.7

420202 AL036557 Hs.95910 putative lymphocyte G0/G1 switch gene 5.7

416284 AI695473 Hs.298006 ESTs 5.7

428553 AA181641 Hs.184907 G protein-coupled receptor 1 5.6

404689 NA Target Exon 5.6

438887 R68857 Hs.265499 ESTs 5.6

406082 S47833 Hs.82927 adenosine monophosphate deaminase 2 (iso 5.6

449748 H23963 Hs.32043 ESTs 5.6

431048 R50253 Hs.249129 cell death-inducing DFFA-like effector a 5.5

452205 C15819 gb:C15819 Clontech human aorta polyA mRN 5.5

434040 AW444613 Hs.288809 hypothetical protein FLJ20159 5.5

407744 AB020629 Hs.38095 ATP-binding cassette, sub-family A (ABC1 5.5

450606 AI668605 Hs.60380 ESTs, Moderately similar to ALU6_HUMAN A 5.5

414629 AA345824 Hs.76688 carboxylesterase 1 (monocyte/macrophage 5.5

401665 C110OT703gi|10048448|ref|NP_065258.1|g 5.5

436107 T99079 Hs.191194 ESTs 5.5

444432 AI161428 Hs.75916 splicing factor 3b, subunit 2, 145kD 5.5

434715 BE005346 Hs.116410 ESTs 5.5

447205 BE617015 Hs.11006 ESTs, Moderately similar to T17372 plasm 5.5

408122 AI432652 Hs.42824 hypothetical protein FLJ10718 5.5



454016 

414913 

459033 

441003 

450637 

442398 

403612 

407102 

410057 

428232 

432769 

431344 

427032 

406305 

437411 

442800 

402054 

432085 

415313 

459159 

427164 

441391 

458959 

402698 

401810 

438879 

414657 

427809 

456063 

451186 

451882 

402583 

431130 

458218 

416083 

455282 

426488 

426156 

407891 

408610 

445967 

434813 

437526 

454775 

409451 

409853 

405062 

446490 

417622 

421978 

440338 

415421 

417574 

409882 

447998 

445613 

443074 

451324 

432433 

449654 

414519 

457531 

433200 

430782 

427555 

433545 

420334 



AW016806 . 

R25621 

AA017590 

BE172240 

N49826 ' 

AA994520 

NA 

AA007629 

R66634 

BE272452 

AA620814 

R99530 ' 

AF012023 

BE261320 , 

AW613948 

AI809481 

NA 

AF212829 

R59638 

AI9G4646 

AB037721 

BE467930 

AI285901 

NA 

NA 

AA827674 

AA424074 

M26380 

NM.006744 

AW023469 

A1821324 

NA 

NM 006103 

AI435179 

R53467 . 

BE143867 

X03350 

BE244537 

AA486620 

AW026692 

D59597 

AI524307 

AI076012 

BE160229 

AF012626 

AW502327 

AK000706 

AW298163 

AJ243662 

R62431 

R35009 

R00348 

AJ243191 

A1768289 

BE550889 

AW341470 

AI783600 

AW014734 

AI989812 

N94587 

AW973716 

AA682722 

AF026263 

AW137094 

AA868510 

AI349351 



.Hs.233108 ESTs 



H3.129907 
Hs.126379 
Hs.18602 



Hs.9739 

Hs.268107 

Hs.183109 

Hs.144959 

Hs.272572 

HS.173274 

Hs.158196 

Hs.194915 

Hs.131227 

Hs.272406 
Hs.6181 

Hs.173871 
Hs.170381 
Hs.181297 



Hs.189073 

Hs.76780 

Hs.180878 

Hs.76461 

Hs.65256 

Hs.100445 

Hs.2719 

Hs.126820 

Hs.269122 

Hs.4 

Hs.167382 

Hs.41135 

Hs.224829 

Hs.1 18821 

Hs.162870 

Hs.121388 

Hs.54472 



Hs.15125 
Hs.82318 
Hs.1 10196 
Hs.12758 
Hs.24903 

Hs.56874 

Hs.304389 

Hs.158491 

Hs.144907 

Hs.208052 

Hs.157969 

Hs.199850 

Hs.55063 

Hs.13913 

Hs.192725 

Hs.247920 

Hs.97990 

Hs.1 12496 

Hs.118944 



gb^h45f06.rl Soares placenta Nb2HP Homo 
ESTs 

ESTs, Weakly similar to 138022 hypotheti 
ESTs 



5.5 
5.4 
5.4 
5.4 
5.4 



gb:ou42g09.s1 SoaresJIFLJ_GBC_S1 Homos5.4 

Target Exon 5.3 

glycerol-3-phosphate dehydrogenase 1 (so 5.3

multimerin 5.3

monoamine oxidase A 5.3

ESTs 5.3

hemoglobin, alpha 2 5.3

integrin cytoplasmic domain-associated p 5.3

transcriptional adaptor 3 (ADA3, yeast h 5.3 

ESTs 5.3 

ESTs 5.3 

Target Exon 5.3 

potassium channel, subfamily K, member 9 5.3 

ESTs 5.2 
gb:QV-BT065420399-103 BT065 Homo sapien 52 

KIAA1300 protein 5.2 

ESTs 5.2 

ESTs 5.2 

ENSP00000251335*:DJ1003J2.1 {sodium and 5.2 

Target Exon 5.2 

ESTs 5.2 

protein phosphatase 1 , regulatory (Inhtb 5.2 

lipoprotein lipase 5.1 

retinoi-binding protein 4, Interstitial 5.1 

ESTs, Weakly similar to leucine-fich gli 5.1 

ESTs 5.1 

NM_021620:Homo sapiens PR domain contain 5.1 

HE4; WFDC2; putative ovarian carcinoma m 5.1 

ESTs 5.1 

ESTs, Weakly similar to ALULH^MAN ALU S 5.1 
gb:MRO-HT0164-070100-013-h02 HT0164 Homo5.1 

alcohol dehydrogenase 1B (class I), beta 5.1

natriuretic peptide receptor A/guanylate 5.1

endomucin-2 5.0

ESTs 5.0 

CGI-62 protein 5.0 

ESTs 5.0 
ESTs, Weakly similar to MDHC_HUMAN MALAT 5.0 
gb:QV1-HT0413490200-062-a12 HT0413 Homo 5.0 

fragile X mental retardation 2 5.0 

gb:UWF-BROr>aka-a-07*Ul.r1 NIH_MGC_5 5.0 

Target Exon 5.0 

hypothetical protein FU20699 5.0 

WAS protein family, member 3 5.0 

NICE-1 protein 5.0 

ESTs 5.0 

ESTs 5.0 

gb:ye69e06.r1 Soares fetal liver spleen 5.0 

heat shock 27kD protein family, member 7 5.0 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

ESTs 4.9 

KIAA1577 protein 4.9 

ESTs 4.9 

cholinergic receptor, muscarinic 5 4.8 

ESTs 4.8 

ESTs 4.8 

hypothetical protein FLJ22477 4.8
421795
427138 
413072 
443721 
408053 
427067 
442969 
426220 
414593 
426893 
434046 
401590 
457971 
427722 
443793 
407737 
441955 
441499 
447517 
403017 
450580 
404611 
414831 
459290 
444341 
408614 
449638 
434418 
447360 
419583 
440698 
451199 
438338 
433756 
423301 
417237 
439745 
424137 
449338 
434744 
407402 
443510 
415754 
415986 
457416 
418064 
437120 
453950 
401093 
436935 
457974 
428222 
442705 
437409 
458494 
410490 
416069 
438463 
444043 
438327 
457711 
400870 
441425 
416267 
407262 
444567 
403263 



X63094 

N77624 

BE063965 

AW450451 

AW139474 

AA843716 

AI025499 

AI383475 

BE386764 

AA398716 

AW292618 

NA 

AW134679 

AK000123 

AA045290 

R49187 

AA972327 

AW298235 

AI382726 

N40087 

H58589 

M31158 

NMJ01546 

AI142027 

AL137698 

AW204277 

AF134707 

AI375984 

F00312 

AI348455 

AI290653 

NMJJ14861 

AW015933 

S67580 

H86385 , 

AL389981 

AA335769 

H73444 

N94835 

AF035303 

NNL012190 

AA169114 

Z43619 

BE142052 

BE387287 

AI356125 

AA156998 

AW206494 

AW842353 

AL133112 

AI264634 

AL359599 

AI380906 

H03589 

R37101 

AA807958 

AI499723 

H87407 

AF147401 

AA933590 

H45384 

M12873 

AV654020 

NA 



Hs.283822 Rhesus blood group, D antigen 4.8 

Hs.173717 phosphatidic acid phosphatase type 2B 4.8 
gb:QV3-BT0296-140200-085-h01 BT0296 Homo 4.8 

Hs^66355 ESTs 4.8 

Hs.246862 ESTs 4.8 
Hs.177927 ESTs 
Hs.132238 ESTs 

Hs.171697 ESTs, Weakly similar to T13924 sdk prate 

gb:601273249F1 NIH.MGC JO Homo sapiens c 
Hs.97418 ESTs 
Hs.113011 ESTs 

Target Exon 
Hs.242849 ESTs 

Hs.160479 hypothetical protein FLJ20116 

Hs.25930 ESTs, Weakly similar to 2109260A B cell 4.6 

Hs.6659 ESTs 4.6 

Hs.142903 ESTs 4.6 

Hs.101689 ESTs 4.6 

Hs.182434 ESTs 4.6 

Target Exon 4.6 

Hs.15248 ESTs 4.6 

Hs.35156 Homo sapiens cDNA FLJ11027 fis, clone PL 4.6 

Hs.77439 protein kinase, cAMP-dependent, regulato 4.6 

Hs.34853 inhibitor of DNA binding 4, dominant neg 4.6 
Hs.146650 ESTs 
Hs.46531 Homo sapiens mRNA; cDNA DKFZp434C1915 (f 4.6 

Hs.250723 hypothetical protein MGC2747 4.6 

Hs.278679 a disintegrin and metalloproteinase doma 4.6 

Hs.167216 ESTs 4.6 

gb:HSBB0D101 STRATAGENE Human skeletal m 4.6 

Hs.147492 Homo sapiens cDNA FLJ11777 fis, clone HE 4.6 

Hs.124758 ESTs 4.6 

Hs.6168 KIAA0703 gene product 4.6 

Hs.112654 Homo sapiens, clone MGC:9764, mRNA, comp 4.5 

Hs.1645 cytochrome P450, subfamily IVA, polypept 4.5 

Hs.81737 palmitoyl-protein thioesterase 2 4.5 

Hs.149219 Homo sapiens mRNA full length insert cDN 4.5 

Hs.16262 ESTs 4.5 

Hs.394 adrenomedullin 4.5 

Hs.283828 Homo sapiens genomic DNA, chromosome 21q 4.5 

gb:Homo sapiens clone 23943 mRNA sequenc 4.5 

Hs.9520 formyltetrahydrofolate dehydrogenase 4.5 

Hs.12247 hypothetical protein FLJ11413 4.5 

gb:HSC1GE121 normalized infant brain cDN 4.5 
Hs.62654 kringle-containing transmembrane protein 
Hs.83384 S100 calcium-binding protein, beta (neur 
Hs.157767 ESTs, Weakly similar to HXA2_HUMAN HOMEO 
Hs.211568 eukaryotic translation initiation factor 

C12000586*:gi|6330167|dbj|BAA86477.1| (A 
Hs.253560 ESTs 

Hs.321717 ESTs, Weakly similar to S22765 heterogen 
Hs.183085 Homo sapiens mRNA; cDNA DKFZp434K09B (fr 
Hs.131127 ESTs 

Hs.283850 Homo sapiens mRNA; cDNA DKFZp547C126 (fr 
Hs.158436 ESTs 

gb:yj42d08.r1 Soares placenta Nb2HP Homo 
Hs.20982 ESTs 
Hs.314232 ESTs 
Hs.135089 ESTs 

Hs.172944 chorionic gonadotropin, beta polypeptide 
Hs.23917 ESTs 

C11000905:gi|11692565|gb|AAG39879.1|AF28 
Hs.28937 homeobox protein from AL590526 

gb:yn99c10.r1 Soares adult brain N2b5HB5 

gb:Human Ig rearranged H-chain mRNA VDJ4 

Hs.184261 ESTs, Weakly similar to T26686 hypotheti 4.3 

Target Exon 
410034 BE067414 
45^804 AI421645 
448427 BE395260 
416931 D45371 
421296 NM_002666 
400973 NA 
452602 AW366194 
412330 NM_005100 
405016 
435104 AI475671 
406118 
418556 T02850 
429745 AA480818 
433088 AW451206 
444445 AA342329 
453880 AI803166 
447384 AI377221 
414541 BE293116 
444975 AV652165 
403921 NA 
451477 AI798425 
406344 
416970 AA191201 
413662 BE155866 
458504 AW070634 
404682 NA 
418089 N69913 
403433 
446532 AW975460 
414217 AI309298 
418425 AI871247 
419589 AW973708 
457029 AA397789 
447860 AF193807 
448988 Y09763 
440610 AI733098 
439590 AF086410 
427240 AA399975 
408932 AW594172 
436112 T77545 
444382 AI144152 
456716 AA318060 
419846 NM_015977 
410036 R57171 
400545 NA 
403051 NA 
420139 NM_005357 
450244 AA007534 
453261 AA034116 
440246 W52010 
414516 AI307802 
438232 AI150595 
410233 AA082947 
412179 BE270758 
441871 AI306150 
426411 AK000708 
453692 AL110416 
448640 AW817177 
417481 AA203281 
412912 AW118878 
454183 AW807116 
426328 AW631296 
435942 R06285 
417629 T76945 
403593 NA 
402690 
418190 R49591 



gb:MR4-BT0355-200100-201-e05 BT0355 Homo 4.3 

Hs.139851 caveolin 2 4.3 

Hs.309438 EST 4.3 

Hs.80485 adipose most abundant gene transcript 1 4.3 

Hs.103253 perilipin 4.3 

ENSP00000236667*:Mucin 5B (Fragment). 4.3 

Hs.55962 ESTs 4.3 

Hs.788 A kinase (PRKA) anchor protein (gravin) 4.3 

CY000171*:gi|9280405|gb|AAF86402.1|AF245 4.3 

Hs.88607 ESTs, Highly similar to F-box protein FB 4.3 
ENSP00000246632:CDNA FLJ20261 fis, clone 4.3 

gb:FB12A9 Fetal brain, Stratagene Homo s 4.3 

Hs.221736 ESTs 4.3 

Hs.115899 ESTs 4.3 

Hs.115920 Homo sapiens cDNA: FLJ22816 fis, clone K 4.3 

Hs.28462 ESTs, Weakly similar to (38022 hypotheti 4.3 

Hs.40528 ESTs 4.2 

Hs.76392 aldehyde dehydrogenase 1 family, member 4.2 

Hs.182482 ESTs, Weakly similar to T00362 hypotheti 4.2 

C5000212*:gi|10047237|dbj|BAB13407.1| (A 4.2 

Hs.42710 ESTs 4.2 

C5001660:gi|11611537|dbj|BAB18935.1| (AB 4.2 

Hs.35861 DKFZP586E1621 protein 4.2 

Hs.25522 KIAA1808 protein 4.2 

Hs.144794 ESTs 4.2 

C9001188*:gi|12738842|ref|NP_073725.1|p 4.2 

Hs.6858 ESTs, Weakly similar to I78885 serine/th 4.2 

NM_001622:Homo sapiens alpha-2-HS-glycop 4.2 

Hs.143563 ESTs 4.2 

Hs.279898 Homo sapiens cDNA: FLJ23165 fis, clone L 4.2 

Hs.626'2 hypothetical protein MGC8407 4.2 

Hs.201925 Homo sapiens cDNA FLJ13446 fis, clone PL 4.2 

Hs.161803 ESTs 4.2 

Hs.131835 Rhesus blood group, B glycoprotein 4.2 

Hs.22765 gamma-aminobutyric acid (GABA) A recepto 4.2 

Hs.130800 ESTs 4.2 

gb:Homo sapiens full length insert cDNA 4.2 

Hs.274151 ligatin 4.2 

Hs.278513 TP53TG3 protein 4.2 

Hs.187559 ESTs 4.2 

Hs.58246 ESTs 4.2 

Hs.135121 hypothetical protein FLJ22415 4.2 

Hs.285681 Williams-Beuren syndrome chromosome regi 4.2 

Hs.57975 calsequestrin 2 (cardiac muscle) 4.1 

Target Exon 4.1 

Target Exon 4.1 

Hs.95351 lipase, hormone-sensitive 4.1 

Hs.125062 ESTs 4.1 

Hs.118494 ESTs 4.1 

Hs.191379 ESTs 4.1 

Hs.135560 ESTs, Weakly similar to T43458 hypotheti 4.1 

Hs.122226 ESTs 4.1 

gb:zn10g07.s1 Stratagene hNT neuron (937 4.1 

Hs.69428 hypothetical protein MGC3020 4.1 

Hs.153450 ESTs, Weakly similar to 1909123A Na glue 4.1 

Hs.169764 hypothetical protein FLJ20701 4.1 

gb:DKFZp434K0431_f1 434 (synonym: htes3) 4.1 

Hs.102558 Homo sapiens, clone MGC:5352, mRNA, comp 4.1 

Hs.21798 ESTs 4.1 

Hs.110835 ESTs 4.1 
gb:MR4-ST0062-04010CM)24.b12 ST0062 Homo 4.1 

gb:hh83o09.y1 NCLCGAPJ5U1 Homo sapiens 4.1 

Hs.191215 ESTs 4.1 

gb:yc92c07.r1 Soares infant brain 1NIB H 4.1 

Target Exon 4.0 

Target Exon 4.0 

Hs.270425 ESTs 4.0 
408641 AW245207 Hs.5555 hypothetical protein MGC5347 4.0 

427099 AA829286 Hs.332053 serum amyloid A1 4.0 

445975 AI811536 Hs.145734 ESTs 4.0 

438831 BE263273 Hs.6439 synapsin II 4.0 

455578 BE006350 Hs.14355 Homo sapiens cDNA FLJ13207 fis, clone NT 4.0 

401840 NA Target Exon 4.0 

413753 U17760 Hs.75517 laminin, beta 3 (nicein (125kD), kalinin 4.0 

445030 AI205925 Hs.147238 ESTs, Highly similar to AAC3_HUMAN ALPHA 4.0 

433873 AW156913 Hs.150478 ESTs, Weakly similar to A Chain A, Cryst 4.0 

456736 AW248217 Hs.1619 achaete-scute complex (Drosophila) homol 4.0 

450112 BE047734 Hs.5473 ESTs, Moderately similar to ALU5_HUMAN A 4.0 

448906 AI589567 Hs.309719 ESTs 4.0 
TABLE 23A 

Table 23A shows the accession numbers for those pkeys lacking unigene ID's for Table 23. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accessions 

409853 1156226_1 AW502327 AW502488 AW501829 AW502625 AW502687 
410034 1170594_1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 
410233 118656_1 AA082947 AA083036 
410490 1205347_1 H03589 AW750687 AW750688 
410882 1225686_1 AW809163 AW809247 AW809177 AW809190 AW809225 
411478 1247073_1 BE143068 AW849143 AW848705 AW848569 AW848071 AW848475 AW848092 AW848C05 
413065 1347960_1 BE063555 BE151321 BE151319 BE151657 BE151655 BE063556 BE151322 
413072 1348163_1 BE063965 BE063968 BE064034 BE064028 BE063874 BE063966 BE063869 BE064043 BE064033 BE083884 
414593 1464909_1 BE386764 BE387560 
414913 1506721_1 R25621 C03959 C04010 
415011 151328_1 AW963085 AA159005 AW963073 
415986 1564410_1 Z43619 R61274 H12206 R12883 
416267 1583547_1 H45384 H49125 H41699 
417574 1687770_1 R00348 R09593 
417629 1690392_1 T76945 R20210 R05755 
418556 1767866_-1 T02850 
419583 186198_1 F00312 AA247490 F31427 AA383663 F22045 
426328 264901_1 AW631296 AA375484 
439590 47413_1 AF086410 W94386 W74609 
442398 541271_1 AA994520 AW393574 
452205 90415_1 C15819 AA024741 AA024742 
452654 925931_1 BE004783 BE004947 AI911790 
453692 977825_1 AL110416 AW876759 
454183 1049636_1 AW807116 AW807569 AW807415 AW807338 AW807288 AW807263 AW807316 AW177402 AW807413 AW807068 BE141561 
    BE141569 AW807401 AW807310 BE141565 AW807318 AW807119 AW807299 AW807241 AW807225 AW807204 AW807345 
    AW807103 BE141615 AW807431 AW807393 AW807337 AW807406 AW807259 AW807375 AW845890 AW807220 AW807399 
    AW807064 AW807376 AW807024 BE141595 AW807236 AW807027 AW807112 AW807378 AW807202 BE141593 AW807216 
    AW807138 AW807244 AW807221 AW807297 AW807050 AW807248 AW807404 AW807075 AW807237 AW807212 AW807308 
    AW807110 AW807104 BE140912 AW807301 AW807382 AW807294 AW807026 AW807020 AW807108 AW807025 AW807433 
    AW807124 AW807419 AW807031 AW807264 AW807032 AW807029 AW807052 AW807391 AW807207 AW807215 AW807019 
    AW807238 AW807201 BE141590 AW807302 AW807323 AW807380 AW807109 BE141588 AW845877 AW807418 AW807407 
    AW807309 BE141614 AW845861 AW807396 AW807300 AW807348 AW807311 AW807214 AW807132 AW807402 AW807350 
    AW807028 AW807298 AW807291 AW807305 AW807217 AW807312 AW807261 AW807268 AW807260 AW807266 AW807198 
    AW807346 AW807315 BE140940 AW807351 AW807122 AW807235 AW807076 AW807314 AW807213 AW807143 AW807131 
    AW800079 AW807066 AW807077 AW845923 AW807421 BE141619 BE140943 AW807420 AW807381 BE141621 AW845921 
    BE141629 BE141625 BE141624 BE141636 BE141630 AW807405 AW807290 AW807353 
454404 1170594_1 BE067414 BE067958 BE067419 BE067963 AW577127 AW601412 
454775 1234106_1 BE160229 AW819879 AW820179 AW819882 AW819876 AW820169 BE153201 AW993736 BE152911 
455282 1273020_1 BE143867 AW935060 AW886684 
459159 919998_1 AI904646 BE179494 BE179421 
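
The grouping of Genbank sequences into gene clusters described for Table 23A can be illustrated with a minimal sketch. This is not the DoubleTwist Clustering and Alignment Tools procedure, whose details are not given here; it assumes a simple k-mer Jaccard similarity and single-linkage merging. The function names, the threshold, the value of k, and the accession labels in the usage note are all hypothetical.

```python
from itertools import combinations


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_by_similarity(seqs, k=8, threshold=0.5):
    """Single-linkage clustering of sequences (illustrative assumption only).

    seqs maps an accession label to its nucleotide sequence. Two sequences
    are linked when the Jaccard similarity of their k-mer sets reaches the
    threshold; linked sequences end up in the same cluster.
    """
    parent = {acc: acc for acc in seqs}

    def find(x):
        # Follow parent links to the cluster representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    profiles = {acc: kmers(s.upper(), k) for acc, s in seqs.items()}
    for a, b in combinations(seqs, 2):
        if jaccard(profiles[a], profiles[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for acc in seqs:
        clusters.setdefault(find(acc), []).append(acc)
    return sorted(sorted(members) for members in clusters.values())
```

For example, calling `cluster_by_similarity({"est_a": s1, "est_b": s1, "est_c": s2}, k=4)` with two identical sequences and one unrelated sequence places `est_a` and `est_b` in one cluster and `est_c` in another; each resulting cluster plays the role of one "CAT number" row with its list of accessions.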
TABLE 23B 

Table 23B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 23. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 



Pkey Ref Strand Nt_position 
400545 9800107 Minus 124618-124881 
400870 9838306 Minus 34081-35027 
400973 7960452 Minus 98119-98253 
401093 8516137 Minus 22335-23166 
401590 9966320 Minus 33547^3649 
401665 7145001 Plus 121591-122537 
401810 7342191 Plus 129063-129476 
401840 7684597 Plus 56283-56439 
402054 8083691 Minus 828&8806 
402195 7689778 Minus 147901-148884 
402583 7684486 Plus 94883-95003 
402690 8348058 Plus 13368-13998 
402698 8570304 Minus 108641-108903 
402779 9588555 Minus 38173-39210 
403017 6693623 Plus 78630-79367 
403051 4827080 Minus 5269-5411 
403263 7770677 Plus 52431-52737 
403433 9719611 Minus 72225-72437 
403593 6862650 Minus 62554-62712,69449-69602 
403612 8469060 Minus 94723-94859 
403921 7711590 Minus 3297-3536 
404368 7630956 Minus 102053-102199 
404682 9797231 Minus 4097741150 
404689 7534100 Plus 119461-119717 
405016 6524300 Plus 51997-53308 
405062 7657730 Plus 101283-101432 
406118 9143818 Plus 53997-54629 
406344 9255974 Plus 20254-20374,20526-20659,20835-21097 
406563 7711604 Plus 34401-34538 
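
The nucleotide position entries of Table 23B are either a single "start-end" range or a comma-separated list of such ranges for multi-exon predictions. A minimal sketch of how such an entry could be parsed is given below. The function names are hypothetical, and the exon lengths assume 1-based inclusive coordinates, which the table does not state explicitly.

```python
def parse_nt_position(field):
    """Parse 'start-end[,start-end...]' into a list of (start, end) tuples.

    Raises ValueError for entries that do not follow this pattern, so
    damaged entries are reported rather than silently guessed.
    """
    exons = []
    for part in field.split(","):
        start, end = part.strip().split("-")
        exons.append((int(start), int(end)))
    return exons


def exon_lengths(exons):
    """Length of each exon, assuming 1-based inclusive coordinates."""
    return [end - start + 1 for start, end in exons]
```

For the entry of pkey 403593, `parse_nt_position("62554-62712,69449-69602")` yields two exons, and `exon_lengths` gives 159 and 154 nucleotides under the inclusive-coordinate assumption. An entry that lacks the hyphen separator raises `ValueError` and should be checked against the original filing.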
TABLE 24 

Table 24 depicts Seq ID No., UnigeneID, UnigeneTitle, Pkey, Pred.Cell.Loc., and ExAccn 
for all of the sequences in Table 25. The information in Table 24 is linked by Seq ID No. to 
Table 25. 

Pkey: Unique Eos probeset identifier number 
ExAccn: Exemplar Accession number, Genbank accession number 
UnigeneID: Unigene number 
Unigene Title: Unigene gene title 
Pred.Cell.Loc.: Predicted Cellular Localization 
Seq. ID. No.: Sequence identification number found in Table 25 

Pkey ExAccn UnigeneID Unigene Title Pred.Cell.Loc. Seq. ID. No. 



449746 AI668594 Hs.176588 

407276 AI951118 Hs.326736 

415539 AI733881 Hs.72472 

400297 AI127076 Hs.334473 

450375 AA009647 Hs.8850 

102457 NM_001394 Hs.2359 

429170 NM_001394 Hs.2359 

424399 AI905687 Hs.2533 

422505 AL120862 Hs.124165 

449765 N92293 Hs.206832 

425692 D90041 Hs.155956 

426215 AW963419 Hs.155223 

439840 AW449211 Hs.105445 

410102 AW248508 Hs.279727 

429220 AW207206 Hs.136319 

416276 U41060 Hs.79136 

409079 W87707 Hs.82065 

442818 AK001741 Hs.8739 

442082 R41823 Hs.7413 

444381 BE387335 Hs.283713 

446163 AA026880 Hs.25252 

416636 N32536 Hs.42645 

442117 AW664964 Hs.128899 

433043 W57554 Hs.125019 

429353 AL117406 Hs.200102 

452190 H26735 Hs.91668 

446733 AA863360 Hs.26040 

452747 BE153855 Hs.61460 

423242 AL039402 Hs.125783 

417433 BE270266 Hs.82128 

432201 AI538613 Hs.298241 

423961 D13666 Hs.136348 

439569 AW602166 Hs.222399 

114480 BE066778 Hs.151678 
404561 

325372 NA 

112287 AB033064 Hs.334806 

335824 NA 

424735 U31875 Hs.272499 

400289 X07820 Hs.2258 

427585 D31152 Hs.179729 

429925 NM_000786 Hs.226213 

429441 AJ224172 Hs.204096 

421155 H87879 Hs.102267 

420931 AF044197 Hs.100431 

420813 X51501 Hs.99949 

452744 AI267652 Hs.30504 



ESTs, Weakly similar to CP4Y_HUMAN CYTOC 
Homo sapiens breast cancer antigen NY-BR 
BMP-RIB 

hypothetical protein DKFZp56401278 

a disintegrin and metalloproteinase doma 

dual specificity phosphatase 4 nuclear 

dual specificity phosphatase 4 nuclear 

aldehyde dehydrogenase 9 family, member cytoplasm 

ESTs 

ESTs, Moderately similar to ALU8_HUMAN A 

N-acetyltransferase 1 (arylamine N-acety 

stanniocalcin 2 

GDNF family receptor alpha 1 

Homo sapiens cDNA FLJ14035 fis, clone HE 

ESTs 

LIV-1 protein, estrogen regulated 
interleukin 6 signal transducer (gp130, 
hypothetical protein FLJ10879 
ESTs 

ESTs, Weakly similar to S64054 hypotheti 
Homo sapiens cDNA FLJ13603 fis, clone PL 
solute carrier family 16 (monocarboxylic 
ESTs 

lymphoid nuclear protein (LAF-4) mRNA 
ATP-binding cassette transporter MRP8 
Homo sapiens clone PP1498 unknown mRNA 
ESTs, Weakly similar to fatty acid omega 
Ig superfamily receptor LNIR 
DEME-6 protein 

5T4 oncofetal trophoblast glycoprotein 
Transmembrane protease, serine 3 
osteoblast specific factor 2 (fasciclin 
CEGP1 protein 

UDP-N-acetyl-alpha-D-galactosamine:polyp 
NM_014112*:Homo sapiens trichorhinophala mitochondria 
Phase2&3Exons nuclear 
KIAA1238 protein 

ENSP00000249072*:DJ222E13.1 (N-TERMINAL 
short-chain alcohol dehydrogenase family 
matrix metalloproteinase 10 (stromelysin 
collagen, type X, alpha 1 (Schmid metaph 
cytochrome P450, 51 (lanosterol 14-alpha 
lipophilin B (uteroglobin family member) 
lysyl oxidase 

small inducible cytokine B subfamily (Cy 
prolactin-induced protein nuclear 
Homo sapiens mRNA; cDNA DKFZp434E082 (fr 




ER 

extracellular 



Seq ID 1 & 2 
Seq ID 3 & 4 
Seq ID 5 & 6 
Seq ID 7 & 8 
Seq ID 9 & 10 
Seq ID 11 & 12 
Seq ID 11 & 12 
Seq ID 13 & 14 
Seq ID 15 & 16 
Seq ID 17 & 18 
Seq ID 19 & 20 
Seq ID 21 & 22 
Seq ID 23 & 24 
Seq ID 25 & 26 
Seq ID 27 & 28 
Seq ID 29 & 30 
Seq ID 31 & 32 
Seq ID 33 & 34 
Seq ID 35 & 36 
Seq ID 37 & 38 
Seq ID 39 & 40 
Seq ID 41 & 42 
Seq ID 43 & 44 
Seq ID 45 & 46 
Seq ID 47 & 48 
Seq ID 49 & 50 
Seq ID 51 & 52 
Seq ID 53 & 54 
Seq ID 55 & 56 
Seq ID 57 & 58 
Seq ID 59 & 60 
Seq ID 61 & 62 
Seq ID 63 & 64 
Seq ID 65 & 66 
Seq ID 67 & 68 
Seq ID 69 & 70 
Seq ID 71 & 72 
Seq ID 73 & 74 
Seq ID 75 & 76 
Seq ID 77 & 78 
Seq ID 79 & 80 
Seq ID 81 & 82 
Seq ID 83 & 84 
Seq ID 85 & 86 
Seq ID 87 & 88 
Seq ID 89 & 90 
Seq ID 91 & 92 
420757 X78592 Hs.99915 
42^905 NM_002497 Hs.153704 
429859 NM_007050 Hs.225952 
446921 AB012113 Hs.16530 
445537 AJ245671 Hs.12844 
428227 AA321649 Hs.2248 
424001 W67883 Hs.137476 
421727 Y13153 Hs.107318 
452838 U65011 Hs.30743 
419667 AU077005 Hs.92208 
414812 X72755 Hs.77367 
426320 W47595 Hs.169300 
422867 L32137 Hs.1584 
411558 M102670 Hs.70725 
417866 AW067903 Hs.82772 
428398 AI249368 Hs.98558 
431958 X63629 Hs.2877 
428722 U76456 Hs.190787 
412970 AB026436 Hs.177534 
421379 Y15221 Hs.103982 
415752 BE314524 Hs.78776 
444051 N48373 Hs.10247 
451110 AI955040 Hs.265398 
TABLE 24A 

Table 24A shows the accession numbers for those pkeys lacking unigene ID's for Table 24. 
For each probeset, we have listed the gene cluster number from which the oligonucleotides 
were designed. Gene clusters were compiled using sequences derived from Genbank ESTs 
and mRNAs. These sequences were clustered based on sequence similarity using Clustering 
and Alignment Tools (DoubleTwist, Oakland, California). The Genbank accession numbers 
for sequences comprising each cluster are listed in the "Accession" column. 

Pkey: Unique Eos probeset identifier number 
CAT number: Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT number Accession 

335824 CH22J197FGJ19JUINfCE 
325372 C12J1S 
TABLE 24B 

Table 24B shows the genomic positioning for those pkeys lacking unigene ID's and 
accession numbers in Table 24. For each predicted exon, we have listed the genomic 
sequence source used for prediction. Nucleotide locations of each predicted exon are also 
listed. 

Pkey: Unique number corresponding to an Eos probeset 
Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (GI) numbers. "Dunham I. et al." refers to the publication entitled "The DNA sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 
Nt_position: Indicates nucleotide positions of predicted exons. 

Pkey Ref Strand Nt_position 
404561 9795980 Minus 69039-70100 
Table 25 

The 69 gene sequences identified to be overexpressed in breast cancer may be used to identify coding regions from the public DNA databases (nr and htgs in 
Genbank). The sequences may be used either to identify genes that encode known proteins, or to predict the coding regions from genomic 
DNA using exon prediction algorithms, such as FGENESH (Salamov and Solovyev, 2000, Genome Res. 10:516-522). 
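
Each DNA entry in Table 25 states a coding sequence range (for example "Coding sequence: 1-1518"), and the paired protein entry gives its translation. The relationship between the two can be sketched as follows. This is an illustrative sketch only, not FGENESH and not the procedure used to prepare the table: it assumes the standard genetic code, 1-based inclusive coordinates, and a listing from which position numbers and spacing can be stripped. The function names are hypothetical. Any character other than A, C, G or T is dropped, so a listing with damaged characters may shift the reading frame.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(listing):
    """Keep only A, C, G, T; drops spaces, line breaks and position numbers."""
    return "".join(ch for ch in listing.upper() if ch in "ACGT")


def translate_cds(listing, start, end):
    """Translate the coding region start..end (1-based, inclusive).

    Stop codons are written as '*'.
    """
    cds = clean_sequence(listing)[start - 1:end]
    if len(cds) % 3 != 0:
        raise ValueError("coding region length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
```

Applied to the first 60 nucleotides of Seq ID NO: 1, `translate_cds` returns the first 20 residues listed for Seq ID NO: 2 (MEPSWLQELM AHPFLLLILL), which gives a quick consistency check between a DNA entry and its protein entry.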

Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 1-1518 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 

I I I I I I 

ATGGAGCCCT CCTGGCTTCA GGAACTCATG GCTCACCCCT TCTTGCTGCT GATCCTCCTC 60 
TGCATGTCTC TGCTGCTGTT TCAGGTAATC AGGTTGTACC AGAGGAGGAG ATGGATGATC 120 
AGAGCCCTGC ACCTGTTTCC TGCACCCCCT GCCCACTGGT TCTATGGCCA CAAGGAGTTT 180 
TACCCAGTAA AGGAGTTTGA GGTGTATCAT AAGCTGATGG AAAAATACCC ATGTGCTGTT 240 
CCCTTGTGGG TTGGACCCTT TACGATGTTC TTCAGTGTCC ATGACCCAGA CTATGCCAAG 300 
ATTCTCCTGA AAAGACAAGA TCCCAAAAGT GCTGTTAGCC ACAAAATCCT TGAATCCTGG 360 
GTTGGTCGAG GACTTGTGAC CCTGGATGGT TCTAAATGGA AAAAGCACCG CCAGATTGTG 420 
AAACCTGGCT TCAACATCAG CATTCTGAAA ATATTCATCA CCATGATGTC TGAGAGTGTT 480 
CGGATGATGC TGAACAAATG GGAGGAACGC ATTGCCCAAA ACTCACGTCT GGAGCTCTTT 540 
CAACATGTCT CCCTGATGAC CCTGGACAGC ATCATGAAGT GTGCCTTCAG CCACCAGGGC 600 
AGCATCCAGT TGGACAGTAC CCTGGACTCA TACCTGAAAG CAGTGTTCAA CCTTAGCAAA 660 
ATCTCCAACC AGCGCATGAA CAATTTTCTA CATCACAACG ACCTGGTTTT CAAATTCAGC 720 
TCTCAAGGCC AAATCTTTTC TAAATTTAAC CAAGAACTTC ATCAGTTCAC AGAGAAAGTA 780 
ATCCAGGACC GGAAGGAGTC TCTTAAGGAT AAGCTAAAAC AAGATACTAC TCAGAAAAGG 840 
CGCTGGGATT TTCTGGACAT ACTTTTGAGT GCCAAAAGCG AAAACACCAA AGATTTCTCT 900 
GAAGCAGATC TCCAGGCTGA AGTGAAAACG TTCATGTTTG CAGGACATGA CACCACATCC 960 
AGTGCTATCT CCTGGATCCT TTACTGCTTG GCAAAGTACC CTGAGCATCA GCAGAGATGC 1020 
CGAGATGAAA TCAGGGAACT CCTAGGGGAT GGGTCTTCTA TTACCTGGGA ACACCTGAGC 1080 
CAGATGCCTT ACACCACGAT GTGCATCAAG GAATGCCTCC GCCTCTACGC ACCGGTAGTA 1140 
AACATATCCC GGTTACTCGA CAAACCCATC ACCTTTCCAG ATGGACGCTC CTTACCTGCA 1200 
GGAATAACTG TGTTTATCAA TATTTGGGCT CTTCACCACA ACCCCTATTT CTGGGAAGAC 1260 
CCTCAGGTCT TTAACCCCTT GAGATTCTCC AGGGAAAATT CTGAAAAAAT ACATCCCTAT 1320 
GCCTTCATAC CATTCTCAGC TGGATTAAGG AACTGCATTG GGCAGCATTT TGCCATAATT 1380 
GAGTGTAAAG TGGCAGTGGC ATTAACTCTG CTCCGCTTCA AGCTGGCTCC AGACCACTCA 1440 
AGGCCTCCCC AGCCTGTTCG TCAAGTTGTC CTCAAGTCCA AGAATGGAAT CCATGTGTTT 1500 
GCAAAAAAAG TTTGCTAATT TTAAGTCCTT TCGTATAAGA ATTAATGAGA CAATTTTCCT 1560 
ACCAAAGGAA GAACAAAAGG ATAAATATAA TACAAAATAT ATGTATATGG TTGTTTGACA 1620 
AATTATATAA CTTAGGATAC TTCTGACTGG TTTTGACATC CATTAACAGT AATTTTAATT 1680 
TCTTTCCTGT ATCTGGTGAA ACCCACAAAA ACACCTQAAA AAACTCAACC TGACTTCCAC 1740 
TGCGAAGGGA AATTATTGGT TTGTGTAACT AGTGGTAGAG TGGCTTTCAA GCATAGTTTG 1800 
ATCAAAACTC CACTCAGTAT CTGCATTACT TTTATCTCTG CAAATATCTG CATGATAGCT 1860 
TTATTCTCAG TTATCTTTCC CCAATAATAA AAAA 



Seq ID NO: 2 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51 

MEPSWLQELM AHPFLLLILL CMSLLLFQVI RLYQRRRWMI RALHLFPAPP AHWFYGHKEF 60 
YPVKEFEVYH KLMEKYPCAV PLWVGPFTMF FSVHDPDYAK ILLKRQDPKS AVSHKILESW 120 
VGRGLVTLDG SKWKKHRQIV KPGFNISILK IFITMMSESV RMMLNKWEER IAQNSRLELF 180 
QHVSLMTLDS IMKCAFSHQG SIQLDSTLDS YLKAVFNLSK ISNQRMNNFL HHNDLVFKFS 240 
SQGQIFSKFN QELHQFTEKV IQDRKESLKD KLKQDTTQKR RWDFLDILLS AKSENTKDFS 300 
EADLQAEVKT FMFAGHDTTS SAISWILYCL AKYPEHQQRC RDEIRELLGD GSSITWEHLS 360 
QMPYTTMCIK ECLRLYAPVV NISRLLDKPI TFPDGRSLPA GITVFINIWA LHHNPYFWED 420 
PQVFNPLRFS RENSEKIHPY AFIPFSAGLR NCIGQHFAII ECKVAVALTL LRFKLAPDHS 480 
RPPQPVRQVV LKSKNGIHVF AKKVC 

Seq ID NO: 3 DNA sequence 

Nucleic Acid Accession #: NM_052997 

Coding sequence: 100-4125 (underlined sequences correspond to start and stop codons) 
I 11 21 31 41 51 

CTAGTCTATA CCAGCAACGA CTCCTACATC GTCCACTCTG GGGATCTTAG AAAGATCCAT 60. 
AAAGCTGCCT CCCGGGGACA AGTCCGGAAG CTGGAGAAG A TG ACAAAGAG GAAGAAGACC 120 
ATCAACCTTA ATATACAAGA CGCCCAGAAG AGG ACTGCTC TACACTGGGC CTGTGTCAAT 1 80 
GGCCATG AGG AAGTAGTAAC ATTTCTGGTA G ACAGA AAGT GCCAGCTTGA CGTCCTTGAT 240 
GGCGAACACA GGACACCTCT GATGAAGGCT CTACAATGCC ATCAGGAGGC TTGTGCAAAT 300 
ATTCTGATAG ATTCTGGTGC CGATATAAAT CTCGTAGATG TGTATGGCAA CATGGCTCTC 360 
CATTATGCTG TTTATAGTGA GATTTTGTCA GTGGTGGCAA AACTGCTGTC CCATGGTGCA 420 
GTCATCG AAG TGCACAACAA GGCTAGOCTC ACACCACTTT TACTATCCAT AACGAAAAGA 480 
AGTGAGCAAA TTGTGGAATT TTTGCTGATA AAAAATGCAA ATGCGAATGC AGTTAATAAG 540 
TATAAATGCA CAGCCCTCAT GCTTGCTGTA TGTCATGGAT CATCAGAGAT AGTTGGCATG 600 
CTTCTTCAGC AAAATGTTG A CGTCTTTGCT OCAG ATATAT GTGGAGTAAC TGCAGAACAT 660 
TATGCTGTTA CTTGTGGATT TCATCACATT CATG AACAAA TTATGG AATA TATACG AAAA 720 
TTATCTAAAA ATCATCAAAA TACCAATCCA GAAGGAACAT CTGCAGG AAC ACCTGATGAG 780 
GCTGCACCCT TOGCGGAAAG AACACCTGAC ACAGCTG AAA GCTTGGTGGA AAAAACACCT 840 
GATGAGGCTG CACCCTTGGT GGAAAGAACA CCTGACACGG CTGAAAGCTT GGTGGAAAAA 900 
ACACCTGATG AGGCTGCATC CTTGGTGGAG GGAACATCTG ACAAAATTCA ATGTTTGGAG 960 
AAAGCGACAT CTGGAAAGTT CGAACAGTCA GCAGAAGAAA CACCTAGGGA AATTACGAGT 1020 
CCTGCAAAAG AAACATCTGA GAAATTTACG TGGCCAGCAA AAGGAAGACC TAGGAAGATC 1080 
GCATGGG AG A A AAAAG AAG A CACACCTAGG GAAATTATGA GTCCCGCAAA AGAAACATCT 1 140 
GAGAAATTTA CGTGGGCAGC AAAAGGAAGA CCTAGG AAGA TCGCATGGGA GAAAAAAGAA 1200 
ACACCTGTAA AGACTGGATG CGTGGCAAGA GTAACATCTA ATAAAACTAA AGTTTTGGAA 1260 
AAAGGAAGAT CTAAGATGAT TGCATGTCCT ACAAAAGAAT CATCTACAAA AGCAAGTGCC 1320 
AATGATCAGA GGTTCCCATC AGAATCCAAA CAAGAGGAAG ATGAAGAATA TTCTTGTGAT 1380 
TCTCGG AGTC TCTTTG AG AG TTCTGCAAAG ATTCAAGTGT GTATACCTGA GTCTATATAT 1440 
CAAAAAGTAA TGGAGATAAA TAGAGAAGTA GAAGAGCCTC CTAAGAAGCC ATCTGCCTTC 1500 
AAGCCTGCCA TTGAAATGCA AAACTCTGTT CCAAATAAAG CCTTTGAATT GAAGAATGAA 1560 
CAAACATTGA GAGCAGATCC GATGTTCCCA CCAGAATCCA AACAAAAGGA CTATGAAGAA 1620 
AATTCTTGGG ATTCTGAGAG TCTCTGTGAG ACTGTTTCAC AGAAGGATGT GTGTTTACCC 1680 
AAGGCTACAC ATCAAAAAGA AATAGATAAA ATAAATOGAA AATTAGAAGA GTCTCCTAAT 1740 
AAAGATGGTC TTCTGAAGGC TACCTGCGGA ATGAAAGTTT CTATTCCAAC TAAAGCCTTA 1800 
GAATTGAAGG ACATGCAAAC TTTCAAAGCG GAGCCTCCGG GGAAGCCATC TGOCTTCGAG 1860 
CCTGCCACTG AAATGCAAAA GTCTGTCCCA AATAAAGCCT TGGAATTGAA AAATGAACAA 1920 
ACATGGAGAG CAGATGAG AT ACTCCCATCA GAATCCAAAC AAAAGGACTA TGAAGAAAAT 1980 
TCTTGGGATA CTGAGAGTCT CTGTGAGACT GTTTCACAGA AGGATGTGTG TTTACCCAAG 2040 
GCTGCGCATC AAAAAGAAAT AGATAAAATA AATGGAAAAT TAGAAGGGTC TCCTGTTAAA 21 00 
GATGGTCTTC TGAAGGCTAA CTGCGGAATG AAAGTTTCTA TTCCAACTAA AGCCTTAGAA 2160 
TTGATGGACA TGCAAACTTT CAAAGCAGAG CCTCCCGAGA AGCCATCTGC CTTCGAGCCT 2220 
GCCATTGAAA TGCAAAAGTC TGTTCCAAAT AAAGCCTTGG AATTGAAG AA TGAACAAACA 2280 
TTGAGAGCAG ATGAGATACT CCCATCAGAA TCCAAACAAA AGGACTATGA AGAAAGTTCT 2340 
TGGGATTCTG AGAGTCTCTG TGAGACTGTT TCACAGAAGG ATGTGTGTTT ACCCAAGGCT 2400 
ACACATCAAA AAGAAATAG A TAAAATAAAT GGAAAATTAG AAGAGTCTCC TGATAATGAT 2460 
GGTTTTCTG A AGGCTCCCTG CAGAATGAAA GTTTCTATTC CAACTAAAGC CTTAGAATTG 2520 
ATGGACATGC AAACTTTCAA AGCAGAGCCT CCCG AGAAGC CATCTGCCTT CGAGCCTGCC 2580 
ATTGAAATGC AAAAGTCTGT TCCA AATAAA GCCTTGGAAT TGAAGAATGA ACAAACATTG 2640 
AGAGCAGATC AGATGTTCCC TTCAGAATCA AAACAAAAGA AGGTTGAAGA AAATTCTTGG 2700 
G ATTCTGAG A GTCTCCGTG A GACTGTTTCA CAGAAGGATG TGTGTGTACC CAAGGCTACA 2760 
CATCAAAAAG AAATGGATAA AATAAGTGGA AAATTAGAAG ATTCAACTAG CCTATCAAAA 2820 
ATCTTGGATA CAGTTCATTC TTGTGAAAGA GCAAGGGAAC TTCAAAAAGA TCACTGTGAA 2880 
CAACGTACAG GAAAAATGG A ACAAATGAAA AAGAAGTTTT GTGTACTGAA AAAGAAACTG 2940 
TCAGAAGCAA AAG AAATAAA ATCACAGTTA GAGAACCAAA AAGTTAAATG GGAACAAGAG 3000 
CTCTGCAGTG TOAGATTGAC TTTAAACCAA GAAGAAGAGA AGAGAAGAAA TGCCGATATA 3060 
TTAAATGAAA AAATTAGGGA AGAATTAGGA AGAATCGAAG AGCAGCATAG GAAAGAGTTA 3120 
GAAGTGAAAC AACAACTTGA ACAGGCTCTC AGAATACAAG ATATAGAATT GAAGAGTGTA 3180 
GAAAGTAATT TGAATCAGGT TTCTCACACT CATGAAAATG AAAATTATCT CTTACATGAA 3240 
AATTGCATGT TGAAAAAGGA AATTGCCATG CTAAAACTGG AAATAGCCAC ACTGAAACAC 3300 
CAATACCAGG AAAAGGAAAA TAAATACTTT GAGGACATTA AGATTTTAAA AGAAAAGAAT 3360 
GCTGAACTTC AGATGACCCT AAAACTGAAA GAGGAATCAT TAACTAAAAG GGCATCTCAA 3420 
TATAGTGGGC AGCTTAAAGT TCTGATAGCT GAG AACACAA TGCTCACTTC TAAATTGAAG 3480 
G AAAA AC A AG ACAAAGAAAT ACTAG AGGCA GAAATTGAAT CACACCATCC TAGA CTGG CT 3540 
TCTGCTGTAC AAGACCATGA TCAAATTGTG ACATCAAGAA AA AGTC AAG A ACCTGCTTTC 3600 
CACATTGCAG GAGATGCTTG TTTGCAAAGA AAAATGAATG TTGATGTGAG TAGTAGGATA 3660 
TATAACAATG AGGTGCTCCA TCAACCACTT TCTGAAGCTC AAAGGAAATC CAAAAGCCTA 3720 
AAAATTAATC TCAATTATGC AGG AGATGCT CT AAG AG AAA ATACATTGGT TTCAGAACAT 3780 
GCACAAAG AG ACCAACGTG A AACACAGTGT CAAATG AAGG AAGCTGAACA CATGTATCAA 3840 
AACGAACAAG ATAATGTGAA CAAACACACT GAACAGCAGG AGTCTCTAGA TCAGAAATTA 3900 
TTTCAACTAC AAAGCAAAAA TATGTGGCTT CAACAGCAAT TAGTTCATGC ACATAAGAAA 3960 
GCTGACAACA AAAGCAAGAT AACAATTGAT ATTCATTTTC TTGAGAGGAA AATGCAACAT 4020 
CATCTCCTAA AAGAGAAAAA TGAGG AGATA TTTAATTACA ATAACCATTT AAAAAACCGT 4080 
ATATATCAAT ATGAAAAAGA GAAAGCAGAA ACAGAAAACT C ATG A G AG AC AAGCAGTAAG 4140 
AAACTTCTTT TGGAGAAACA ACAGACCAGA TCTTTACTCA CAACTCATGC TAGGAGGCCA 4200 
GTCCTAGCAT CACCTTATGT TGAAAATCTT ACCAATAGTC TGTGTCAACA GAATACTTAT 4260 
TTTAGAAGAA AAATTCATGA TTTCTTCCTG AAGCCTACAG ACATAAAATA ACAGTGTGAA 4320 
GAATTACTTG TTCACG AATT GCATAAAGCT GCACAGGATT CCCATCTACC CTG ATGATGC 4380 
AGCAGACATC ATTCAATCCA ACCAGAATCT CGCTCTGCAC TCCAGCCTAG GTGACAGAGT 4440 
GAGACTCCAC CTCGGAAA 



Seq ID NO: 4 Protein sequence: 
Protein Accession #: NP_443723.1 

1 11 21 31 41 51 

MTKRKKTTNL NIQDAQKRTa'lHWACVNGHE EWTFLVDRK CQLDVLDGEH RTPLMKALQC 60 
HQEACANILI DSGADINLVD VYGNMALHYA VYSEILSVVA KLLSHGAVIE VHNKASLTPL 120 
LLSITKRSEQ IVEFLLIKNA NANAVNKYKC TALMLAVCHG SSE1VGMLLQ QNVDVFAADI 180 
CGVTAEHYAV TCGFHHIHEQ IMEYIRKLSK NHQNTNPEGT SAGTPDEAAP LAERTPDTAE 240 
SLVEKTPDEA APLVERTPDT AESLVEKTPD EAASLVEGTS DKIQCLEKAT SGKFEQSAEE 300 
TPREITSPAK ETSEKFTWPA KGRPRKIAWE KKEDTPREIM SPAKETSEKF TWAAKGRPRK 360 
IAWEKKETPV KTGCVARVTS NKTKVLEKGR SKMIACPTKE SSTKASANDQ RFPSESKQEE 420 
DEEYSCDSRS LFESSAKIQV CIPESIYQKV MEINREVEEP PKKPSAFKPA IEMQNSVPNK 480 
AFELKNEQTL RADPMFPPES KQKDYEENSW DSESLCETVS QKDVCLPKAT HQKEIDKINQ 540 
KLEESPNKDG LLKATCGMKV SIPTKALELK DMQTFKAEPP GKPSAFEPAT EMQKSVPNKA 600 
LELKNEQTWR ADEILPSESK QKDYEENSWD TESLCETVSQ KDVCLPKAAH QKEIDKJNGK 660 
LEGSPVKDGL UCANCGMKVS IPTKALELMD MQTFKAEPPE KPSAFEPAIE MQKSVPNKAL 720 
ELKNEQTLRA DEILPSESKQ KDYEESSWDS ESLCETVSQK DVCLPKATHQ KEIDKINGKL 780 
EESPDNDGFL KAPCRMKVSI PTKALELMDM QTFKAEPPEK PSAFEPAEEM QKSVFNKALE 840 ■ 
LKNEQTLRAD QMFPSESKQK KVEENSWDSE 5LRETVSQKD VCVPKATHQK EMDKISGECLE 900 
DSTSLSKILD TVHSCERARE LQKDHCEQRT GKMEQMKKKF CVLKKKLSEA KEDCSQLENQ 960 
KVKWEQELCS VRLTLNQEEE KRRNADILNE KIREELGRIE EQHRKELEVK QQLEQALRIQ 1020 
DIELKSVESN LNQVSHTHEN ENYLLHENCM LKKEIAMLKL EIATUCHQYQ EKENKYFEDI 1080 
KILKEKNAEL QMT^KLKEES LTKRASQYSG QLKVLIAENT MLTSKLKEKQ DKEILEAEIE 1140 
SHHPRLASAV QDHDQIVTSR KSQEPAFHIA GDACLQRKMN VDVSSTIYNN EVLHQPLSEA 1200 
QRKSKSLKIN LNYAGDALRE NTLVSEHAQR DQRETQCQMK EAEHMYQNEQ DNVNKHTEQQ 1260 
ESLDQKLFQL QSKNMWLQQQ LVHAHKKADN KSKJTIDIHF LERKMQHHLL KEKNEEIFNY 1 320 
NNHLKNRIYQ YEKEKAETEN S 

Seq ID NO: 5 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 273-1785 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 . 

CGCGGGGCGC GG AGTCGGCG GGGCCTCGCG GGACGCGGGC AGTGCGGAGA CCGCGGCGCT 60 
GAGGACGCGG GAGCCGGGAG CGCACGCGCG GGGTGGAGTT CAGCCTACTC TTTCTTAGAT 120 
GTGAAAGGAA AGGAAGATCA TTTCATGCCT TGTTGATAAA GGTTCAGACT TCTGCTGATT 180 
CATAACCATT TGGCTCTGAG CTATGACAAG AGAGGAAACA AAAAGTTAAA CTTACAAGCC 240 
TGCCATAAGT GAGAAGCAAA CTTCCTTGAT AACATGCTTT TGCG AAGTGC AGG AAAATTA 300 
AATGTGGGCA CCAAGAAAGA GGATGGTGAG AGTACAGCCC CCACCCCCCG TCCAAAGGTC 360 
TTGCGTTGTA AATGCCACCA CCATTGTCCA GAAGACTCAG TCAACAATAT TTGCAGCACA 420 
GACGGATATT GTTTCACGAT GATAGAAG AG GATGACTCTG GGTTGCCTGT GGTCACTTCT 480 
GGTTGCCTAG GACTAOAAGG CTCAGATTTT CAGTGTCGGG ACACTCCCAT TCCTCATCAA 540 
AGAAG ATCAA TTGAATGCTG CACAGAAAGG AACGAATGTA ATAAAGACCT A CACO CTACA 600 
CTGCCTCCAT TGAAAAACAG AGATTTTGTT GATGGACCTA TAC ACCAC AG GGCTTTACTT 660 
ATATCTGTGA CTGTCTGTAG TTTGCTCTTG GTCCTTATCA TATTATTTTG TTACTTCCGG 720 
TATAAAAG AC AAGAAACCAG ACCTCGATAC AGCATTGGGT TAG AACAGG A TGAAACTTAC 780 
ATTCCTCCTG GAG AATCCCT G AG AGACTTA ATTG AGCAGT CTCAGAGCTC AGG AAGTGG A 840 
TCAGGCCTCC CTCTGCTGGT CCAAAGGACT ATAGCTAAGC AGATTCAGAT GGTGAAACAG 900 
ATTGGAAAAG GTCGCTATGG GGAAGTTTGG ATGGGAAAGT GGCGTGGCGA AAAGGTAGCT 960 
GTGAAAGTGT TCTTCACCAC AGAGGAAGCC AGCTGGTTCA GAGAGACAGA AATATATCAG 1020 
ACAGTGTTGA TGAGGCATGA AAACATTTTG GGTTTCATTG CTGCAGATAT CAAAGGGACA 1080 
GGGTCCTGGA CCCAGTTGTA CCTAATCACA GACTATCATG AAAATGGTTC CCTTTATGAT 1 140 
TATCTGAAGT CCACCACCCT AOACGCTAAA TCAATGCTGA AGTTAGCCTA CTCTTCTGTC 1200 
AGTGGCTTAT GTCATTTACA CACAGAAATC TTTAGTACTC AAGGCAAACC AGCAATTGCC 1260 
CATCGAGATC TGAAAAGTAA AAACATTCTG GTGAAGAAAA ATGGAACTTG CTGTATTGCT 1320 
GACCTGGGCC TGGCTGTTAA ATTTATTAGT GATACAAATG AAGTTGACAT ACCACCTAAC 1380 
ACTCGAGTTG GCACCAAACG CTATATGCCT CCAGAAGTGT TGG ACGAGAG CTTG AACAGA 1440 
AATC ACTTCC AGTCTTACAT CATGGCTG AC ATGTATAGTT TTGGCCTCAT CCTTTGGGAG 1 500 
GTTGCTAGGA GATGTGTATC AGGAGGTATA GTGGAAGAAT ACCAGCTTCC TTATCATGAC 1560 
CTAGTGCCCA GTGACCCCTC TTATGAGGAC ATGAGGG AGA TTGTGTGCAT CAAGAAGTTA 1620 
CGCCCCTCAT TCCCAAACCG GTGGAGCAGT GATGAGTGTC TAAGGCAGAT GGGAAAACTC 1680 
ATGACAGAAT GCTGGGCTCA CAATCCTGCA TCAAGGCTGA CAGCCCTGCG GGTTAAGAAA 1740 
ACACTTGCCA AAATGTCAGA GTCCCAGGAC ATTAAACTCT GATAGGAGAG GAAAAGTAAG 1800
CATCTCTGCA GAAAGCCAAC AGGTACTCTT CTGTTTGTGG GCAGAGCAAA AGACATCAAA 1860 
TAAGCATCCA CAGTACAAGC CTTGAACATC GTCCTGCTTC CCAGTGGGTT CAGACCTCAC 1920 
CTTTCAGGGA GCGACCTGGG CAAAGACAGA GAAGCTCCCA GAAGGAGAGA TTGATCCGTG 1980
TCTOTTTGTA GGCGGAGAAA CCGTTGGGTA ACTTGTTCAA GATATGATGC AT



Seq ID NO: 6 Protein sequence:
Protein Accession #: none found 



1 11 21 31 41 51
| | | | | |

MLLRSAGKLN VGTKKEDGES TAPTPRPKVL RCKCHHHCPE DSVNNICSTD GYCFTMIEED 60
DSGLPVVTSG CLGLEGSDFQ CRDTPIPHQR RSIECCTERN ECNKDLHPTL PPLKNRDFVD 120
GPIHHRALLI SVTVCSLLLV LIILFCYFRY KRQETRPRYS IGLEQDETYI PPGESLRDLI 180
EQSQSSGSGS GLPLLVQRTI AKQIQMVKQI GKGRYGEVWM GKWRGEKVAV KVFFTTEEAS 240 
WFRETEIYQT VLMRHENILG FIAADIKGTG SWTQLYLITD YHENGSLYDY LKSTTLDAKS 300 
MLKLAYSSVS GLCHLHTEIF STQGKPAIAH RDLKSKNILV KKNGTCCIAD LGLAVKFISD 360
TNEVDIPPNT RVGTKRYMPP EVLDESLNRN HFQSYIMADM YSFGLILWEV ARRCVSGGIV 420
EEYQLPYHDL VPSDPSYEDM REIVCIKKLR PSFPNRWSSD ECLRQMGKLM TECWAHNPAS 480
RLTALRVKKT LAKMSESQDI KL

Seq ID NO: 7 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 482-3007 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

AACTOAGCTA ACAAGAAATA CTAGAAAAGG AGGAAGGAGA ACATTGCTGC AGCTTGGATC 60 
TACAACCTAA GAAAGCAAGA GTGATCAATC TCAGCTCTGT TAAACATCTT GTTTACTTAC 120 
TGCATTCAGC AGCTTGCAAA TGGTTAACTA TATGCAAAAA AGTCAGCATA GCTGTGAAGT 180
ATGCCGTGAA TTTTAATTGA GGGAAAAAGG GACAATTGCT TCAGGATGCT CTAGTATGCA 240
CTCTGCTTGA AATATTTTCA ATGAAATGCT CAGTATTCTA TCTTTGACCA GAGGTTTTAA 300 
CTTTATGAAG CTATGGGACT TGACAAAAAO TGATATTTGA GAAGAAAGTA CGCAGTGGTT 360 
GGTGTTTTCT TTTTTTTAAT AAAGGAATTG AATTACTTTG AACACCTCTT CCAGCTGTGC 420 
ATTACAGATA ACGTCAGGAA GAGTCTCTGC TTTACAGAAT CGGATTTCAT CACATGACAA 480 
CATGAAGCTG TGGATTCATC TCTTTTATTC ATCTCTCCTT GCCTGTATAT CTTTACACTC 540
CCAAACTCCA GTGCTCTCAT CCAGAGGCTC TTGTGATTCT CTTTGCAATT GTGAGGAAAA 600
AGATGGCACA ATGCTAATAA ATTGTGAAGC AAAAGGTATC AAGATGGTAT CTGAAATAAG 660
TGTGCCACCA TCACGACCTT TCCAACTAAG CTTATTAAAT AACGGCTTGA CGATGCTTCA 720 
CACAAATGAC TTTTCTGGGC TTACCAATGC TATTTCAATA CACCTTGGAT TTAACAATAT 780 
TGCAGATATT GAGATAGGTG CATTTAATGG CCTTGGCCTC CTGAAACAAC TTCATATCAA 840 
TCACAATTCT TTAGAAATTC TTAAAGAGGA TACTTTCCAT GGACTGGAAA ACCTGGAATT 900
CCTGCAAGCA GATAACAATT TTATCACAGT GATTGAACCA AGTGCCTTTA GCAAGCTCAA 960 
CAGACTCAAA GTGTTAATTT TAAATGACAA TGCTATTGAG AGTCTTCCTC CAAACATCTT 1020 
CCGATTTGTT CCTTTAACCC ATCTAGATCT TCGTGGAAAT CAATTACAAA CATTGCCTTA 1080 
TGTTGGTTTT CTCGAACACA TTGGCCGAAT ATTGGATCTT CAGTTGGAGG ACAACAAATG 1140
GGCCTGCAAT TGTGACTTAT TGCAGTTAAA AACTTGGTTG GAGAACATGC CTCCACAGTC 1200 
TATAATTGGT GATGTTGTCT GCAACAGCCC TCCATTTTTT AAAGGAAGTA TACTCAGTAG 1260 
ACTAAAGAAG GAATCTATTT GCCCTACTCC ACCAGTGTAT GAAGAACATG AGGATCCTTC 1320 
AGGATCATTA CATCTGGCAG CAACATCTTC AATAAATGAT AGTCGCATGT CAACTAAGAC 1380 
CACGTCCATT CTAAAACTAC CCACCAAAGC ACCAGGTTTG ATACCTTATA TTACAAAGCC 1440 
ATCCACTCAA CTTCCAGGAC CTTACTGCCC TATTCCTTGT AACTGCAAAG TCCTATCCCC 1500 
ATCAGGACTT CTAATACATT GTCAGGAGCG CAACATTGAA AGCTTATCAG ATCTGAGACC 1560 
TCCTCCGCAA AATCCTAGAA AGCTCATTCT AGCGGGAAAT ATTATTCACA GTTTAATGAA 1620
GTCTGATCTA GTGGAATATT TCACTTTGGA AATGCTTCAC TTGGGAAACA ATCGTATTGA 1680 
AGTTCTTGAA GAAGGATCGT TTATGAACCT AACGAGATTA CAAAAACTCT ATCTAAATGG 1740 
TAACCACCTG ACCAAATTAA GTAAAGGCAT GTTCCTTGGT CTCCATAATC TTGAATACTT 1800 
ATATCTTGAA TACAATGCCA TTAAGGAAAT ACTGCCAGGA ACCTTTAATC CAATGCCTAA 1860 
ACTTAAAGTC CTGTATTTAA ATAACAACCT CCTCCAAGTT TTACCACCAC ATATTTTTTC 1920 
AGGGGTTCCT CTAACTAAGG TAAATCTTAA AACAAACCAG TTTACCCATC TACCTGTAAG 1980 
TAATATTTTG GATGATCTTG ATTTACTAAC CCAGATTGAC CTTGAGGATA ACCCCTGGGA 2040
CTGCTCCTGT GACCTGGTTG GACTGCAGCA ATGGATACAA AAGTTAAGCA AGAACACAGT 2100 
GACAGATGAC ATCCTCTGCA CTTCCCCCGG GCATCTCGAC AAAAAGGAAT TGAAAGCCCT 2160 
AAATAGTGAA ATTCTCTGTC CAGGTTTAGT AAATAACCCA TCCATGCCAA CACAGACTAG 2220 
TTACCTTATG GTCACCACTC CTGCAACAAC AACAAATACG GCTGATACTA TTTTACGATC 2280 
TCTTACGGAC GCTGTGCCAC TGTCTGTTCT AATATTGGGA CTTCTGATTA TGTTCATCAC 2340
TATTGTTTTC TGTGCTGCAG GGATAGTGGT TCTTGTTCTT CACCGCAGGA GAAGATACAA 2400
AAAGAAACAA GTAGATGAGC AAATGAGAGA CAACAGTCCT GTGCATCTTC AGTACAGCAT 2460
GTATGGCCAT AAAACCACTC ATCACACTAC TGAAAGACCC TCTGCCTCAC TCTATGAACA 2520
GCACATGGTG AGCCCCATGG TTCATGTCTA TAGAAGTCCA TCCTTTGGTC CAAAGCATCT 2580 
GGAAGAGGAA GAAGAGAGGA ATGAGAAAGA AGGAAGTGAT GCAAAACATC TCCAAAGAAG 2640
TCTTTTGGAA CAGGAAAATC ATTCACCACT CACAGGGTCA AATATGAAAT ACAAAACCAC 2700
GAACCAATCA ACAGAATTTT TATCCTTCCA AGATGCCAGC TCATTGTACA GAAACATTTT 2760
AGAAAAAGAA AGGGAACTTC AGCAACTGGG AATCACAGAA TACCTAAGGA AAAACATTGC 2820 
TCAGCTCCAG CCTGATATGG AGGCACATTA TCCTGGAGCC CACGAAGAGC TGAAGTTAAT 2880 
GGAAACATTA ATGTACTCAC GTCCAAGGAA GGTATTAGTG GAACAGACAA AAAATGAGTA 2940 
TTTTGAACTT AAAGCTAATT TACATGCTGA ACCTGACTAT TTAGAAGTCC TGGAGCAGCA 3000
AACATAGATG GAGAGTTTGA GGGCTTTCGC AGAAATGCTG TGATTCTGTT TTAAGTCCAT 3060
ACCTTGTAAA TAAGTGCCTT ACGTGAGTGT GTCATCAATC AGAACCTAAG CACAGCAGTA 3120 
AACTATGGGG AAAAAAAAAG AAGAAGAAAA GAAACTCAGG GATCACTGGG AGAAGCCATG 3180 
GCATTATCTT CAGGCAATTT AGTCTGTCCC AAATAAAATC AATCCTTGCA TGTAAATC



Seq ID NO: 8 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51
| | | | | |

MKLWIHLFYS SLLACISLHS QTPVLSSRGS CDSLCNCEEK DGTMLINCEA KGIKMVSEIS 60
VPPSRPFQLS LLNNGLTMLH TNDFSGLTNA ISIHLGFNNI ADIEIGAFNG LGLLKQLHIN 120 
HNSLEILKED TFHGLENLEF LQADNNFITV IEPSAFSKLN RLKVLILNDN AIESLPPNIF 180
RFVPLTHLDL RGNQLQTLPY VGFLEHIGRI LDLQLEDNKW ACNCDLLQLK TWLENMPPQS 240 
IIGDVVCNSP PFFKGSILSR LKKESICPTP PVYEEHEDPS GSLHLAATSS INDSRMSTKT 300
TSILKLPTKA PGLIPYITKP STQLPGPYCP IPCNCKVLSP SGLLIHCQER NIESLSDLRP 360
PPQNPRKLIL AGNIIHSLMK SDLVEYFTLE MLHLGNNRIE VLEEGSFMNL TRLQKLYLNG 420
NHLTKLSKGM FLGLHNLEYL YLEYNAIKEI LPGTFNPMPK LKVLYLNNNL LQVLPPHIFS 480 
GVPLTKVNLK TKQFTHLPVS NILDDLDLLT QIDLEDNPWD CSCDLVGLQQ WIQKLSKNTV 540
TDDILCTSPG HLDKKELKAL NSEILCPGLV NNPSMPTQTS YLMVTTPATT TNTADTILRS 600
LTDAVPLSVL ILGLLIMFIT IVFCAAGIVV LVLHRRRRYK KKQVDEQMRD NSPVHLQYSM 660
YGHKTTHHTT ERPSASLYEQ HMVSPMVHVY RSPSFGPKHL EEEEERNEKE GSDAKHLQRS 720 
LLEQENHSPL TGSNMKYKTT NQSTEFLSFQ DASSLYRNIL EKERELQQLG ITEYLRKNIA 780
QLQPDMEAHY PGAHEELKLM ETLMYSRPRK VLVEQTKNEY FELKANLHAE PDYLEVLEQQ 840
T 

Seq ID NO: 9 DNA sequence 

Nucleic Acid Accession #: NM_003474 

Coding sequence: 307-3036 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51
| | | | | |

CACTAACGCT CTTCCTAGTC CCCGGGCCAA CTCGGACAGT TTGCTCATTT ATTGCAACGG 60 
TCAAGGCTGG CTTGTGCCAG AACGGCGCGC GCGCOACGCA CGCACACACA CGGGGGGAAA 120 
CTTTTTTAAA AATGAAAGGC TAGAAGAGCT CAGCGGCGGC GCGGGCCGTG CGCGAGGGCT 180 
CCGGAGCTGA CTCGCCGAGG CAGGAAATCC CTCCGGTCGC GACGCCCGGC CCCGCTCGGC 240 
G<XCGCGTG<3 GATGGTGCAG CGCTCGCCGC CGGGCCCGAG AGCTGCTGCA CTGAAGGCCG 300 
GCGACGATGG CAGCGCGCCC GCTGCCCGTG TCCCCCGCCC GCGCCCTCCT GCTCGCCCTG 360 
GCCGGTGCTC TGCTCGCGCC CTGCGAGGCC CGAGGGGTGA GCTTATGGAA CGAAGGAAGA 420 
GCTGATGAAG TTGTCAGTGC CTCTGTTCGG AGTGGGGACC TCTGGATCCC AGTGAAGAGC 480 
TTCGACTCCA AGAATCATCC AGAAGTGCTG AATATTCGAC TACAACGGGA AAGCAAAGAA 540 
CTGATCATAA ATCTGGAAAG AAATGAAGGT CTCATTGCCA GCAGTTTCAC GGAAACCCAC 600 
TATCTGCAAG ACGGTACTGA TGTCTCCCTC GCTCGAAATT ACACGGTAAT TCTGGGTCAC 660
TGTTACTACC ATGGACATGT ACGGGGATAT TCTGATTCAG CAGTCAGTCT CAGCACGTGT 720
TCTGGTCTCA GGGGACTTAT TGTGTTTGAA AATGAAAGCT ATGTCTTAGA ACCAATGAAA 780 
AGTGCAACCA ACAGATACAA ACTCTTCCCA GCGAAGAAGC TGAAAAGCGT CCGGGGATCA 840 
TGTGGATCAC ATCACAACAC ACCAAACCTC GCTGCAAAGA ATGTGTTTCC ACCACCCTCT 900 
CAGACATGGG CAAGAAGGCA TAAAAGAGAG ACCCTCAAGG CAACTAAGTA TGTGGAGCTG 960 
GTGATCGTGG CAGACAACCG AGAGTTTCAG AGGCAAGGAA AAGATCTGGA AAAAGTTAAG 1020 
CAGCGATTAA TAGAGATTGC TAATCACGTT GACAAGTTTT ACAGACCACT GAACATTCGG 1080 
ATCGTGTTGG TAGGCGTGGA AGTGTGGAAT GACATGGACA AATGCTCTGT AAGTCAGGAC 1140
CCATTCACCA GCCTCCATGA ATTTCTGGAC TGGAGGAAGA TGAAGCTTCT ACCTCGCAAA 1200
TCCCATGACA ATGCGCAGCT TGTCAGTGGG GTTTATTTCC AAGGGACCAC CATCGGCATG 1260 
GCCCCAATCA TGAGCATGTG CACGGCAGAC CAGTCTGGGG GAATTGTCAT GGACCATTCA 1320 
GACAATCCCC TTGGTGCAGC CGTGACCCTG GCACATGAGC TGGGCCACAA TTTCGGGATG 1380 
AATCATGACA CACTGGACAG GGGCTGTAGC TGTCAAATGG CGGTTGAGAA AGGAGGCTGC 1440
ATCATGAACG CTTCCACCGG GTACCCATTT CCCATGGTGT TCAGCAGTTG CAGCAGGAAG 1500 
GACTTGGAGA CCAGCCTGGA GAAAGGAATG GGGGTGTGCC TGTTTAACCT GCCGGAAGTC 1560
AGGGAGTCTT TCGGGGGCCA GAAGTGTGGG AACAGATTTG TGGAAGAAGG AGAGGAGTGT 1620
GACTGTGGGG AGCCAGAGGA ATGTATGAAT CGCTGCTGCA ATGCCACCAC CTGTACCCTG 1680
AAGCCGGACG CTGTGTGCGC ACATGGGCTG TGCTGTGAAG ACTGCCAGCT GAAGCCTGCA 1740
GGAACAGCGT GCAGGGACTC CAGCAACTCC TGTGACCTCC CAGAGTTCTG CACAGGGGCC 1800
AGCCCTCACT GCCCAGCCAA CGTGTACCTG CACGATGGGC ACTCATGTCA GGATGTGGAC 1860
GGCTACTGCT ACAATGGCAT CTGCCAGACT CACGAGCAGC AGTGTGTCAC ACTCTGGGGA 1920 
CCAGGTGCTA AACCTGCCCC TGGGATCTGC TTTGAGAGAG TCAATTCTGC AGGTGATCCT 1980 
TATGGCAACT GTGGCAAAGT CTCGAAGAGT TCCTTTGCCA AATGCGAGAT GAGAGATGCT 2040
AAATGTGGAA AAATCCAGTG TCAAGGAGGT GCCAGCCGGC CAGTCATTGG TACCAATGCC 2100 
GTTTCCATAG AAACAAACAT CCCCCTGCAG CAAGGAGGCC GGATTCTGTG CCGGGGGACC 2160 
CACGTGTACT TGGGCGATGA CATGCCGGAC CCAGGGCTTG TGCTTGCAGG CACAAAGTGT 2220 
GCAGATGGAA AAATCTGCCT GAATCGTCAA TGTCAAAATA TTAGTGTCTT TGGGGTTCAC 2280 
GAGTGTGCAA TGCAGTGCCA CGGCAGAGGG GTGTGCAACA ACAGGAAGAA CTGCCACTGC 2340 
GAGGCCCACT GGGCACCTCC CTTCTGTGAC AAGTTTGGCT TTGGAGGAAG CACAGACAGC 2400 
GGCCCCATCC GGCAAGCAGA TAACCAAGGT TTAACCATAG GAATTCTGGT GACCATCCTG 2460 
TGTCTTCTTG CTGCCGGATT TGTGGTTTAT CTCAAAAGGA AGACCTTGAT ACGACTGCTG 2520
TTTACAAATA AGAAGACCAC CATTGAAAAA CTAAGGTGTG TGCGCCCTTC CCGGCCACCC 2580
CGTGGCTTCC AACCCTGTCA GGCTCACCTC GGCCACCTTG GAAAAGGCCT GATGAGGAAG 2640 
CCGCCAGATT CCTACCCACC GAAGGACAAT CCCAGGAGAT TGCTGCAGTG TCAGAATGTT 2700 
GACATCAGCA GACCCCTCAA CGGCCTGAAT GTCCCTCAGC CCCAGTCAAC TCAGCGAGTG 2760
CTTCCTCCCC TCCACCGGGC CCCACGTGCA CCTAGCGTCC CTGCCAGACC CCTGCCAGCC 2820 
AAGCCTGCAC TTAGGCAGGC CCAGGGGACC TGTAAGCCAA ACCCCCCTCA GAAGCCTCTG 2880 
CCTGCAGATC CTCTGGCCAG AACAACTCGG CTCACTCATG CCTTGGCCAG GACCCCAGGA 2940 
CAATGGGAGA CTGGGCTCCG CCTGGCACCC CTCAGACCTG CTCCACAATA TCCACACCAA 3000 
GTGCCCAGAT CCACCCACAC CGCCTATATT AAGTGAOAAG CCGACACCTT TTTTCAACAG 3060
TGAAGACAGA AGTTTGCACT ATCTTTCAGC TCCAGTTGGA GTTTTTTGTA CCAACTTTTA 3120 
GGATTTTTTT TAATGTTTAA AACATCATTA CTATAAOAAC TTTGAGCTAC TGCCGTCAGT 3180 
GCTGTGCTGT GCTATGGTGC TCTGTCTACT TGCACAGGTA CTTGTAAATT ATTAATTTAT 3240
GCAGAATGTT GATTACAGTG CAGTGCGCTG TAGTAGGCAT TTTTACCATC ACTGAGTTTT 3300 
CCATGGCAGG AAGGCTTGTT GTGCTTTTAG TATTTTAGTG AACTTGAAAT ATCCTGCTTG 3360 
ATGGGATTCT GGACAGGATG TGTTTGCTTT CTGATCAAGG CCTTATTGGA AAGCAGTCCC 3420
CCAACTACCC CCAGCTGTGC TTATGGTACC AGATGCAGCT CAAGAGATCC CAAGTAGAAT 3480 
CTCAOTTGAT TTTCTGGATT CCCCATCTCA GGCCAGAGCC AAGGGGCTTC AGGTCCAGGC 3540
TGTGTTTGGC TTTCAGGGAG GCCCTGTGCC CCTTGACAAC TGGCAGGCAG GCTCCCAGGG 3600
ACACCTGGGA GAAATCTGGC TTCTGGCCAO GAAGCTTTGG TGAGAACCTG GGTTGCAGAC 3660
AGQAATCTTA AGGTGTAGCC ACACCAGGAT AGAGACTGGA ACACTAGACA AGCCAGAACT 3720 
TGACCCTGAG CTGACCAGCC GTGAGCATGT TTGGAAGGGG TCTGTAGTGT CACTCAAGOC 3780
GGTGCTTGAT AGAAATGCCA AGCACTTCTT TTTCTCGCTG TCCTTTCTAG AGCACTGCCA 3840 
CCAGTAGGTT ATTTAGCTTG GGAAAGGTGG TGTTTCTGTA AGAAACCTAC TGCCCAGGCA 3900 
CTGCAAACCG CCACCTCCCT ATACTGCTTG GAGCTGAGCA AATCACCACA AACTGTAATA 3960
CAATGATCCT GTATTCAGAC AGATGAGGAC TTTCCATGGG ACCACAACTA TTTTCAGATG 4020
TGAACCATTA ACCAGATCTA GTCAATCAAG TCTGTTTACT GCAAGGTTCA ACTTATTAAC 4080 
AATTAGGCAG ACTCTTTATG CTTGCAAAAA CTACAACCAA TGGAATGTGA TGTTCATGGG 4140 
TATAGTTCAT GTCTGCTATC ATTATTCGTA GATATTGGAC AAAGAACCTT CTCTATGGGG 4200
CATCCTCTTT TTCCAACTTG GCTGCAGGAA TCTTTAAAAG ATGCTTTTAA CAGAGTCTGA 4260 
ACCTATTTCT TAAACACTTG CAACCTACCT GTTGAGCATC ACAGAATGTG ATAAGGAAAT 4320 
CAACTTGCTT ATCAACTTCC TAAATATTAT GAGATGTGGC TTGGGCAGCA TCCCCTTGAA 4380 
CTCTTCACTC TTCAAATGCC TGACTAGGGA GCCATGTTTC ACAAGGTCTT TAAAGTGACT 4440
AATGGCATGA GAAATAGAAA AATACTCAGA TAAGGTAAAA TGCCATGATG CCTCTGTCTT 4500
CTGGACTGGT TTTCACATTA GAAGACAATT GACAACAGTT ACATAATTCA CTCTGAGTGT 4560
TTTATGAGAA AGCCTTCTTT TGGGGTCAAC AGTTTTCCTA TGCTTTGAAA CAGAAAAATA 4620 
TGTACCAAGA ATCTTGGTTT GCCTTCCAGA AAACAAAACT GCATTTCACT TTCCCGGTGT 4680
TCCCCACTGT ATCTAGGCAA CATAGTATTC ATGACTATGG ATAAACTAAA CACGTGACAC 4740 
AAACACACAC AAAAGGGAAC CCAGCTCTAA TACATTCCAA CTCGTATAGC ATGCATCTGT 4800
TTATTCTATA GTTATTAAGT TCTTTAAAAT GTAAAGCCAT GCTGGAAAAT AATACTGCTG 4860 
AGATACATAC AGAATTACTG TAACTGATTA CACTTGGTAA TTGTACTAAA GCCAAACATA 4920 
TATATACTAT TAAAAAGGTT TACAGAATTT TATGGTGCAT TACGTGGGCA TTGTCTTTTT 4980
AGATGCCCAA ATCCTTAGAT CTGGCATGTT AGCCCTTCCT CCAATTATAA GAGGATATGA 5040
ACCAAAAAAA AAAAAAAAAA AA 



Seq ID NO: 10 Protein sequence:
Protein Accession #: NP_003465.2

1 11 21 31 41 51
| | | | | |

MAARPLPVSP ARALLLALAG ALLAPCEARG VSLWNEGRAD EVVSASVRSG DLWIPVKSFD 60
SKNHPEVLNI RLQRESKELI INLERNEGLI ASSFTETHYL QDGTDVSLAR NYTVILGHCY 120
YHGHVRGYSD SAVSLSTCSG LRGLIVFENE SYVLEPMKSA TNRYKLFPAK KLKSVRGSCG 180 
SHHNTPNLAA KNVFPPPSQT WARRHKRETL KATKYVELVI VADNREFQRQ GKDLEKVKQR 240
LIEIANHVDK FYRPLNIRIV LVGVEVWNDM DKCSVSQDPF TSLHEFLDWR KMKLLPRKSH 300
DNAQLVSGVY FQGTTIGMAP IMSMCTADQS GGIVMDHSDN PLGAAVTLAH ELGHNFGMNH 360
DTLDRGCSCQ MAVEKGGCIM NASTGYPFPM VFSSCSRKDL ETSLEKGMGV CLFNLPEVRE 420
SFGGQKCGNR FVEEGEECDC GEPEECMNRC CNATTCTLKP DAVCAHGLCC EDCQLKPAGT 480 
ACRDSSNSCD LPEFCTGASP HCPANVYLHD GHSCQDVDGY CYNGICQTHE QQCVTLWGPG 540 
AKPAPGICFE RVNSAGDPYG NCGKVSKSSF AKCEMRDAKC GKIQCQGGAS RPVIGTNAVS 600
IETNIPLQQG GRILCRGTHV YLGDDMPDPG LVLAGTKCAD GKICLNRQCQ NISVFGVHEC 660
AMQCHGRGVC NNRKNCHCEA HWAPPFCDKF GFGGSTDSGP IRQADNQGLT IGILVTILCL 720
LAAGFVVYLK RKTLIRLLFT NKKTTIEKLR CVRPSRPPRG FQPCQAHLGH LGKGLMRKPP 780
DSYPPKDNPR RLLQCQNVDI SRPLNGLNVP QPQSTQRVLP PLHRAPRAPS VPARPLPAKP 840 
ALRQAQGTCK PNPPQKPLPA DPLARTTRLT HALARTPGQW ETGLRLAPLR PAPQYPHQVP 900 
RSTHTAYIK 

Seq ID NO: 11 DNA sequence

Nucleic Acid Accession #: NM_001394 

Coding sequence: 400-1584 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GGCACQAGGG CTGAGCGCCG GAGGAGCGTA GGCAGGGCAO CGCTGGCGCC AGCGGCGACA 60 
GGAGCCGCGC GACCGGCAAA AATACACGGG AGGCCGTCGC CGAAAAGAGT CCGCGGTCCT 120 
CTCTCGTAAA CACACTCTCC TCCACCGGCG CCTCCCCCTC CGCTCTGCGC GCCGCCCGGC 180 
TGGGCGCCCG AGGCCGCTCC GACTGCTATG TGACCGCGAG GCTGCGGGAG GAAGGGGACA 240 
GGGAAGAAGA GGCTCTCCCG CGGGAGCCCT TGAGGACCAA GTTTGCGGCC ACTTCTGCAG 300 
GCGTCCCTTC TTAGCTCTCG CCTGCCCCTT TCTGCAGCCT AGGCGGCCCA GGTTCTCTTC 360 
TCTTCCTCGC GCGCCCAGOC GCCTCGGTTC CCGGCGACCA TGGTGACGAT GGAGGAGCTG 420
CGGGAGATGG ACTGCAGTGT GCTCAAAAGG CTGATGAACC GGGACGAGAA TGGCGGCGGC 480
GCGGGCGGCA GCGGCAGCCA CGGCACCCTG GGGCTGCCGA GCGGCGGCAA GTGCCTGCTG 540 
CTGGACTGCA GACCGTTCCT GGCGCACAGC GCGGGCTACA TCCTAGGTTC GGTCAACGTG 600
CGCTGTAACA CCATCGTGCG GCGGCGGGCT AAGGGCTCCG TGAGCCTGGA GCAGATCCTG 660 
CCCGCCGAGG AGGAGGTACG CGCCCGCTTG CGCTCCGGCC TCTACTCGGC GGTCATCGTC 720
TACGACGAGC GCAGCCCGCG CGCCGAGAGC CTCCGCGAGG ACAGCACCGT GTCGCTGGTG 780
GTGCAGGCGC TGCGCCGCAA CGCCGAGCGC ACCGACATCT GCCTGCTCAA AGGCGGCTAT 840 
GAGAGGTTTT CCTCCGAGTA CCCAGAATTC TGTTCTAAAA CCAAGGCCCT GGCAGCCATC 900
CCACCCCCGG TTCCCCCCAG CGCCACAGAG CCCTTGGACC TGGGCTGCAG CTCCTGTGGG 960 
ACCCCACTAC ACGACCAGGG GGGTCCTGTG GAGATCCTTC CCTTCCTCTA CCTCGGCAGT 1020 
GCCTACCATG CTGCCCGGAG AGACATGCTG GACGCCCTGG GCATCACGGC TCTGTTGAAT 1080
GTCTCCTCGG ACTGCCCAAA CCACTTTGAA GGACACTATC AGTACAAGTG CATCCCAGTG 1140 
GAAGATAACC ACAAGGCCGA CATCAGCTCC TGGTTCATGG AAGCCATAGA GTACATCGAT 1200 
GCCGTGAAGG ACTGCCGTGG GCGCGTGCTG GTGCACTGCC AGGCGGGCAT CTCGCGGTCG 1260 
GCCACCATCT GCCTGGCCTA CCTGATGATG AAGAAACGGG TGAGGCTGGA GGAGGCCTTC 1320
GAGTTCGTTA AGCAGCGCCG CAGCATCATC TCGCCCAACT TCAGCTTCAT GGGGCAGCTG 1380
CTGCAGTTCG AGTCCCAGGT GCTGGCCACG TCCTGTGCTG CGGAGGCTGC TAGCCCCTCG 1440 
GGACCCCTGC GGGAGCGGGG CAAG ACCCCC GCCACCCCCA CCTCGCAGTT CGTCTTCAGC 1500 
TTTCCGGTCT CCGTGGGCGT GCACTCGGCC CCCAGCAGCC TGCCCTACCT GCACAGCCCC 1560 
ATCACCACCT CTCCCAGCTG TTAGAGCCOC CCTGGGGGCC CCAGAACCAG AGCTGGCTCC 1620 
CAGCAAGGGT AGGACGGGCC GCATGCGGCA GAAAGTTGGG ACTGAGCAGC TGGGAGCAGG 1680
CGACCGAGCT CCTTCCCCAT CATTTCTCCT TGGCCAACGA CGAGGCCAGC CAGAATGGCA 1740
ATAAGGACTC CGAATACATA ATAAAAGCAA ACAGAACACT CCAACTTAGA GCAATAACCG 1800
GTGCCGCAGC AGCCAGGGAA GACCTTGGTT TGGTTTATGT GTCAGTTTCA CTTTTCCGAT 1860
AGAAATTTCT TACCTCATTT TTTTAAGCAG TAAGGCTTGA AGTGATGAAA CCCACAGATC 1920 
CTAGCAAATG TGOCCAACCA GCTTTACTAA AGGGGGAGGA AGGGAGGGCA AAGGGATGAG 1980 
AAGACAAGTT TCCCAGAAGT GCCTGGTTCT GGGTACTTGT CCCTTTGTTG TCGTTGTTGT 2040
AGTTAAAGGA ATTTCATTTT TAAAAGAAAT CTTCGAAGGT GTGGTTTTCA TTTCTCAGTC 2100
ACCAACAGAT GAATAATTAT GCTTAATAAT AAAGTATTTA TTAAGACTTT CTTCAGAGTA 2160 
TGAAAGTACA AAAAGTCTAG TTACAGTGGA TTTAGAATAT ATTTATGTTG ATGTCAAACA 2220 
GCTGAGCACC GTAGCATGCA GATGTCAAGG CAGTTAGGAA GTAAATGGTG TCTTGTAGAT 2280 
ATGTGCAAGG TAGCATGATG AGCAACTTGA GTTTGTTGCC ACTGAGAAGC AGGCGGGTTG 2340 
GGTGGGAGGA GGAAGAAAGG GAAGAATTAG GTTTGAATTG CTTTTTAAAA AAAAAAGAAA 2400 
AGAAAAAGAC AGCATCTCAC TATGTTGCCA AGGCTCATCT TGAGAAGCAG GCGGGTTGGG 2460 
TGGGAGGAGG AAGAAAGGGA AGAATTAGGT TTGAATTGCT TTTTTAAAAA AAAA

Seq ID NO: 12 Protein sequence:
Protein Accession #: NP_001385

1 11 21 31 41 51 

MVTMEELREM DCSVLKRLMN RDENGGGAGG SGSHGTLGLP SGGKCLLLDC RPFLAHSAGY 60 
ILGSVNVRCN TIVRRRAKGS VSLEQILPAE EEVRARLRSG LYSAVIVYDE RSPRAESLRE 120
DSTVSLVVQA LRRNAERTDI CLLKGGYERF SSEYPEFCSK TKALAAIPPP VPPSATEPLD 180
LGCSSCGTPL HDQGGPVEIL PFLYLGSAYH AARRDMLDAL GITALLNVSS DCPNHFEGHY 240 
QYKCIPVEDN HKADISSWFM EAIEYIDAVK DCRGRVLVHC QAGISRSATI CLAYLMMKKR 300 
VRLEEAFEFV KQRRSIISPN FSFMGQLLQF ESQVLATSCA AEAASPSGPL RERGKTPATP 360 
TSQFVFSFPV SVGVHSAPSS LPYLHSPITT SPSC 

Seq ID NO: 13 DNA sequence

Nucleic Acid Accession #: none found 

Coding sequence: 68-340 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51
| | | | | |

AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60 
CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120
TGCCCAGAAT CCGACAACAG CTGCTCCAGC TGACACGTAT CCAGCTACTG GTCCTGCTGA 180
TGATGAAGCC CCTGATGCTG AAACCACTGC TGCTGCAACC ACTGCGACCA CTGCTGCTCC 240
TACCACTGCA ACCACCGCTG CTTCTACCAC TGCTCGTAAA GACATTCCAG TTTTACCCAA 300 
ATGGGTTGGG GATCTCCCGA ATGGTAGAGT GTGTCCCTGA GATGGAATCA GCTTGAGTCT 360 
TCTGCAATTG GGTCACAACT ATTCATGCTT CCTGTGATTT CATCCAACTA CTTACCTTGC 420 
CTACGATATC CCCTTTATCT CTAATCAGTT TATTTTCTTT CAAATAAAAA ATAACTATGA 480 
GCGAGCTAAC AT 



Seq ID NO: 14 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51
| | | | | |

MKFLAVLVLL GVSIFLVSAQ NPTTAAPADT YPATGPADDE APDAETTAAA TTATTAAPTT 60
ATTAASTTAR KDIPVLPKWV GDLPNGRVCP 
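
Each "Coding sequence" range in this listing can be read as a 1-based, inclusive pair of nucleotide positions running from the first base of the start codon to the last base of the stop codon. The following sketch, which is illustrative only and not part of the original disclosure, shows how such a range may be checked against the corresponding protein sequence. The helper names (clean_listing, extract_cds, expected_residues, translate) are hypothetical, the standard genetic code is assumed, and the start codon of Seq ID NO: 13 is taken to read ATG.

```python
# Illustrative sketch only; the helper names below are hypothetical.

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, enumerated in TCAG order ("*" marks a stop codon).
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_listing(lines):
    """Join listing lines, dropping spaces and trailing position numbers."""
    return "".join(ch for line in lines for ch in line if ch.isalpha()).upper()


def extract_cds(seq, start, end):
    """Return positions start..end (1-based, inclusive) of seq."""
    return seq[start - 1:end]


def expected_residues(start, end):
    """Residue count implied by a coding range that includes the stop codon."""
    length = end - start + 1
    if length % 3:
        raise ValueError("coding range is not a whole number of codons")
    return length // 3 - 1


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        if amino == "*":
            break
        residues.append(amino)
    return "".join(residues)


# First two lines of Seq ID NO: 13 (start codon assumed to read ATG).
listing = [
    "AGCGCCTTGC CTTCTCTTAG GCTTTGAAGC ATTTTTGTCT GTGCTCCCTG ATCTTCAGGT 60",
    "CACCACCATG AAGTTCTTAG CAGTCCTGGT ACTCTTGGGA GTTTCCATCT TTCTGGTCTC 120",
]
seq = clean_listing(listing)
prefix = translate(extract_cds(seq, 68, 97))
print(prefix)                      # first ten residues encoded from position 68
print(expected_residues(68, 340))  # residues implied by the stated range
```

Under these assumptions the first ten translated residues are MKFLAVLVLL, which agrees with the start of Seq ID NO: 14, and the range 68-340 implies 90 residues, which agrees with its length.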

Seq ID NO: 15 DNA sequence 

Nucleic Acid Accession #: NM_016640.2

Coding sequence: 39-1358 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

GCITAAGTTG ACCTCTGGGT CCGGAATCGC GGGCAAAGAT GGCGGCGGCC AGGTGTTGGA 60
GGCCTTTGCT ACGCGGTCCG AGGCTTTCAT TGCACACCGC GGCTAATGCC GCCGCCACGG 120 
CTACAGAAAC GACCTCCCAA GACGTCGCGG CGACCCCCGT CGCGCGGTAC CCGCCGATTG 180 
TGGCCTCCAT GACAGCCGAC AGCAAAGCTG CACGGCTGCG GCGGATCGAG CGCTGGCAGG 240 
CGACGGTGCA CGCTGCGGAG TCGGTAGACG AGAAGCTGCG AATCCTCACC AAGATGCAGT 300 
TTATGAAGTA CATGGTTTAC CCGCAGACCT TCGCACTGAA TGCCGACCGC TGGTACCAGT 360
ACTTCACCAA GACCGTGTTC CTGTCGGGTC TGCCGCCGCC CCCAGCGGAG CCCGAGCCCG 420 
AGCCCGAACC CGAACCTGAA CCTGCGCTGG ACCTCGCGGC GCTGCGTGCG GTCGCCTGCG 480 
ACTGCCTGCT GCAGGAGCAC TTCTACCTGC GGCGCAGGCG GCGCGTGCAC CGTTACGAGG 540
AGAGCGAGGT CATATCTTTG CCCTTCCTGG ATCAGCTGGT GTCAACCCTC GTGGGCCTCC 600 
TCAGCCCACA CAACCCGGCC CTGGCCGCTG CCGCCCTCGA TTATAGATGC CCAGTTCATT 660 
TTTACTGGGT GCGTGGTGAA GAAATTATTC CTCGTGGTCA TCGAAGAGGT CGAATTGATG 720
ACTTGCGATA CCAGATAGAT GATAAACCAA ACAACCAGAT TCGAATATCC AAGCAACTCG 780
CAGAGTTTGT GCCATTGGAT TATTCTGTTC CTATAGAAAT CCCCACTATA AAATGTAAAC 840
CAGACAAACT TCCATTATTC AAACGGCAGT ATGAAAACCA CATATTTGTT GGCTCAAAAA 900
CTGCAGATCC TTGCTGTTAC GGTCACACCC AGTTTCATCT GTTACCTGAC AAATTAAGAA 960 
GGGAAAGGCT TTTGAGACAA AACTGTGCTG ATCAGATAGA AGTTGTTTTT AGAGCTAATG 1020 
CTATTGCAAG CCTTTTTGCT TGGACTGGAG CACAAGCTAT GTATCAAGGA TTCTGGAGTG 1080 
AAGCAGATGT TACTCGACCT TTTGTCTCCC AGGCTGTGAT CACAGATGGA AAATACTTTT 1140
CCTTTTTCTG CTACCAGCTA AATACTTTGG CACTGACTAC ACAAGCTGAT CAAAATAACC 1200 
CTCGTAAAAA TATATGTTGG GGTACACAAA GTAAGCCTCT TTATGAAACA ATTGAGGATA 1260
ATGATGTGAA AGGTTTTAAT GATGATGTTC TACTTCAGAT AGTTCACTTT CTACTGAATA 1320 
GACCAAAAGA AGAAAAATCA CAGCTGTTGG AAAACTGAAA AAGCATATTT GATTGAGAAC 1380
TGTGGGAATA TTTAAATTTT ACTGAAGGAA CAATAATQAT GAGATTTGTA ACTGTCAACT 1440 
ATTAAATACA TTGATTTTTG AGACAAATAT TTCTTATGTC AACCTGTTAT TAGATCTCTT 1500
ACTCTGCTCA AATTCATCAC TGAAAGATTT AATTTTAGTT ACCTTTTGTT GATTTAAAAA 1560 
TAATTGCATT TGTATATTGC TAACTGATAA GACAAATTGA GTTATTGAGC TATTAAATGC 1620 
ACATTTTAAT ATAAATGCAG AAATCCCAAA TAAAATGCTA ACATACTGAA TTCAGTAATT 1680 
AAAAGAACCC ACTGC 



Seq ID NO: 16 Protein sequence:
Protein Accession #: NP_057724.1

1 11 21 31 41 51
| | | | | |

MAAARCWRPL LRGPRLSLHT AANAAATATE TTSQDVAATP VARYPPIVAS MTADSKAARL 60 
RRIERWQATV HAAESVDEKL RILTKMQFMK YMVYPQTFAL NADRWYQYFT KTVFLSGLPP 120
PPAEPEPEPE PEPEPALDLA ALRAVACDCL LQEHFYLRRR RRVHRYEESE VISLPFLDQL 180
VSTLVGLLSP HNPALAAAAL DYRCPVHFYW VRGEEIIPRG HRRGRIDDLR YQIDDKPNNQ 240
IRISKQLAEF VPLDYSVPIE IPTIKCKPDK LPLFKRQYEN HIFVGSKTAD PCCYGHTQFH 300
LLPDKLRRER LLRQNCADQI EVVFRANAIA SLFAWTGAQA MYQGFWSEAD VTRPFVSQAV 360
ITDGKYFSFF CYQLNTLALT TQADQNNPRK NICWGTQSKP LYETIEDNDV KGFNDDVLLQ 420
IVHFLLNRPK EEKSQLLEN

Seq ID NO: 17 DNA sequence 

Nucleic Acid Accession #: NM_025059.1

Coding sequence: 3-2150 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
| | | | | |

GCATGAGCCT GGACTGCACC AGCCATATCG CGCTGGGTGC CGCTTCGCCA GCGCCCGAGG 60 
AAACTTACGA TCATCTTTCG GAAGTCCCGG TCACGCGGGA GCAGTTAAAC CACTATCGGA 120 
ATGTGGCTCA AAATGCTCGA AGTGAACTTG CAGCAACTTT GGTCAAATTT GAATGTGCTC 180 
AGTCTGAGCT TCAAGACCTC CGATCCAAGA TGCTTTCTAA AGAAGTCTCC TGTCAAGAAC 240 
TGAAAGCTGA AATGGAGAGC TACAAGGAAA ACAATGCCAG AAAATCATCT CTCCTTACCT 300 
CTTTGAGAGA CAGAGTTCAG GAACTAGAAG AAGAATCAGC AGCACTTTCC ACTTCTAAAA 360 
TCAGAACAGA AATCACAGCT CACGCTGCAA TCAAGGAGAA CCAGGAATTA AAGAAGAAAG 420 
TTGTAGAGTT AAATGAAAAA TTACAAAAGT GTTCAAAAGA AAATGAGGAG AATAAGAAAC 480
AAGTTTCAAA GAATTGCAGG AAACATGAGG AATTTCTGAC TCAACTGCGT GACTGCTTGG 540
ATCCAGATGA GAGGAATGAC AAGGCATCAG ATGAAGATTT AATTTTAAAG CTTAGAGACC 600 
TGCGCAAAGA AAATGAATTC GTGAAAGGAC AAATTGTTAT TCTTGAAGAG ACTATAAATG 660
TCCATGAGAT GGAAGCAAAA GCTAGCAGAG AAACGATCAT GAGGCTGGCT TCAGAAGTCA 720
ACAGAGAGCA GAAAAAAGCT GCCTCCTGTA CTGAAGAGAA AGAGAAGCTG AACCAGGACC 780
TGCTCAGTGC TGTAGAAGCA AAAGAAGCTC TTGAAAGGGA AGTTAAGATC TTCCAAGAAA 840
GGCTGCTTGC TGGCCAGCAG GTCTGGGATG CCTCAAAGCA GGAAGTGAGC CTCCTGAAGA 900 
AAAGCTCTTC TGAGTTGGAG AAGAGTTTGA AGGCCAGTCA GGATGCAGTC ACAACCTCAC 960
AAAGCCAGTA CTCCTCATTT AGGGAGAAAA TCGCAGCCCT CCTTAGGGGC AGATTGAGCA 1020
TGACTGGGTC CACTGAGGAC ACCATTTTGG AGAAGATTCG AGAAATGGAC AGCCGGGAAG 1080 
AAAGCAGGGA CCGGATGGTC TCCCAGCTTG AAGCCCAAAT ATCTGAGCTT GTTGAACAGT 1140
TGGGAAAGGA GTCTGGGTTT CACCAGAAAG CTCTCCAGAG GGCCCAGAAA GCAGAGAATA 1200 
TGTTGGAGAC TCTTCAGGGT CAGCTGACAC ACCTGGAGGC AGAGCTGGTT TCTGGAGGTG 1260
TTTTGCGAGA CAACTTGAAT TTTGAGAAAC AAAAATATCT TAAATTTCTG GATCAGCTTT 1320
CTCAGAAAAT GAAGTTGGAC CAGATGGCTG CCGAACTTGG CTTTGACACG CGGCTGGACG 1380 
TGGTTTTAGC TCGAACAGAG CAGCTGGTTC GTCTTGAGAG CAATGCAGTC ATTGAGAACA 1440 
AGACCATTGC CCACAATTTG CAGAGAAAGC TAAAGACACA GAAAGAGAGA CTGGAGAGCA 1500 
AAGAATTACA CATGAGCCTC CTCCGGCAGA AAATAGCCCA GCTGGAGGAG GAGAAGCAGG 1560 
CACGCACGGC CTTGGTGGTT GAGAGGGACA ACGCGCATCT TACCATCAGG AACTTGCAGA 1620
AGAAGGTGGA GAGGCTGCAG AAAGAGCTGA ACACGTGTCG AGACTTGCAC ACCGAGCTCA 1680 
AAGCCAAACT GGCCGACACC AATGAACTGA AGATTAAAAC TTTGGAACAG ACTAAAGCCA 1740
TTGAAGATCT AAACAAATCC AGAGACCAAC TGGAGAAGAT GAAGGAGAAA GCTGAGAAAA 1800 
AGCTCATGTC TGTCAAGTCA GAACTGGATA CCACAGAACA TGAGGCTAAG GAGAATAAAG 1860
AAAGGGCCAG AAACATGATA GAAGTGGTAA CCAGTGAAAT GAAGACACTA AAAAAATCTC 1920 
TGGAAGAAGC AGAAAAGAGA GAAAAGCAGC TGGCAGACTT CAGGGAGGTG GTGTCGCAGA 1980 
TGCTAGGCTT GAACGTGACC AGCCTTGCTC TTCCTGATTA TGAAATCATC AAGTGTCTTG 2040
AAAGATTGGT CCATTCACAT CAGCATCACT TTGTTACCTG TGCCTGCCTC AAAGATGTGA 2100
CTACTGGGCA AGAGAGGCAC CCACAAGGCC ATTTACAGCT TCTTCATTJSA ACACTGTATC 2160 
TCTTGAGAGA GGTGGCCATA AGACATGGCA CACAATTCCC AATTTCACAA ATTCCTCATG 2220
TCTTTGAGAT TTG ATCAGTT TGTGAATATT TTATGCTTTG ATG ATATAGT 0 AGAATGCAT 2280 
CACTTGCAAA AACGATCTCA AAAGTGTCAG CCTTAGATAA ACGTCAGCAT TAAAAAACGC 2340 
CAAAAAAAAA AAAAAAAAGC ATTTTAGGAT CCAGAAGAAT TCCACCAGAT TGCATGAGTT 2400
AGATTGGGAA ATGGGAGTGG GAGATAATAT TGGQAGGTAT CTATTTTAAG TCAGGGOCTT 2460
TACTAGCCGA TTTAGTTCTC ACAATAACCA TGTGGAGAAG CTGTGACATT TTTAATTTAC 2520
AACCTTTCTG GGGCTCAGAC ATAAAGTTAC CTATCCAAGG TTGCAGTTGG GTAGTGGTGG 2580 
GACCAGGATG GACAACTCAT TGGCCCTGCC TCAAAAGCCA TACCTCTTCT CCTGCTATGC 2640 
AGAATCTGTT TCTCCTGAAT CTCTGTGATG CTGGTGGGAA TTGTTTGCAT AGAGGAAGGA 2700
CAATAACCCT GCCATCGTGA GTTAATGTCC GGGCTGGTCA CAGTGGTTCA TGCCTGTAAT 2760 
CCCAGCACTT TGGGAGTCCA AGGCAGGCAT ATCATTTGAG GTCAGGAGTT TAAGACCAGC 2820 
CTGGCTAACA TAGTGAGACC CTGTTTCTAC TAAAAATACA AAAATAAGCC AGGTGTGGTG 2880 
GTGCATGACT GTAATCCCAA CTACTCAGCA GGAGAAGCAC TTGAACCCAG GAGACGGAGG 2940 
CGGCAGTGAG CCAAGATTGT GCCACTGCAC TCCAGCCTGG GCGACAGAGT GAAACTACAT 3000 
CTCAGAAAAA AAAAAAAAAA AAAAAAAAAA AAGTTAATGT CCAAAAATGA CAGATTTACA 3060 
AGTGTAAGCT ATATGATTTC TTCAAAAAGC AAAAGCAATA TACCTAATTC ATTTGGATCA 3120 
AACTTACATA GGTCTCAGGT CCTGTAAGAA ACTTGCCTGT TCTAACTGTT GCTACCAGAT 3180 
TATATCTGGT GGTAATTGTT AATGTTTCAG CAGGGCTGGT CTCAGTCCTT TAAAATGGAA 3240 
AGCTTTATTT GGAGCCCAAC CCTATAAGAT GAAGAAATCC TATATAGTCT TATTCACCAA 3300
TATATCCAAT ACACCCACAG CAATGGTACG TTTTTAAGAT CAGGATTTTA TTATGAATTC 3360 
CTGTCACTTT CTGTTTTCCA TTTAAATTTC TATTTTACAA ATTTTTCAGG GAATCATATT 3420
CTTAACTTCA CTGAGAAAAA TGTGTTACTC TTTTGGACAA TTTATCTTAT TTCTATCATA 3480
TAAGATGTAT TTTTTTATTG TCCTTAAAAG AAGCTCTAGC ATGAAATTAA AGGAAAGGGA 3540
AAGAAATAGA TCTGGTGCAC CCGAACATTA GGAGAAAATG AAAAATATAC AACCAACCGT 3600
TCGTGAGTCA TCAAAAAGTC AAAGTCAGCC TGGCCAACAT GGCAAAACTC CGTCTCTGCA 3660
AAAAATACAA AAATGAGCCC GGTATGTTGG CATATGCCTG TAATCCCAGC TACTCGAGAG 3720 
GCTGAGACAC GAAAATTGCT TGAACCTGGG AGGCGGAGGT TGCAATGAGC CGAGATCGCG 3780
CTACTGCACT CCAGCCTGGG CAACAGAGAG AGACCTTGTC TCAAAAAACA ACAACAACAA 3840
AAAGTCAAAG TCATAATAAG CAAATTATTG GCTTCTTTCT TCTAGACYAA AAGAAATTAA 3900
AGAGATGAAA CAATCAATTO CAAGGGTCAA AACTAGATTG GATCTTGGTT TGAATGAAAA 3960
AAAGCATAAA ATATTCTTGC AATAATTGTA AAAATTTGAA TGTGGACTAA GTCCTAGATT 4020
ATATTAAAAT ATTTTTAATT TTTTAAGCTT GACAAATGCA CTGATTGTTA TACTTTAAAT 4080 
AACTAAAAAT CTOAGAATCC ACAGTGCTAC AGACAATAAA TGATAAAATG GGAAAAAAAA 4140 
AAAAAAAAAA A 



Seq ID NO: 18 Protein sequence:
Protein Accession #: NP_079335.1

1 11 21 31 41 51

MSLDCTSHIA LGAASPAPEE TYDHLSEVPV TREQLNHYRN VAQNARSELA ATLVKFECAQ 60
SELQDLRSKM LSKEVSCQEL KAEMESYKEN NARKSSLLTS LRDRVQELEE ESAALSTSKI 120 
RTEITAHAAI KENQELKKKV VELNEKLQKC SKENEENKKQ VSKNCRKHEE FLTQLRDCLD 180
PDERNDKASD EDLILKLRDL RKENEFVKGQ IVILEETINV HEMEAKASRE TIMRLASEVN 240 
REQKKAASCT EEKEKLNQDL LSAVEAKEAL EREVKIFQER LLAGQQVWDA SKQEVSLLKK 300 
SSSELEKSLK ASQDAVTTSQ SQYSSFREKI AALLRGRLSM TGSTEDTILE KIREMDSREE 360
SRDRMVSQLE AQISELVEQL GKESGFHQKA LQRAQKAENM LETLQGQLTH LEAELVSGGV 420
LRDNLNFEKQ KYLKFLDQLS QKMKLDQMAA ELGFDTRLDV VLARTEQLVR LESNAVIENK 480
TIAHNLQRKL KTQKERLESK ELHMSLLRQK IAQLEEEKQA RTALVVERDN AHLTIRNLQK 540
KVERLQKELN TCRDLHTELK AKLADTNELK IKTLEQTKAI EDLNKSRDQL EKMKEKAEKK 600
LMSVKSELDT TEHEAKENKE RARNMIEVVT SEMKTLKKSL EEAEKREKQL ADFREVVSQM 660
LGLNVTSLAL PDYEIIKCLE RLVHSHQHHF VTCACLKDVT TGQERHPQGH LQLLH 



Seq ID NO: 19 DNA sequence 

Nucleic Acid Accession #: AF071552, NM_000662 

Coding sequence: 441-1313 (underlined sequences correspond to start and stop codons)



1 11 21 31 41 51 

CTTTGTATAA GGCTCAGCTA AAAGGGAAAT TGAGTGGGTC AOGTACCAC G OAT ACTATAC 60 
ACTCTATTGC ATGATTCTCC TGCCTACATC AGAAGACGTT TATAAGCCTA TTTTAAAGGA 120
TACCAGTTGG AATCTCTCTT TTATTAATCA CCAAGAGAAC CATGAACAAG CTGTTTATCA 180
TTTGACTCAT CATTTAATCT TGATTTCCAG CTTCTCACAC TTGAAAGAAG ACATAATACA 240
TTTCTCACAG GATTCTGGGA CTATTAACTG AACTTATGTG TGTAAAAGGA ATTCATACAA 300
TGAAAGCACT AGAAATAATT ATTATACTTA TAACCATTGT ATTTTTACAT GTTTAAAATA 360
TAGCCATAAT TAGCCTACTC AAATCCAAGT GTAAAAGTAA AATGATTTGC TTTCGTTTTG 420 
TTTTCCTTGC TTAGGGGATC ATGGACATTG AAGCATATCT TGAAAGAATT GGCTATAAGA 480 
AGTCTAGGAA CAAATTGGAC TTGGAAACAT TAACTGACAT TCTTCAACAC CAGATCCGAG 540
CTGTTCCCTT TGAGAACCTT AACATCCATT GTGGGGATGC CATGGACTTA GGCTTAGAGG 600
(XATTTTTG A TCAAGTTGTG AGAAGAAATC GGGGTGGATG GTGTCTCCAG GTCAATCATC 660 
TTCTGTACTG GGCTCTGACC ACTATTGGTT TTGAGACCAC GATGTTGGGA GGGTATGTTT 720
ACAGCACTCC AGCCAAAAAA TACAGCACTG GCATGATTCA CCTTCTCCTG CAGGTGACCA 780
TTGATGGCAG GAACTACATT GTCGATGCTG GGTTTGGACG CTCATACCAG ATGTGGCAGC 840
CTCTGGAGTT AATTTCTGGG AAGGATCAGC CTCAGGTGCC TTGTGTCTTC CGTTTGACGG 900
AAGAGAATGG ATTCTGGTAT CTAGACCAAA TCAGAAGGGA ACAGTACATT CCAAATGAAG 960
AATTTCTTCA TTCTGATCTC CTAGAAGACA GCAAATACCG AAAAATCTAC TCCTTTACTC 1020 
TTAAGCCTCG AACAATTGAA GATTTTGAGT CTATGAATAC ATACCTGCAG ACATCTCCAT 1080 
CATCTGTGTT TACTAGTAAA TCATTTTGTT CCTTGCAGAC CCCAGATGGG GTTCACTGTT 1140
TGGTGGGCTT CACCCTCACC CATAGGAGAT TCAATTATAA GGACAATACA GATCTAATAG 1200 
AGTTCAAGAC TCTGAGTGAG GAAGAAATAG AAAAAGTGCT GAAAAATATA TTTAATATTT 1260 
CCTTGCAGAG AAAGCTTGTG CCCAAACATG GTGATAGATT TTTTACTATT TAGAATAAGG 1320
AGTAAAACAA TCTTGTCTAT TTGTCATCCA GCTCACCAGT TATCAACTGA CGACCTATCA 1380 
TGTATCTTCT GTACCCTTAC CTTATTTTGA AGAAAATCCT AGACATCAAA TCATTTCACC 1440 
TATAAAAATG TCATCATATA TAATTAAACA G C MTU AAA GAAACATAAC CACAAACCTT 1500 
TTCAAATAAT AATAATAATA ATAATAATAA ATGTATTTTA AAGATGGCCT GTGGTTATCT 1560 
TGGAAATTGG TGATTTATGC TAGAAAGCTT TTAATGTTGG TTTATTGTTG AATTC 

Seq ID NO: 20 Protein sequence: 
Protein Accession #: NP_000653.1



1 11 21 31 41 51
| | | | | |

MDIEAYLERI GYKKSRNKLD LETLTDILQH QIRAVPFENL NIHCGDAMDL GLEAIFDQVV 60
RRNRGGWCLQ VNHLLYWALT TIGFETTMLG GYVYSTPAKK YSTGMIHLLL QVTIDGRNYI 120
VDAGFGRSYQ MWQPLELISG KDQPQVPCVF RLTEENGFWY LDQIRREQYI PNEEFLHSDL 180
LEDSKYRKIY SFTLKPRTIE DFESMNTYLQ TSPSSVFTSK SFCSLQTPDG VHCLVGFTLT 240
HRRFNYKDNT DLIEFKTLSE EEIEKVLKNI FNISLQRKLV PKHGDRFFTI

Seq ID NO: 21 DNA sequence 

Nucleic Acid Accession #: NM_003714

Coding sequence: 123-1031 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

CGGCACGAGC AAAAAGGAAG AGTGGGAGGA GGAGGGGAAG CGGCGAAGGA GGAAGAGGAG 60 
GAGGAGGAAG AGGGGAGCAC AAAGGATCCA GGTCTCCCGA CGGGAGGTTA ATACCAAGAA 120
CCATGTGTGC CGAGCGGCTG GGCCAGTTCA TGACCCTGGC TTTGGTGTTG GCCACCTTTG 180
ACCCGGCGCG GGGGACCGAC GCCACCAACC CACCCGAGGG TCCCCAAGAC AGGAGCTCCC 240 
AGCAGAAAGG CCGCCTGTCC CTGCAGAATA CAGCGGAGAT CCAGCACTGT TTGGTCAACG 300
CTGGCGATGT GGGGTGTGGC GTGTTTGAAT GTTTCGAGAA CAACTCTTGT GAGATTCGGG 360
GCTTACATGG GATTTGCATG ACTTTTCTGC ACAACGCTGG AAAATTTGAT GCCCAGGGCA 420 
AGTCATTCAT CAAAGACGCC TTGAAATGTA AGGCCCACGC TCTGCGGCAC AGGTTCGGCT 480 
GCATAAGCCG GAAGTGCCCG GCCATCAGGG AAATGGTGTC CCAGTTGCAG CGGGAATGCT 540
ACCTCAAGCA CGACCTGTGC GCGGCTGCCC AGGAGAACAC CCGGGTGATA GTGGAGATGA 600
TCCATTTCAA GGACTTGCTG CTGCACGAAC CCTACGTGGA CCTCGTGAAC TTGCTGCTGA 660
CCTGTGGGGA GGAGGTGAAG GAGGCCATCA CCCACAGCGT GCAGGTTCAG TGTGAGCAGA 720
ACTGGGGAAG CCTGTGCTCC ATCTTGAGCT TCTGCACCTC GGCCATCCAG AAGCCTCCCA 780
CGGCGCCCCC CGAGCGCCAG CCCCAGGTGG ACAGAACCAA GCTCTCCAGG GCCCACCACG 840 
GGGAAGCAGG ACATCACCTC CCAGAGCCCA GCAGTAGGGA GACTGGCCGA GGTGCCAAGG 900
GTGAGCGAGG TAGCAAGAGC CACCCAAACG CCCATGCCCG AGGCAGAGTC GGGGGCCTTG 960
GGGCTCAGGG ACCTTCCGGA AGCAGCGAGT GGGAAGACGA ACAGTCTGAG TATTCTGATA 1020
TCCGGAGGTG AAATGAAAGO CCTGGCCACG AAATCTTTCC TCCACGCCGT CCATTTTCTT 1080
ATCTATGGAC ATTCCAAAAC ATTTACCATT AGAGAGGGGG GATGTCACAC GCAGGATTCT 1140
GTGGGGACTG TGGACTTCAT CGAGGTGTGT GTTCGCGGAA CGGACAGGTG AGATGGAGAC 1200 
CCCTGGGGCC GTGGGGTCTC AGGGGTGCCT GGTGAATTCT GCACTTACAC GTACTCAAGG 1260 
GAGCGCGCCC GCGTTATCCT CGTACCTTTG TCTTCTTTCC ATCTGTGGAG TCAGTGGGTG 1320 
TCGGCCGCTC TGTTGTGGGG GAGGTGAACC AGGGAGGGGC AGGGCAAGGC AGGGCCCCCA 1380 
GAGCTGGGCC ACACAGTGGG TGCTGGGCCT CGCCCCGAAG CTTCTGGTGC AGCAGCCTCT 1440 
GGTGCTGTCT CCGCGG AAGT CAGGGCGGCT GGATTCCAGG ACAGGAGTGA ATGTAAAAAT 1500 
AAATATCGCT TAGAATGCAG GAGAAGGGTG GAGAGGAGGC AGGGGCCGAG GGGGTGCTTG 1560 
GTGCCAAACT GAAATTCAGT TTCTTGTGTG GGGCCTTGCG GTTCAGAGCT CTTGGCGAGG 1620 
GTGGAGGGAG GAGTGTCATT TCTATGTGTA ATTTCTGAGC CATTGTACTG TCTGGGCTGG 1680 
GGGGGACACT GTCCAAGGGA GTGGCCCCTA TGAGTTTATA TTTTAACCAC TGCTTCAAAT 1740 
CTCGATTTCA CTTTTTTTAT TTATCCAGTT ATATCTACAT ATCTGTCATC TAAATAAATG 1800 
GCTTTCAAAC AAAAAAAAAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 22 Protein sequence: 
Protein Accession #: NP_003705 



1 11 21 31 41 51 
I I I I I I 

MCAERLGQFM TLALVLATFD PARGTDATNP PEGPQDRSSQ QKGRLSLQNT AEIQHCLVNA 60 
GDVGCGVFEC FENNSCEIRG LHGICMTFLH NAGKFDAQGK SFIKDALKCK AHALRHRFGC 120 
ISRKCPAIRE MVSQLQRECY LKHDLCAAAQ ENTRVIVEMI HFKDLLLHEP YVDLVNLLLT 180 
CGEEVKEAIT HSVQVQCEQN WGSLCSILSF CTSAIQKPPT APPERQPQVD RTKLSRAHHG 240 
EAGHHLPEPS SRETGRGAKG ERGSKSHPNA HARGRVGGLG AQGPSGSSEW EDEQSEYSDI 300 
RR 



Seq ID NO: 23 DNA sequence 

Nucleic Acid Accession #: NM_005264.1 

Coding sequence: 557-1954 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
I I I I I I 

GAATTCCOGC CAGAAGAAAT CTGGCCTCGG AACACGCCAT TCTCCGCGCC GCTTCCAATA 60 
ACCACTAACA TCCCTAACGA GCATCCG AGC CGAGGGCTC T GCTCGGAAAT CGTCCT GGCC 120 
CAACTCGGCC CTTCGAGCTC TCGAAGATTA CCGCATCTAT TTTTTTTTTC TTTTTTTTCT 180 
TTTCCTAGCG CAG ATAAAGT GAGCCCGGAA AGGGAAGGAG GGGGCGGGGA CACCATTGCC 240 
CTGAAAGAAT AAATAAGTAA ATAAACAAAC TGGCTCCTCG CCGCAGCTGG ACGCGGTCGG 300 
TTGAGTCCAG GTTGGGTCGG ACCTGAACCC CTAAAAGCGG AACCGCCTCC CGCCCTCGCC 360 
ATCCCGGAGC TGAGTCGCCG GCGGCGGTGG CTGCTGCCAG ACCCGGAGTT TCCTCTTTCA 420 
CTGGATGGAG CTGAACTTTG GGCGGCCAGA GCAGC AC AGC TGTCCG GGG A TCGCTGCACG 480 
CTGAGCTCCC TCGGCAAGAC CCAGCGGCGG CTCGGG ATTT TTTTGGGGGG GCGGGGACCA 540 
GCCCCGCGCC GGCAC CATGT TCCTGGCGAC CCTGTACTTC GCGCTGCCGC TCTTGGACTT 600 
GCTCCTGTCG GCCG AAGTGA GCGGCGG AGA CCGCCTGGAT TGCGTG AAAG CCAGTGATCA 660 
GTGCCTGAAG GAGCAGAGCT GCAGCACCAA GTACCGCACG CTAAGGCAGT GCGTGGCGGG 720 
CAAGGAGACC AACTTCAGCC TGGCATCCGG CCTGGAGGCC AAGGATGAGT GCCGCAGCGC 780 
CATGGAGGCC CTG AAGCAGA AGTCGCTCTA CAACTGCCGC TGCAAGCGGG GTATGAAGAA 840 
GGAGAAGAAC TGCCTGCGCA TTTACTGGAG CATGTACCAG AGCCTGCAGG GAAATGATCT 900 
GCTGGAGGAT TCCCCATATG AACCAGTTAA CAGCAGATTG TCAGATATAT TCCGGGTGGT 960 
CCCATTCATA TCAGATGTTT TTCAGCAAGT GGAGCACATT CCCAAAGGGA ACAACTGCCT 1020 
GGATGCAGCG AAGGCCTGCA ACCTCGACGA CATTTGCAAG AAGTACAGGT CGGCGTACAT 1080 
CACCCCGTGC ACCACCAGCG TGTCCAACGA TGTCTGCAAC CGCCGCAAGT GCCACAAGGC 1140 
CCTCCGGCAG TTCTTTGACA AGGTCCCGGC CAAGCACAGC TACGGAATGC TCTTCTGCTC 1200 
CTGCCGGGAC ATCGCCTGCA CAGAGCGGAG GCOACAGACC ATCGTGCCTG TGTGCTCCTA 1260 
TGAAGAGAGG GAGAAGCCCA ACTGTTTGAA TTTGCAGGAC TCCTGCAAGA CGAATTACAT 1320 
CTGCAGATCT CGCCTTGCGG A TTTTTTT AC CAACTGCCAG CCAGAGTCAA GGTCTGTCAG 1380 
CAGCTGTCTA AAGGAAAACT ACGCTGACTG CCTCCTCGCC TACTCGGGGC TTATTGGCAC 1440 
AGTCATGACC CCCAACTACA TAGACTCCAG TAGCCTCAGT GTGGCCCCAT GGTGTGACTG 1500 
CAGCAACAGT GGGAACGACC TAGAAGAGTG CTTGAAATTT TTGAATTTCT TCAAGGACAA 1560 
TACATGTCTT AAAAATGCAA TTCAAGCCTT TGGCAATGGC TCCGATGTGA CCGTGTGGCA 1620 
GCCAGCCTTC CCAGTACAGA CCACCACTGC CACTACCACC ACTGCCCTCC GGGTTAAGAA 1680 
CAAGCCCCTG GGGCCAGCAG GGTCTGAGAA TGAAATTCCC ACTCATGTTT TGCCACCGTG 1740 
TGCAAATTTA CAGGCACAGA AGCTGAAATC CAATGTGTCG GGCAATACAC ACCTCTGTAT 1800 
TTCCAATGGT AATTATGAAA AAGAAGGTCT CGGTGCTTCC AGCCACATAA CCACAAAATC 1860 
AATGGCTGCT CCTCCAAGCT GTGGTCTGAG CCCACTGCTG GTCCTGGTGG TAACCGCTCT 1920 
GTCCACCCTA TTATCTTTAA CAGAAACATC ATAGCTGCAT TAAAAAAATA CAATATGGAC 1980 
ATGTAAAAAG ACAA AAACCA AGTTATCTGT TTCCTGTTCT CTT GTATAGC TGAAATTCCA 2040 
GTTTAGGAGC TCAGTTGAGA AACAGTTCCA TTCAACTGGA ACATTTTTTT TTTTCCTTTT 2100 
AAGAAAGCTT CTTGTGATCC TTCGGGGCTT CTGTGAAAAA CCTGATGCAG TGCTCCATCC 2160 
AAACTCAGAA GGCTTTGGGA TATGCTGTAT TTTAAAGGGA CAGTTTGTAA CTTGGGCTGT 2220 
AAAGCAAACT GGGGCTGTGT TTTCGATGAT GATGATCATC ATGATCATGA TGATTTTAAC 2280 
AGTTTTACTT CTGGCCTTTC CTAGCTAGAG AAGGAGTTAA TATTTCTAAG GTAACTCCCA 2340 
TATCTCCTTT AATGACATTG ATTTCTAATG ATATAAATTT CAGCCTACAT TGATGCCAAG 2400 
CTTTTTTG CC ACAAAGAAGA TTCTTACCAA GAGTGGGCTT TGTGGAAACA GCTGGTACTG 2460 
ATGTTCACCT TTATATATGT ACTAGCATTT TCCACGCTGA TGTTTATGTA CTGTAAACAG 2520 
TTCTGCACTC TTGTACAAAA GAAAAAACCA CCCGGAATTC 

Seq ID NO: 24 Protein sequence: 
Protein Accession #: NP_005255 

1 11 21 31 41 51 
I I I I I I 

MFLATLYFAL PLLDLLLSAE VSGGDRLDCV KASDQCLKEQ SCSTKYRTLR QCVAGKETNF 60 
SLASGLEAKD ECRSAMEALK QKSLYNCRCK RGMKKEKNCL RIYWSMYQSL QGNDLLEDSP 120 
YEPVNSRLSD IFRVVPFISD VFQQVEHIPK GNNCLDAAKA CNLDDICKKY RSAYITPCTT 180 
SVSNDVCNRR KCHKALRQFF DKVPAKHSYG MLFCSCRDIA CTERRRQTIV PVCSYEEREK 240 
PNCLNLQDSC KTNYICRSRL ADFFTNCQPE SRSVSSCLKE NYADCLLAYS GLIGTVMTPN 300 
YIDSSSLSVA PWCDCSNSGN DLEECLKFLN FFKDNTCLKN AIQAFGNGSD VTVWQPAFPV 360 
QTTTATTTTA LRVKNKPLGP AGSENEIPTH VLPPCANLQA QKLKSNVSGN THLCISNGNY 420 
EKEGLGASSH ITTKSMAAPP SCGLSPLLVL VVTALSTLLS LTETS 

Seq ID NO: 25 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1-1576 (entire sequence represents open reading frame) 

1 11 21 31 41 51 

CTTTTGTTTC GCCATGCCTA GTCTAGTGGT ATCTGGAATA ATGGAAAGAA ATGGGGGCTT 60 
TGGAGAACTA GGATGTTTCG GGGGAAGCGC TAAGGACCGA GGGCTGCTGG AAGACGAGCG 120 
CGCCCTTCAG CTGGCTCTCG ATCAACTCTG CCTCCTGGGT TTGGGGGAGC CCCCCGCCCC 180 
CAGGGCGGGC GAGGACGGGG GAGGTGGGGG GGGCGGCGCC CCCGCGCAGC CGACAGCCCC 240 
CCGGCAGCCG GCGCCGCCGC CGCCGCCCGC GGCGCCCCCG GCCGCCCCGA CGACGGCCCC 300 
CGCAGCGCAG ACGCCCCAGC CCCCCACCGC CCCCAAAGGG GCGAGCGACG CCAAGCTCTG 360 
CGCTCTCTAC AAAGAGGCCG AGCTGCGCCT GAAGGGCAGC AGCAACACCA CGGAGTGTGT 420 
TCCCGTGCCC ACCTCCGAGC ACGTGGCCGA GATCGTGGGC AGGCAAGGCT GCAAGATTAA 480 
GGCCTTGAGG GCCAAGACCA ACACCTACAT CAAGACACCG GTGAGGGGCG AGGAACCAGT 540 
GTTCATGGTG ACAGGGCGAC GGGAGGACGT GGCCACAGCC CGGCGGGAAA TCATCTCAGC 600 
AGCGGAGCAC TTCTCCATGA TCCGTGCCTC CCGCAACAAG TCAGGCGCCG CCTTTGGTGT 660 
GGCTCCTGCT CTGCCCGGCC AGGTGACCAT CCGTGTGCGG GTGCCCTACC GCGTGGTGGG 720 
GCTGGTGGTG GGCCCCAAAG GGGCAACCAT CAAGCGCATC CAGCAGCAAA CCAACACATA 780 
CATTATCACA CCAAGCCGTG ACCGCGACCC CGTGTTCGAG ATCACGGGTG CCCCAGGCAA 840 
CGTGGAGCGT GCGCGCGAGG AGATCGAGAC GCACATCGCG GTGCGCACTG GCAAGATCCT 900 
CGAGTACAAC AATGAAAACG ACTTCCTGGC GGGGAGCCCC GACGCAGCAA TCGATAGCCG 960 
CTACTCCGAC GCCTGGCGGG TGCACCAGCC CGGCTGCAAG CCCCTCTCCA CCTTCCGGCA 1020 
GAACAGCCTG GGCTGCATCG GCGAGTGCGG AGTGGACTCT GGCTTTGAGG CCCCACGCCT 1080 
GGGTGAGCAG GGCGGGGACT TTGGCTACGG CGGGTACCTC TTTCCGGGCT ATGGCGTGGG 1140 
CAAGCAGGAT GTGTACTACG GCGTGGCCGA GACTAGCCCC CCGCTGTGGG CGGGCCAGGA 1200 
GAACGCCACG CCCACCTCCG TGCTCTTCTC CTCTGCCTCC TCCTCCTCCT CCTCTTCCGC 1260 
CAAGGCCCGC GCTGGGCCCC CGGGCGCACA CCGCTCCCCT GCCACTTCCG CGGGACCCGA 1320 
GCTGGCCGGA CTCCCGAGGC GCCCCCCGGG AGAGCCGCTC CAGGGCTTCT CTAAACTTGG 1380 
TGGGGGCGGC CTGCGGAGCC CCGGCGGCGG GCGGGATTGC ATGGTCTGCT TTGAGAGCGA 1440 
AGTGACTGCC GCCCTTGTGC CCTGCGGACA CAACCTGTTC TGCATGGAGT GTGCAGTACG 1500 
CATCTGCGAG AGGACGGACC CAGAGTGTCC CGTCTGCCAC ATCACAGCCA CGCAAGCCAT 1560 
CCGAATATTC TCCTAA 

Seq ID NO: 26 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

FCFAMPSLVV SGIMERNGGF GELGCFGGSA KDRGLLEDER ALQLALDQLC LLGLGEPPAP 60 
RAGEDGGGGG GGAPAQPTAP PQPAPPPPPA APPAAPTTAP AAQTPQPPTA PKGASDAKLC 120 
ALYKEAELRL KGSSNTTECV PVPTSEHVAE IVGRQGCKIK ALRAKTNTYI KTPVRGEEPV 180 
FMVTGRREDV ATARREIISA AEHFSMIRAS RNKSGAAFGV APALPGQVTI RVRVPYRVVG 240 
LVVGPKGATI KRIQQQTNTY IITPSRDRDP VFEITGAPGN VERAREEIET HIAVRTGKIL 300 
EYNNENDFLA GSPDAAIDSR YSDAWRVHQP GCKPLSTFRQ NSLGCIGECG VDSGFEAPRL 360 
GEQGGDFGYG GYLFPGYGVG KQDVYYGVAE TSPPLWAGQE NATPTSVLFS SASSSSSSSA 420 
KARAGPPGAH RSPATSAGPE LAGLPRRPPG EPLQGFSKLG GGGLRSPGGG RDCMVCFESE 480 
VTAALVPCGH NLFCMECAVR ICERTDPECP VCHITATQAI RIFS 

Seq ID NO: 27 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted 

Coding sequence: 1-2070 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGAGCGGTG CGGGGGTGGC GGCTGGGACG CGGCCCCCCA GCTCGCCGAC CCCGGGCTCT 60 
CGGCGCCGGC GCCAGCGCCC CTCTGTGGGC GTCCAGTCCT TGAGGCCGCA GAGCCCGCAG 120 
CTCAGGCAGA GCGACCCGCA GAAACGGAAC CTGGACCTGG AGAAAAGCCT GCAGTTCCTG 180 
CAGCAGCAGC ACTCGGAGAT GCTGGCCAAG CTCCATG AGG AGATCG AGCA TCTGAAGCGG 240 
GAAAACAAGG GTGAGCCGGC GCGGGGCCCT AGGCCGGCCC TGCCTCCCCA GGCACACTCA 300 
ACACTGCCGC TCCCGCAGCA CAGAAACACA GCCATCAACT CCAGCACACG CCTGGGCTCA 360 
GGGGGAACAC AGGACGGGGA GCCCCTCCAG ACTGTCCTTG CCCACCTGGC TGCACTGGCC 420 
CCTGTATGCC AACCCAGTGG GTACAGGTTC TGGGGG ACCT GGACAGATGC CGCTACCTCT 480 
AGCCGTGGCT GGACGATGTT ATGCAGCCAA GCACAGCACG TGCTGCTCTC GGGAAGCCCA 540 
GGGCCTGAGG TCATTGCAGG GCGGCAGGTG GCCACAGGGT GCTCCCCAGA CCTCCCTCCT 600 
CCAAGTAGAG CTGAAATGGG AAGGAACCCC TGGGACAGCC CCTGCCCTGC TAGATCTTTG 660 
CCTCAGATTG CTGCTGTGGC CAGGCCCAGG ATTTCCAGCC CTATGGCTCT GAGTCCTCAC 720 
ATGCTGGGGG CCCAGGGGAT ATGGACACAC TCCATCCAGG GATCCCTTCC TGCCATCTGG 780 
GCAGCAACCA TGGGG ACAAA GGGAGGAAGC AGAGTCCTGT TTCCTTGCCA CTTGTCCAAG 840 
GCACTTCCCC ATCCTGACAG CGGCCCCCAC CCAGCCCAGG ATCCTGGGCT GTGGTCTCAA 900 
GCTCACTTCC CATTATCTTT GGGGCTGGGG CTGACATCAG G AGG ACATCT G ACTGGTGGA 960 
TGGAGCCAGC CTGGGAACAT CGCAGCTGGG GCAGTGCCTA GGGCTCTCCC TTCCCAGGGA 1020 
GACATGGAGA AGGGGGTTGA GGGAGGGCCC TTCCCTAGCC GCTGTGGCAA CTCCAGTGAG 1080 
CTGTTCTGGG CAAAGTGTGG CCCAAGTCGG CAGCCCCAGC CCTGCAGTGC TGGGGACGCT 1140 
GACAGGACAC GGGAAGAGGC CATGCTTTCC CTCGGGACCT GCTGTTCCAT GTGTCCCAAG 1200 
CCCTCCTGCT TTCCAGATGG CCCCTCAGGA AACCACCTTT CCAGGGCCTC TGCTCCCTTG 1260 
GGCGCTCGCT GGGTCTGCAT CAACGGAGTG TGGGTAGAGC CGGGAGGACC CAGCCCTGCC 1320 
AGGCTGAAGG AGGGCTCCTC ACGGACACAC AGGCCAGGAG GCAAGCGTGG GCGTCTTGCG 1380 
GGCGGTAGCG CCGACACTGT GCGCTCTCCT GCAGACAGCC TCTCCATGTC AAGCTTCCAG 1440 
TCTGTCAAGT CCATCTCTAA TTCAGCCAAC TCTCAAGGCA AGGCCAGGCC CCAGCCCGGC 1500 
TCCTTCAACA AGCAAGATTC AAAAGCTGAC GTCTCCCAGA AGGCGGACCT GGAAGAGGAG 1560 
CCCCTACTTC ACAACAGCAA GCTGG ACAAA GTTCCTGGGG TACAAGGGCA GGCCAGAAAG 1620 
GAGAAAGCAG AGGCCTCTAA TGCAGGAGCT GCCTGTATGG GGAACAGCCA GCACCAGGGC 1680 
AGGCAGATGG GGGCGGGGGC ACACCCCCCA ATGATCCTGC CCCTTCCCCT GCGAAAGCCC 1740 
ACCACACTTA GGCAGTGCGA AGTGCTCATC CGCGAGCTGT GGAATACCAA CCTCCTGCAG 1800 
ACCCAAGAGC TGCGGCACCT CAAGTCCCTC CTGGAAGGGA GCCAGAGGCC CCAGGCAGCC 1860 
CCGGAGGAAG CTAGCTTTCC CAGGGACCAA GAAGCCACGC ATTTCCCCAA GGTCTCCACC 1920 
AAGAGCCTCT CCAAGAAATG CCTGAGCCCA CCTGTGGCGG AGCGTGCCAT CCTGCCCGCA 1980 
CTGAAGCAGA CCCCGAAGAA CAACTTTGCC GAGAGGCAGA AGAGGCTGCA GGCAATGCAG 2040 
AAACGGCGCC TGCATCGCTC AGTGCTTTGA 

Seq ID NO: 28 Protein sequence: 

Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

MSGAGVAAGT RPPSSPTPGS RRRRQRPSVG VQSLRPQSPQ LRQSDPQKRN LDLEKSLQFL 60 
QQQHSEMLAK LHEEIEHLKR ENKGEPARGP RPALPPQAHS TLPLPQHRNT AINSSTRLGS 120 
GGTQDGEPLQ TVLAHLAALA PVCQPSGYRF WGTWTDAATS SRGWTMLCSQ AQHVLLSGSP 180 
GPEVIAGRQV ATGCSPDLPP PSRAEMGRNP WDSPCPARSL PQIAAVARPR ISSPMALSPH 240 
MLGAQGIWTH SIQGSLPAIW AATMGTKGGS RVLFPCHLSK ALPHPDSGPH PAQDPGLWSQ 300 
AHFPLSLGLG LTSGGHLTGG WSQPGNIAAG AVPRALPSQG DMEKGVEGGP FPSRCGNSSE 360 
LFWAKCGPSR QPQPCSAGDA DRTREEAMLS LGTCCSMCPK PSCFPDGPSG NHLSRASAPL 420 
GARWVCINGV WVEPGGPSPA RLKEGSSRTH RPGGKRGRLA GGSADTVRSP ADSLSMSSFQ 480 
SVKSISNSAN SQGKARPQPG SFNKQDSKAD VSQKADLEEE PLLHNSKLDK VPGVQGQARK 540 
EKAEASNAGA ACMGNSQHQG RQMGAGAHPP MILPLPLRKP TTLRQCEVLI RELWNTNLLQ 600 
TQELRHLKSL LEGSQRPQAA PEEASFPRDQ EATHFPKVST KSLSKKCLSP PVAERAILPA 660 
LKQTPKNNFA ERQKRLQAMQ KRRLHRSVL 

Seq ID NO: 29 DNA sequence 

Nucleic Acid Accession #: NM_012319.2 

Coding sequence: 138-2405 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

CTCGTGCCGA ATTCGGCACG AGACCGCGTG TTCGCGCCTG GTAGAGATTT CTCGAAGACA 60 
CCAGTGGGCC CGTGTGGAAC CAAACCTGCG CGCGTGGCCG GGCCGTGGGA CAACGAGGCC 120 
GCGGAGACGA AGGCGCAATG GCGAGGAAGT TATCTGTAAT CTTGATCCTG ACCTTTGCCC 180 
TCTCTGTCAC AAATCCCCTT CATGAACTAA AAGCAGCTGC TTTCCCCCAG ACCACTGAGA 240 
AAATTAGTCC GAATTGGGAA TCTGGCATTA ATGTTGACTT GGCAATTTCC ACACGGCAAT 300 
ATCATCTACA ACAGCTTTTC TACCGCTATG GAGAAAATAA TTCTTTGTCA GTTGAAGGGT 360 
TCAGAAAATT ACTTCAAAAT ATAGGCATAG ATAAGATTAA AAGAATCCAT ATACACCATG 420 
ACCACGACCA TCACTCAGAC CACGAGCATC ACTCAG ACCA TG AGCGTCAC TCAGACCATG 480 
AGCATCACTC AGACCACGAG CATCACTCTG ACCATGATCA TCACTCTCAC CATAATCATG 540 
CTGCTTCTGG TAAAAATAAG CGAAAAGCTC TTTGCCCAGA CCATGACTCA GATAGTTCAG 600 
GTAAAGATCC TAGAAACAGC CAGGGGAAAG GAGCTCACCG ACCAGAACAT GCCAGTGGTA 660 
GAAGG AATGT CAAGGACAGT GTTAGTGCTA GTG AAGTGAC CTCAACTGTG TACAACACTG 720 
TCTCTG AAGG AACTCACTTT CTAGAGACAA TAGAGACTCC AAGACCTGG A AAACTCTTCC 780 
CCAAAGATGT AAGCAGCTGC ACTCCACCCA GTGTCACATC AAAGAG CCGG GTGAGCCGGC 840 
TGGCTGGTAG G AAAACA AAT GAATCTGTGA GTG AGCCCCG AAAAGGCTTT ATGTATTCCA 900 
GAAACACAAA TG AAAATCCT CAGG AGTGTT TCAATGCATC AAAGCTACTG ACATCTCATG 960 
GCATGGGCAT CCAGGTTCCG CTGAATGCAA CAGAGTTCAA CTATCTCTGT CCAGCCATCA 1020 
TCAACCAAAT TGATGCTAGA TCTTGTCTGA TTCATACAAG TGAAAAGAAG GCTGAAATCC 1080 
CTCCAAAGAC CTATTCATTA CAAATAGCCT GGGTTGGTGG TTTTATAGCC ATTTCCATCA 1140 
TCAGTTTCCT GTCTCTGCTG GGGGTTATCT TAGTGCCTCT CATGAATCGG GTGTTTTTCA 1200 
AATTTCTCCT GAGTTTCCTT GTGGCACTGG CCGTTGGGAC TTTGAGTGGT GATGCTTTTT 1260 
TACACCTTCT TCCACATTCT CATGCAAGTC ACCACCATAG TCATAGCCAT GAAGAACCAG 1320 
CAATGGAAAT GAAAAGAGGA CCACTTTTCA GTCATCTGTC TTCTCAAAAC ATAGAAGAAA 1380 
GTGCCTATTT TGATTCCACG TGGAAGGGTC TAACAGCTCT AGGAGGCCTG TATTTCATGT 1440 
TTCTTGTTGA ACATGTCCTC ACATTGATCA AACAATTTAA AGATAAGAAG AAAAAGAATC 1500 
AGAAGAAACC TGAAAATGAT GATGATGTGG AGATTAAGAA GCAGTTGTCC AAGTATGAAT 1560 
CTCAACTTTC AACAAATGAG GAGAAAGTAG ATACAGATGA TCGAACTGAA GGCTATTTAC 1620 
GAGCAGACTC ACAAGAGCCC TCCCACTTTG ATTCTCAGCA GCCTGCAGTC TTGGAAGAAG 1680 
AAGAGGTCAT GATAGCTCAT GCTCATCCAC AGGAAGTCTA CAATGAATAT GTACCCAGAG 1740 
GGTGCAAGAA TAAATGCCAT TCACATTTCC ACGATACACT CGGCCAGTCA GACGATCTCA 1800 
TTCACCACCA TCATGACTAC CATCATATTC TCCATCATCA CCACCACCAA AACCACCATC 1860 
CTCACAGTCA CAGCCAGCGC TACTCTCGGG AGGAGCTGAA AGATGCCGGC GTCGCCACTT 1920 
TGGCCTGGAT GGTGATAATG GGTGATGGCC TGCACAATTT CAGCGATGGC CTAGCAATTG 1980 
GTGCTGCTTT TACTG AAGGC TTATCAAGTG GTTTAAGTAC TTCTGTTGCT GTGTTCTGTC 2040 
ATGAGTTGCC TCATG AATTA GGTGACTTTG CTGTTCTACT AAAGGCTGGC ATGACCGTTA 2100 
AGCAGGCTGT CCTTTATAAT GCATTGTCAG CCATGCTGGC GTATCTTGGA ATGGCAACAG 2160 
GAATTTTCAT TGGTCATTAT GCTGAAAATG TTTCTATGTG GATATTTGCA CTTACTGCTG 2220 
GCTTATTCAT GTATGTTGCT CTGGTTGATA TGGTACCTGA AATGCTGCAC AAT GATGCT A 2280 
GTGACCATGG ATGTAGCCGC TGGGGGTATT TCTTTTTACA GAATGCTGGG ATGCTTTTGG 2340 
GTTTTGGAAT TATGTTACTT ATTTCCATAT TTGAACATAA AATCGTGTTT CGTATAAATT 2400 
TCTAGTTAAG GTTTAAATGC TAGAGTAGCT TAAAAAGTTG TCATAGTTTC AGTAGGTCAT 2460 
AGGGAGATG A GTTTGTATGC TGTACTATGC AGCGTTTAAA GTTAGTGGGT TTTGTG ATTT 2520 
TTGTATTG AA TATTGCTGTC TGTTACAAAG TCAGTTAAAG GTACGTTTTA ATATTTAAGT 2580 
TATTCTATCT TGGAGATAAA ATCTGTATGT GCAATTCACC GGTATTACCA GTTTA TTATG 2640 
TAAACAAG AG ATTTGGC ATG ACATGTTCTG TATGTTTCAG GGAAAAATGT CTTTAATGCT 2700 
TTTTCAAGAA CTAACACAGT TATTCCTATA CTGGATTTTA GGTCTCTGAA GAACTGCTGG 2760 
TGTTTAGGAA TAAGAATGTG CATGAAGCCT AAAATACCAA GAAAGCTTAT ACTGAATTTA 2820 
AGCAAAG AAA TAAAGGAGAA AAGAGAAGAA TCTG AGAATT GGGGAGGCAT AGATTCTTAT 2880 
AAA AATCACA AAATTTGTTG TAAATTAGAG GGG AGAAATT TAGAATTAAG TATAAAAAGG 2940 
CAGAATTAGT ATAGAGTACA TTCATTAAAC ATTTTTGTCA GGATTATTTC CCGTAAAAAC 3000 
GTAGTG AGCA CTCTCATATA CTAATTAGTG TACATTTAAC TTTGTATAAT ACAG AAATCT 3060 
AAATATATTT AATGAATTCA AGCAATATAC ACTTGACCAA GAAATTGGAA TTTCAAAATG 3120 
TTCGTGCGGG TTATATACCA GATGAGTACA GTG AGTAGTT T ATGTATC AC CAGACTGGGT 3180 
TATTGCCAAG TTATATATCA CCAAAAGCTG TATGACTGGA TGTTCTGGTT ACCTGGTTTA 3240 
CAAAATTATC AGAGTAGTAA AACTTTG ATA TATATGAGG A TATTAAAACT ACACTAAGTA 3300 
TCATTTGATT CGATTCAGAA AGT ACTTTGA TATCTCTCAG TGCTTCAGTG CTATCATTGT 3360 
GAGCAATTGT CTTTATATAC GGTACTGTAG CCATACTAGG CCTGTCTGTG GCATTCTCTA 3420 
GATGTTTCTT TTTTACACA A TAAATTCCTT ATATCAGCTT G 

Seq ID NO: 30 Protein sequence: 
Protein Accession #: NP_036451.2 

1 11 21 31 41 51 
I I I I I I 

MARKLSVILI LTFALSVTNP LHELKAAAFP QTTEKISPNW ESGINVDLAI STRQYHLQQL 60 
FYRYGENNSL SVEGFRKLLQ NIGIDKIKRI HIHHDHDHHS DHEHHSDHER HSDHEHHSDH 120 
EHHSDHDHHS HHNHAASGKN KRKALCPDHD SDSSGKDPRN SQGKGAHRPE HASGRRNVKD 180 
SVSASEVTST VYNTVSEGTH FLETIETPRP GKLFPKDVSS STPPSVTSKS RVSRLAGRKT 240 
NESVSEPRKG FMYSRNTNEN PQECFNASKL LTSHGMGIQV PLNATEFNYL CPAIINQIDA 300 
RSCLIHTSEK KAEIPPKTYS LQIAWVGGFI AISIISFLSL LGVILVPLMN RVFFKFLLSF 360 
LVALAVGTLS GDAFLHLLPH SHASHHHSHS HEEPAMEMKR GPLFSHLSSQ NIEESAYFDS 420 
TWKGLTALGG LYFMFLVEHV LTLIKQFKDK KKKNQKKPEN DDDVEIKKQL SKYESQLSTN 480 
EEKVDTDDRT EGYLRADSQE PSHFDSQQPA VLEEEEVMIA HAHPQEVYNE YVPRGCKNKC 540 
HSHFHDTLGQ SDDLIHHHHD YHHILHHHHH QNHHPHSHSQ RYSREELKDA GVATLAWMVI 600 
MGDGLHNFSD GLAIGAAFTE GLSSGLSTSV AVFCHELPHE LGDFAVLLKA GMTVKQAVLY 660 
NALSAMLAYL GMATGIFIGH YAENVSMWIF ALTAGLFMYV ALVDMVPEML HNDASDHGCS 720 
RWGYFFLQNA GMLLGFGIML LISIFEHKIV FRINF 

Seq ID NO: 31 DNA sequence 

Nucleic Acid Accession #: NM_002184.1 

Coding sequence: 256-3012 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAGCAGCCAA AAGGCCCGCG GAGTCGCGCT GGGCCGCCCC GGCGCAGCTG AACCGGGGGC 60 
CGCGCCTGCC AGGCCGACGG GTCTGGCCCA GCCTGGCGCC AAGGGGTTCG TGCGCTGTGG 120 
AGACGCGGAG GGTCGAGGCG GCGCGGCCTG AGTGAAACCC AATGGAAAAA GCATGACATT 180 
TAGAAGTAGA AGACTTAGCT TCAAATCCCT ACTCCTTCAC TTACTAATTT TGTGA TTTGG 240 
AAATATCCGC GCAAGATGTT GACGTTGCAG ACTTGGGTAG TGCAAGCCTT GTTTATTTTC 300 
CTCACCACTG AATCTACAGG TGAACTTCTA GATCCATGTG GTTATATCAG TCCTGAATCT 360 
CCAGTTGTAC AACTTCATTC TAATTTCACT GCAGTTTGTG TGCTAAAGGA AAAATGTATG 420 
G ATTATTTTC ATGTAAATGC TA ATT AC ATT GTCTGGAAAA CAAACCATTT TACTATTCCT 480 
AAGGAGCAAT ATACTATCAT AAACAGAACA GCATCCAGTG TCACCTTTAC AGATATAGCT 540 
TCATTAAATA TTCAGCTCAC TTGCAACATT CTTACATTCG GACAGCTTG A ACAGAATGTT 600 
TATGG AATCA CAATAATTTC AGGCTTGCCT CCAGAAAAAC CTAAAAATTT GAGTTGCATT 660 
GTGA ACG AGG GGAAG AAAAT GAGGTGTGAG TGGG ATGGTG GAAGGGAAAC ACACTTGGAG 720 
ACAAACTTCA CTTTAAA ATC TG AATGGGCA ACACACAAGT TTGCTG ATTG CAAAGCAAAA 780 
CGTGACACCC CCACCTCATG CACTGTTGAT TATTCTACTG TGTATTTTGT CAACATTGAA 840 
GTCTGGGTAG AAGCAGAGAA TGCCCTTGGG AAGGTTACAT CAGATCATAT CAATTTTGAT 900 
CCTGTATATA AAGTGAAGCC CAATCCGCCA CATAATTTAT CAGTGATCAA CTCAGAGGAA 960 
CTGTCTAGTA TCTTAAAATT GACATGGACC AACCCAAGTA TTAAGAGTGT TATAATACTA 1020 
AAATATAACA TTCAATAT AG GACCAAAGAT GCCTCAACTT GGAGCCAGAT TCCTCCTGAA 1080 
GACACAGCAT CCACCCGATC TTCATTCACT GTCCAAGACC TTAAACCTTT TACAGAATAT 1140 
GTGTTTAGGA TTCGCTGTAT GAAGGAAGAT GGTAAGGGAT ACTGGAGTGA CTGGAGTGAA 1200 
GAAGCAAGTG GGATCACCTA TGAAGATAGA CCATCTAAAG CACCAAGTTT CTGGTATAAA 1260 
ATAGATCCAT CCCATACTCA AGGCTACAGA ACTGTACAAC TCGTGTGGAA GACATTGCCT 1320 
CCTTTTGAAG CCAATGGAAA AATCTTGGAT TATGAAGTGA CTCTCACAAG ATGGAAATCA 1380 
CATTTACAAA ATTACACAGT TAATGCCACA AAACTGACAG TAAATCTCAC AAATGATCGC 1440 
TATCTAGCAA CCCTAACAGT AAGAAATCTT GTTGGCAAAT CAGATGCAGC TGTTTTAACT 1500 
ATCCCTGCCT GTGACTTTCA AGCTACTCAC CCTGTAATGG ATCTTAAAGC ATTCCCCAAA 1560 
GATAACATGC TTTGGGTGGA ATGGACTACT CCAAGGGAAT CTGTAAAGAA ATATATACTT 1620 
GAGTGGTGTG TGTTATCAGA TAAAGCACCC TGTATCACAG ACTGGCAACA AGAAGATGGT 1680 
ACCGTGCATC GCACCTATTT AAGAGGGAAC TTAGCAGAGA GCAAATGCTA TTTGATAACA 1740 
GTTACTCCAG TATATGCTGA TGGACCAGGA AGCCCTGAAT CCATAAAGGC ATACCTTAAA 1800 
CAAGCTCCAC CTTCCAAAGG ACCTACTGTT CGGACAAAAA AAGTAGGGAA AAACGAAGCT 1860 
GTCTTAGAGT GGGACCAACT TCCTGTTGAT GTTCAGAATG GATTTATCAG AAATTATACT 1920 
ATATTTTATA GAACCATCAT TGGAAATGAA ACTGCTGTGA ATGTGGATTC TTCCCACACA 1980 
GAATATACAT TGTCCTCTTT GACTAGTG AC ACATTGTACA TGGTACG AAT GGCAGCATAC 2040 
ACAGATGAAG GTGGGAAGG A TGGTCCAGA A TTCACTTTTA CTACCCCAAA GTTTGCTCAA 2100 
GGAGAAATTG AAGCCATAGT CGTGCCTGTT TGCTTAGCAT TCCTATTGAC AACTCTTCTG 2160 
GG AGTGCTGT TCTGCTTTAA TAAGCG AG AC CTAATTAAAA AACACATCTG GCCTAATGTT 2220 
CCAGATCCTT CAAAGAGTCA TATTGCCCAG TGGTCACCTC ACACTCCTCC AAGGCACAAT 2280 
TTTAATTCAA AAGATCAAAT GTATTCAGAT GGCAATTTCA CTGATGTAAG TGTTGTGGAA 2340 
ATAGAAGCAA ATGACAAAAA GCCTTTTCCA GAAGATCTGA AATCATTGGA CCTGTTCAAA 2400 
AAGGAAAAAA TTAATACTGA AGGACACAGC AGTGGTATTG GGGGGTCTTC ATGCATGTCA 2460 
TCTTCTAGGC CAAGCATTTC TAGCAGTG AT GAAAATG AAT CTTCACAAAA CACTTCGAGC 2520 
ACTGTCCAGT ATTCTACCGT GGTACACAGT GGCTACAGAC ACCAAGTTCC GTCAGTCCAA 2580 
GTCTTCTCAA GATCCGAGTC TACCCAGCCC TTGTTAGATT CAGAGGAGCG GCCAGAAGAT 2640 
CTACAATTAG TAGATCATGT AGATGGCGGT GATGGTATTT TGCCCAGGCA ACAGTACTTC 2700 
AAACAG AACT GCAGTCAGCA TGA ATCCAGT CCAGATATTT CACATTTTGA AAGGTCAAAG 2760 
CAAGTTTCAT CAGTCAATGA GG AAGATTTT GTTAGACTTA AACAGCAG AT TTCAG ATCAT 2820 
ATTTCACAAT CCTGTGG ATC TGGGCAAATG AAAATGTTTC AGGAAGTTTC TGCAGCAGAT 2880 
GCTTTTGGTC CAGGTACTGA GGGACA AGTA GAAAGATTTG AAACAGTTGG CATGGAGGCT 2940 
GCGACTGATG AAGGCATGCC TAAAAGTTAC TTACCACAGA CTGTACGGCA AGGCGGCTAC 3000 
ATGCCTCAGT GAAGGACTAG TAGTTCCTGC TACAACTTCA GCAGTACCTA TAAAGTAAAG 3060 
CTAAAATGAT TTTATCTGTG AATTC 

Seq ID NO: 32 Protein sequence: 
Protein Accession #: NP_002175.1 

1 11 21 31 41 51 
I I I I I I 

MLTLQTWVVQ ALFIFLTTES TGELLDPCGY ISPESPVVQL HSNFTAVCVL KEKCMDYFHV 60 
NANYIVWKTN HFTIPKEQYT IINRTASSVT FTDIASLNIQ LTCNILTFGQ LEQNVYGITI 120 
ISGLPPEKPK NLSCIVNEGK KMRCEWDGGR ETHLETNFTL KSEWATHKFA DCKAKRDTPT 180 
SCTVDYSTVY FVNIEVWVEA ENALGKVTSD HINFDPVYKV KPNPPHNLSV INSEELSSIL 240 
KLTWTNPSIK SVIILKYNIQ YRTKDASTWS QIPPEDTAST RSSFTVQDLK PFTEYVFRIR 300 
CMKEDGKGYW SDWSEEASGI TYEDRPSKAP SFWYKIDPSH TQGYRTVQLV WKTLPPFEAN 360 
GKILDYEVTL TRWKSHLQNY TVNATKLTVN LTNDRYLATL TVRNLVGKSD AAVLTIPACD 420 
FQATHPVMDL KAFPKDNMLW VEWTTPRESV KKYILEWCVL SDKAPCITDW QQEDGTVHRT 480 
YLRGNLAESK CYLITVTPVY ADGPGSPESI KAYLKQAPPS KGPTVRTKKV GKNEAVLEWD 540 
QLPVDVQNGF IRNYTIFYRT IIGNETAVNV DSSHTEYTLS SLTSDTLYMV RMAAYTDEGG 600 
KDGPEFTFTT PKFAQGEIEA IVVPVCLAFL LTTLLGVLFC FNKRDLIKKH IWPNVPDPSK 660 
SHIAQWSPHT PPRHNFNSKD QMYSDGNFTD VSVVEIEAND KKPFPEDLKS LDLFKKEKIN 720 
TEGHSSGIGG SSCMSSSRPS ISSSDENESS QNTSSTVQYS TVVHSGYRHQ VPSVQVFSRS 780 
ESTQPLLDSE ERPEDLQLVD HVDGGDGILP RQQYFKQNCS QHESSPDISH FERSKQVSSV 840 
NEEDFVRLKQ QISDHISQSC GSGQMKMFQE VSAADAFGPG TEGQVERFET VGMEAATDEG 900 
MPKSYLPQTV RQGGYMPQ 

Seq ID NO: 33 DNA sequence 

Nucleic Acid Accession #: NM_018255.1 

Coding sequence: 11-2491 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

AGTTGGCGAC ATGGTGGCAC CCGTGCTGGA GACTTCTCAC GTGTTTTGCT GCCCAAACCG 60 
GGTGCGGGGA GTCCTGAACT GGAGCTCTGG GCCCAGAGGA CTTCTGGCCT TTGGCACGTC 120 
CTGCTCCGTG GTOCTCTATG ACCCCCTGAA AAGGGTTGTT GTTACCAACT TGAATGGTCA 180 
CACCGCCCGA GTCAATTGCA TACAGTGGAT TTGTAAACAG GATGGCTCCC CTTCTACTGA 240 
ATTAGTTTCT GGAGGATCTG ATAATCAAGT GATTCACTGG GAAATAGAGG ATAATCAGCT 300 
TTTAAAAGCA GTGCATCTTC AAGGCCATGA AGGACCTGTT TATGCGGTGC ATGCTGTTTA 360 
CCAGAGGAGG ACATCAGATC CTGCATTATG TACACTGATC GTTTCTGCAG CTGCAGATTC 420 
TGCTGTTCGA CTCTGGTCTA AAAAGGGTCC AGAAGTAATG TGCCTTCAGA CTTTAAACTT 480 
TGGAAATGGA TTTGCTTTGG CTCTCTGCTT ATCTTTTTTG CCAAATACTG ATGTACCAAT 540 
ATTAGCATGT GGCAATGATG ATTGCAGAAT TCACATATTT GCTCAACAAA ATGATCAGTT 600 
TCAGA AAGTG CTTTCTCTCT GTGGACATGA GGATTGGATT AGAGGAGTGG AATGGGCAGC 660 
CTTTGGTAGA GATCTTTTCC TAGCAAGCTG TTCACAAGAT TGCCTGATAA GAATATGGAA 720 
GCTGTATATA AAGTCAACAT CTTTAGAAAC TCAGGATGAC G ATAACATA A GACTGAAAG A 780 
AAATACTTTT ACCATAGA A A ATG AAAGTGT TAAAATAGCA TTTGCTGTTA CTCTGGAGAC 840 
AGTGCTAGCC GGTCATGAAA ACTGGGTAAA TGCAGTTCAC TGGCAACCTG TGTTTTACAA 900 
AG ATGGTGTC CTACAGCAGC CAGTGAGATT ATTATCTGCT TCCATGGATA AAACCATGAT 960 
TCTCTGGGCT CCAGATGAAG AGTCAGGAGT TTGGCTAGAA CAGGTTCGAG TAGGTGAAGT 1020 
AGGTGGGAAT ACTTTGGGAT TTTATGATTG CCAGTTCAAT GAAGATGGCT CCATGATCAT 1080 
TGCTCATGCT TTCCACGGAG CGTTGCACCT TTGGAAACAG AATACAGTTA ACCCAAGAGA 1140 
GTGGACTCCA GAGATTGTCA TTTCAGGACA CTTTGATGGT GTCCAAGACC TAGTCTGGGA 1200 
TCCAG AAGGA GAATTTATTA TCACTGTTGG TACTGATCAG ACAACTAG AC TTTTTGCTCC 1260 
ATGGAAGAGA AAAGACCAAT CACAGGTGAC TTGGCATGAA ATTGCAAGGC CTCAGATACA 1320 
TGGGTATGAC CTGAAATGTT TGGCAATGAT TAATCGGTTT CAGTTTGTAT CTGGAGCAGA 1380 
TGAAAAAGTT CTTCGGGTTT TTTCTGCACC TCGGAATTTT GTGGAAAATT TTTGTGCCAT 1440 
TACAGGACAA TCACTGAATC ATGTGCTCTG TAATCAAGAT AGTGATCTTC CAGAAGGAGC 1500 
CACTGTCCCT GCATTGGGAT TATCAAATAA AGCTGTCTTT CAGGGAGATA TAGCTTCTCA 1560 
GCCTTCTGAT GAAGAGGAGC TGTTAACTAG TACTGGTTTT GAGTATCAGC AGGTGGCCTT 1620 
TCAGCCCTCC ATACTTACTG AGCCTCCCAC TGAGGATCAT CTTCTGCAGA ATACTTTGTG 1680 
GCCTGAAGTT CAAAAACTAT ATGGGCACGG TTATGAAATA TTTTGTGTTA CTTGTAACAG 1740 
TTCAAAGACT CTGCTTGCCT CAGCTTGTAA GGCAGCTAAG AAAGAGCATG CAGCTATCAT 1800 
TCTTTGGAAC ACTACATCTT GGAAACAGGT GCAGAATTTA GTTTTCCACA GTTTGACAGT 1860 
CACGCAGATG GCCTTCTCAC CTAATGAGAA GTTCTTACTA GCTGTTTCCA GAGATCGAAC 1920 
CTGGTCATTG TGGAAAAAGC AGGATACAAT CTCACCTGAG TTCGAGCCAG TTTTTAGTCT 1980 
TTTTGCCTTC ACCAACAAAA TTACTTCTGT GCACAGTAGA ATTATTTGGT CTTGTGATTG 2040 
GAGTCCTGAC AGCAAGTATT TCTTCACTGG GAGTCGAGAC AAAAAGGTGG TTGTCTGGGG 2100 
TGAGTGCGAC TCCACTGATG ACTGTATTGA GCACAACATT GGCCCCTGCT CCTCAGTCCT 2160 
GGACGTGGGT GGGGCTGTGA CAGCTGTCAG CGTCTGCCCA GTGCTCCACC CTTCTCAACG 2220 
ATACGTGGTT GCAGTAGGAT TGG AGTGTGG AAAGATTTGC TTATATACCT GG AAAA AG AC 2280 
TGATCAAGTT CCAGAAATAA ATGACTGGAC CCACTGTGTA GAAACAAGTC AAAGCCAAAG 2340 
TCATACACTG GCTATCAGAA AATTATGCTG GAAGAATTGC AGTGGAAAAA CTGAACAGAA 2400 
GG AAGCAGAA GGTGCTG AGT GGTTACACTT TGCAAGCTGT GGTGAAGATC ACACTGTGAA 2460 
GATACACAGA GTCAATAAAT GTGCACTGTA ATGG 

Seq ID NO: 34 Protein sequence: 
Protein Accession #: NP_060725. 1 

1 11 21 31 41 51 
I I I I I I 

MVAPVLETSH VFCCPNRVRG VLNWSSGPRG LLAFGTSCSV VLYDPLKRVV VTNLNGHTAR 60 
VNCIQWICKQ DGSPSTELVS GGSDNQVIHW EIEDNQLLKA VHLQGHEGPV YAVHAVYQRR 120 
TSDPALCTLI VSAAADSAVR LWSKKGPEVM CLQTLNFGNG FALALCLSFL PNTDVPILAC 180 
GNDDCRIHIF AQQNDQFQKV LSLCGHEDWI RGVEWAAFGR DLFLASCSQD CLIRIWKLYI 240 
KSTSLETQDD DNIRLKENTF TIENESVKIA FAVTLETVLA GHENWVNAVH WQPVFYKDGV 300 
LQQPVRLLSA SMDKTMILWA PDEESGVWLE QVRVGEVGGN TLGFYDCQFN EDGSMIIAHA 360 
FHGALHLWKQ NTVNPREWTP EIVISGHFDG VQDLVWDPEG EFIITVGTDQ TTRLFAPWKR 420 
KDQSQVTWHE IARPQIHGYD LKCLAMINRF QFVSGADEKV LRVFSAPRNF VENFCAITGQ 480 
SLNHVLCNQD SDLPEGATVP ALGLSNKAVF QGDIASQPSD EEELLTSTGF EYQQVAFQPS 540 
ILTEPPTEDH LLQNTLWPEV QKLYGHGYEI FCVTCNSSKT LLASACKAAK KEHAAIILWN 600 
TTSWKQVQNL VFHSLTVTQM AFSPNEKFLL AVSRDRTWSL WKKQDTISPE FEPVFSLFAF 660 
TNKITSVHSR IIWSCDWSPD SKYFFTGSRD KKVVVWGECD STDDCIEHNI GPCSSVLDVG 720 
GAVTAVSVCP VLHPSQRYVV AVGLECGKIC LYTWKKTDQV PEINDWTHCV ETSQSQSHTL 780 
AIRKLCWKNC SGKTEQKEAE GAEWLHFASC GEDHTVKIHR VNKCAL 

Seq ID NO: 35 DNA sequence 

Nucleic Acid Accession #: NM_022131 

Coding sequence: 11-2878 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

TGCTGCGAGG ATGCTGCCTG GGCGGCTGTG CTGGGTGCCG CTCCTGCTGG CGCTGGGCGT 60 
GGGGAGCGGC AGCGGCGGTG GCGGGGACAG CCGGCAGCGC CGCCTCCTCG CGGCTAAAGT 120 
CAATAAGCAC AAGCCATGGA TCGAGACTTC ATATCATGGA GTCATAACTG AGAACAATGA 180 
CACAGTCATT TTGGACCCAC CACTGGTAGC CCTGGATAAA GATGCACCGG TTCCTTTTGC 240 
AGGGGAAATC TGTGCGTTCA AGATCCATGG CCAGGAGCTG CCCTTTGAGG CTGTGGTGCT 300 
CAACAAGACA TCAGGAG AGG GCCGGCTCCG TGCCAAGAGC CCCATTGACT GTGAGTTGCA 360 
G AAGGAGTAC ACATTCATCA TCCAGGCCTA TGACTGTGGT GCTGGGCCCC ACGAGACAGC 420 
CTGGAAAAAG TCACACAAGG CCGTGGTCCA TATACAGGTG AAGGATGTCA ACGAGTTTGC 480 
TCCCACCTTC AAAGAGCCAG CCTACAAGGC TGTTGTG ACG GAGGGCAAGA TCTATGACAG 540 
CATTCTGCAG GTGGAGGCCA TTGACGAGGA CTGCTCCCCA CAGTACAGCC AGATCTGCAA 600 
CTATG AAATC GTCACCACAG ATGTGCCTTT TGCCATCGAC AGAAATGGCA ACATCAGGAA 660 
CACTGAGAAG CTG AG CTATG ACAAACAACA CCAGTATGAG ATCCTGGTGA CCGCCTACGA 720 
CTGTGGACAG AAGCCCGCTG CTCAGGACAC CCTGGTGCAG GTGGATGTGA AGCCAGTTTG 780 
CAAGCCTGGC TGGCAAGACT GGACCAAG AG GATTGAGTAC CAGCCTGGCT CCGGGAGCAT 840 
GCCCCTGTTC CCCAGCATCC ACCTGG AGAC GTGCG ATGGA GCCGTGTCTT CCCTCCAGAT 900 
CGTCACAGAG CTGCAGACTA ATTACATTGG GAAGGGTTGT GACCGGG AGA CCTACTCTGA 960 
GAAATCCCTT CAGAAGTTAT GTGGAGCCTC CTCTGGCATC ATTGACCTCT TGCCATCCCC 1020 
TAGCGCTGCC ACCAACTGGA CTG C AGG ACT GCTGGTGGAC AGCAGTG AGA TGATCTTCAA 1080 
GTTTGACGGC AGGCAGGGTG CCAAAATCCC CGATGGGATT GTGCCCAAGA ACCTGACCGA 1140 
TCAGTTCACC ATCACCATGT GGATGAAACA CGGCCCCAGC CCTGGTGTGA GAGCCGAGAA 1200 
GGAAACCATC CTCTGCAACT CAGACAAAAC CGAAATGAAC CGGCATCACT ATGCCCTGTA 1260 
TGTGCACAAC TGCCGCCTCG TCTTTCTCTT GCGGAAGGAC TTCGACCAGG CTGACACCTT 1320 
TCGCCCCGCG GAGTTCCACT GGAAGCTGGA TCAGATTTGT GACAAAGAGT GGCACTACTA 1380 
TGTCATCAAT GTGG AGTTTC CTGTGGTAAC CTTATACATG GATGGAGCAA CATATGAACC 1440 
ATACCTGGTG ACCAACGACT GGCCCATTCA TCCATCTCAC ATAGCCATGC A ACTCA CAGT 1500 
CGGCGCTTGT TGGCAAGGAG GAGAAGTCAC CAAACCACAG TTTGCTCAGT TCTTTCATGG 1560 
AAGCCTGGCC AGTCTCACCA TCCGCCCTGG CAAAATGGAA AGCCAGAAGG TGATCTCCTG 1620 
CCTGCAGGCC TGCAAGGAAG GGCTGGACAT TAATTCCTTG GAAAGCCTTG GCCAAGGAAT 1680 
AAAGTATCAC TTCAACCCCT CGCAGTCCAT CCTGGTGATG GAAGGTGACG ACATTGGGAA 1740 
CATTAACCGT GCTCTCCAGA AAGTCTCCTA CATCAACTCC AGGCAGTTCC CAACGGCGGG 1800 
TGTGCGGCGC CTCAAAGTAT CCTCCAAAGT CCAGTGCTTT GGGGAAGACG TATGCATCAG 1860 
TATCCCTGAG GTAGATGCCT ATGTGATGGT CCTCCAGGCC ATCGAGCCCC GGATCACCCT 1920 
CCGGGGCACA GACCACTTCT GG AGACCTGC TGCCCAGTTT GAAAGTGCCA GGGGAGTGAC 1980 
CCTCTTCCCT G ATATCAAG A TTGTGAGCAC CTTCGCCAAA ACCG AAGCCC CCGGGG ACGT 2040 
GAAAACCACA GACCCCAAAT CAGAAGTCTT AGAGGAAATG CTTCATAACT TAGATTTCTG 2100 
TGACATTTTG GTGATCGGAG GGGACTTGGA CCCAAGGCAG GAGTGCTTGG AGCTCAACCA 2160 
CAGTGAGCTC CACCAACGAC ACCTGG ATGC CACTAATTCT ACTGCAGGCT ACTCCATCTA 2220 
CGGTGTGGGC TCCATGAGCC GCTATGAGCA GGTGCTACAT CACATCCGCT ACCGCAACTG 2280 
GCGTCCGGCT TCCCTTGAGG CCCGGCGTTT CCGGATTAAG TGCTCAGAAC TCAATGGGCG 2340 
CTACACTAGC AATGAGTTCA ACTTGGAGGT CAGCATCCTT CATGAAGACC AAGTCTCAG A 2400 
TAAGG AGCAT GTCAATCATC TG ATTGTGCA GCCTCCCTTC CTCCAGTCTG TCCATCATCC 2460 
TGAGTCCCGG AGTAGCATCC AGCACAGTTC AGTGGTCCCA AGCATTGCCA CAGTGGTCAT 2520 
CATCATCTCC GTGTGCATGC TTGTGTTTGT CGTGGCCATG GGTGTGTACC GGGTCCGGAT 2580 
CGCCCACCAG CACTTCATCC AGGAGACTGA GGCTGCCAAG GAATCTG AGA TGGACTGGGA 2640 
CGATTCTGCG CTGACTATCA CAGTCAACCC CATGGAGAAA CATGAAGGAC CAGGGCATGG 2700 
GGAAGATGAG ACTGAGGGAG AAGAGGAGGA AGAAGCCGAG GAAGAAATGA GCTCCAGCAG 2760 
TGGCTCTGAC GACAGCGAAG AGGAGGAGGA GGAGGAAGGG ATGGGCAGAG GCAGACATGG 2820 
GCAGAATGG A GCCAGGCAAG CCCAGCTGGA GTGGGATGAC TCCACCCTCC CCTACIAGJG 2880 
CCCAGGGGTC TGCTGCCTGG CCCACATGTC CCTTTTGTAA ACCCTGACCC AGTGTATGCC 2940 
CATGTCTATC ATACCTCACC TCTGATOTCT GTG ACATGTC TGGGAAGGCC TTCTCCAGCT 3000 
TCCTGGAGCC CACCCTTTAA GCCTTGGGCA CTCCCTGTGT TTCATCCATG GGGAAGTTCC 3060 
AAGAAGCCCA GCATGGCCAT CAGTGAGGAC TTCAGGGTAG ACTTTGTCCT GTAGCCTCCA 3120 
CTTCTGCCCT AAGTTCCCCA GCATCCTGAC TACCTGTCTG CAGAGTTTGC CTTTGTTTTT 3180 
TCCTGCAGGG AAGAAGGCCC ACCTTTGTGT CACTCACCTC CCCAGGCTCA GAGTCCCCAA 3240 
GGCCCTGGGG TTCCAACTCA CTGTGCGTCT CCTCCACACA GACCAGTAGG TTCTCCTATG 3300 
CTG ACTCCAG GTTGCTTCAT ACAAGGAGGG TGGTTGAACT TCACACACGT AAGGTCTTAG 3360 
TGCTTAACAG TTTAAAGGAA AGTCCTTGTT GAGGCAGAAC TAAGTTTACA GGGAAAGGTA 3420 
CACACATTCT CTCTCTCTCT CTCTCTCTGT CTATCTAGTT CCCCAGCTTG GAGAGCCTTT 3480 
GCCCTTGCTT CTTTCTGAGG CCATATAAGC TTATAAGAAA AGTCCCAAAC CAAGAATAGG 3540 
TCCTTGGOCA CAAGCAGGGT CTGATCCCCC ATCAG AGCTA TCTGAGCCTG CCTGTCTGGG 3600 
CACCTGCTGC AACCATGCAG CTACCCTGCC AGGGGCACTC AGCAAACAGA ACCACAGGGC 3660 
CCAGGAGGCA TTCCACACAG GCACTGCCCC AGGACAACAC AACAAGGACA GTCACAACAA 3720 
GGACAACAAG GACACAACAC AACACACAAC AAGGACAGTC ACAACAAGCC TAGAGCCAGA 3780 
AAGCAGATGG AAATGCTAAT GAGGTCAAAC GTAGGCTTCA TGGTGGGTGG AGTGGGGGTG 3840 
GCTGGGCTCC CCCAGGACAC AGGGGACCCT GAGGTTGGCA AGGCTCTCAC CACTCAGCCT 3900 
TATGGTCCCT TATCTCCT^T CTTCCCTCTT GAGAAAATAC ACGCTTTCTG CATGTATTAG 3960 
AAACGCACG A GCTCCACCAA GTCTACAATG AAAGTTTGAA ATTTAACTGC AAGG AATTAG 4020 
AAGCATATTT GCAATCATTG CAGCTTCTTC TTTCTTCTGC TCATAAAAGG AGGAACACTT 4080 
TAGATAGAGG GCAAATATAT CTGAAAACCT AATTTCTTTC TTTTTTTGAT AAGGAAATCT 4140 
TTTCCATCTC CATCCTAACA TGCACAACCT GTGAAGAGAA TTGTTTCTAT AGTAACTGGT 4200 
CTGTGATCTT TTGTGGCCAA GAGAATAGCA GGCAAGAATT AGGGCCTTG A CAGAATTTCC 4260 
ACGAAGCTCT G AGAACATGT TTOTTTCGAA TGTCTGATTC CTCTTTGTCA TCAATGTGTA 4320 
TGCTCTGTCC CCATCCTTCA CTCCTCCTCA AGCTCACACC AATTGGTTTG GCACAGGCAC 4380 
AGAGCTGGTC CCTAGTTAAG TGGCATTTAT GTTAAAAAAA AATAGTTCAG AATCTCAGCC 4440 
TTTTCTTTGT GTCATCAAAA CAGCTTAAG A AGGGGACTAC TGCCAATGTC CTCTAGTCTG 4500 
ACCTCCACCC AGGGAGGACC CATGGCAGGT CTTTTCAACT TTCTGATTCA TGAGAACAAC 4560 
CTTGTGAAGC TTTTCCCACC TCCTAAAGTG TTTTCTGCAT CTGTTCCTTC CTTTGGACCT 4620 
CACA ACAAAT CCTGTGAAGT AACTGAGACA TCTGTTGTTA GATACATTTT TGTGATGAGT 4680 
AAACTGAGGC TTCG 

Seq ID NO: 36 Protein sequence:
Protein Accession #: NP_071414.1

1 11 21 31 41 51 

MLPGRLCWVP LLLALGVGSG SGGGGDSRQR RLLAAKVNKH KPW1ETSYHG VITENNDTVI 60 
LDPPLVALDK DAPVPFAGEI CAFKIHGQEL PFEA WLNKT SGEGRLRAKS PIDCELQKEY 120 
TFIIQAYDCG AGPHETAWKK SHKAWHIQV KDVNEFAPTF KEPAYKAWT EGKIYDS1LQ 180 
VEAIDEDCSP QYSQICNYEI VTTDVPFAID RNGNIRNTEK LSYDKQHQYE ILVTAYDCGQ 240 
KPAAQDTLVQ VDVKPVCKPG WQDWTKRIEY QPGSGSMPLF PSIHLETCDG AVSSLQIVTE 300 
LQTNYIGKGC DRETYSEKSL QKLCGASSGI IDLLPSPSAA TNWTAGLLVD SSEMIFKFDG 360 
RQGAKIPDGI VPKNLTDQFT ITMWMKHGPS PGVRAEKETI LCNSDKTEMN RHHYALYVHN 420 
CRLVFLLRKD FDQADTFRPA EFHWKLDQIC DKEWHYYVIN VEFPWTLYM DGATYEPYLV 480 
TNDWPIHPSH IAMQLTVGAC WQGGEVTKPQ FAQFFHGSLA SLURPGKME SQKVISCLQA 540 
CKEGLDINSL ESLGQGKYH FNPSQSILVM EGDDIGNINR ALQKVSYINS RQFPTAGVRR 600 
LKVSSKVQCF GEDVC1SIPE VDAYVMVLQA IEPRITLRGT DHFWRPAAQF ESARG VTLFP 660 
DIK1VSTFAK TEAPGDVKTT DPKSEVLEEM LHNLDFCDIL VIGGDLDPRQ ECLELNHSEL 720 
HQRHLDATNS TAGYSIYGVG SMSRYEQVLH HIRYRNWRPA SLEARRFRK CSELNGRYTS 780 
NEFNLEVSIL HEDQVSDKEH VNHLIVQPPF LQSVHHPESR SSIQHSSWP SIATWinS 840 
VCMLVF WAM GVYRVRIAHQ HFIQETEAAK ESEMDWDDSA LTITVNPMEK HEGPGHGEDE 900 
TEGEEEEEAE EEMSSSSGSD DSEEEEEEEG MGRGRHGQNG ARQAQLEWDD STLPY 

Seq ID NO: 37 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 143-374 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51

GGGAGGGAGA G AGGCGCGCG GGTGAAAGGC GCATTG ATGC AGCCTGCGGC GGCCTCGGAG 60 
CGCGGCGGAG CCAGACGCTG ACCACGTTCC TCTCCTCGGT CTCCTCCGCC TCCAGCTCCG 120 
CGCTGCCCGG CAGCCGGGAG C CATGC GACC CCAGGGCCCC GCCGCCTCCC CGCAGCGGCT 180 
CCGCGGCCTC CTGCTGCTCC TGCTGCTGCA GCTGCCCGCG CCGTCGAGCG CCTCTGAGAT 240 
CCCCAAGGGG AAGCAAAAGG CGCAGCTCCG GCAGAGGGAG GTGGTGGACC TGTATAATGG 300 
AATGTGCTTA CAAGGGCCAG CAGGAGTGCC TGGTCGAGAC GGGAGCCCTG GGGCCAATGG 360 
CATTCCGGGT ACACCTGGGA TCCCAGGTCG GGATGGATTC AAAGGAGAAA AGGGGGAATG 420 
TCTGAGGGAA AGCTTTGAGG AGTCCTGGAC ACCCAACTAC AAGCAGTGTT CATGGAGTTC 480 
ATTG AATTAT GGCATAGATC TTGGGAAAAT TGCGGAGTGT ACATTTACAA AGATGCGTTC 540 
AAATAGTGCT CTAAG AGTTT TGTTCAGTGG CTCACTTCGG CTAAAATOCA GAAATGCATG 600 
CTGTCAGCGT TGGTATTTCA CATTCAATGG AGCTGAATGT TCAGGACCTC TTCCCATTGA 660 
AGCTATAATT TATTTGGACC AAGGAAGCCC TGAAATGAAT TCAACAATTA ATATTCATCG 720
CACTTCTTCT GTGGAAGQAC TTTGTGAAGG AATTGOTGCT GGATTAGTGG ATGTTGCTAT 780 
CTGGGTTGGC ACTTGTTCAG ATTACCCAAA AGG AGATGCT TCTACTGG AT GG A ATTCAGT 840 
TTCTCGCATC ATTATTG AAG AACTACCAAA ATAAATGCTT TAATTTTCAT TTGCTACCTC 900 
H mil ATT ATGCCTTGGA ATGGTTCACT TAAATGACAT TTTA AATAAG TTTATGTATA 960 
CATCTGAATG AAAAGCAAAG CTAAATATGT TTACAGACCA AAGTGTGATT TCACACTGTT 1020 
TTTAAATCTA GCATTATTCA TTTTGCTTCA ATCAAAAGTG GTTTCAATAT TTTTTTTAGT 1080 
TGGTTAGAAT ACTTTCTTCA TAGTCACATT CTCTCAACCT ATAATTTGOA ATATTGTTGT 1 140 
GGTCTTTTGT TTTTTCTCTT AGTATAGCAT TTTTAAAAAA ATATAAAAGC TACCAATCTT 1200 
TGTACAATTT GTAAATGTTA AG AATTTTTT TTATATCTGT TAAATAAAAA TTATTTCCAA 1260 
CAACCTTAAA AAAAAAAAAA AAAA 

Seq ID NO: 38 Protein sequence:
Protein Accession #: none found

1 11 21 31 41 51

MRPQGPAASP QRLRGLLLLL LLQLPAPSSA SEIPKGKQKA QLRQREVVDL YNGMCLQGPA 60 
GVPGRDGSPG ANG1PGTPGI PGRDGFKGEK GECLRESFEE SWTPNYKQCS WSSLNYGIDL 120 
GKJAECTFTK MRSNSALRVL FSGSLRLKCR NACCQRWYFT FNG AECSGPL PIEAIIYLDQ 1 80 
GSPEMNSTIN IHRTSSVEGL CEGIGAGLVD VAIWVGTCSD YPKGDASTGW NSVSR1IIEE 240 
LPK 

Seq ID NO: 39 DNA sequence

Nucleic Acid Accession #: NM_000949

Coding sequence: 285-2153 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

GGAGGCTGAA ATCCCCAGAC GCCGGTTTTC TGGGCTGGGC TTTCTGCTTA CTCACTCCTT 60 
CTCCCTCTTT CTGGATTTTA CCGACCGTTC GCGAAACAGC TT TCCA CACA ATGGAGCTTC 120 
ATGTCCTCGT GCAGGAAGTA CTCATCGACT GATGTGGCAG ACTTTGCTCC CTGACAAAAC 1 80 
TAAAG AACTC TCCTATTCAT GGAGGCGAAC ACTGAGGATG CTTTCCACAT GAACCCTGAA 240 
GTGAACTTCT GATACATTTC CTGCAGCAAG AGAAGOCAGC CAACATOAAG GAAAATGTGG 300 
CATCTGCAAC OGTTTTCACT CTGCTACTTT TTCTCAACAC CTGCCTTCTG AATGGACAGT 360 
TACCTCCTGG AAAACCTGAG ATCTTTAAAT GTCGTTCTCC CAATAAGGAA ACATTCACCT 420 
GCTGGTGGAG GCCTGGGACA GATGG AGGAC TTCCTACCAA TTATTCACTG ACTTACCACA 480 
GGGAAGOAGA GACACTCATG CATGAATGTC CAGACTACAT AACCGGTGGC CCCAACTCCT 540 
GCCACTTTGG CAAGCAGTAC ACCTCCATGT GGAGGACATA CATCATGATG GTCAATGCCA 600 
CTAACCAGAT GGGAAGCAGT TTCTCGGATG AACTTTATGT GGACGTGACT TACATAGTTC 660 
AGCCAG ACCC TCCTTTGGAG CTGGCTGTGG AAGTAAAACA GCCAGAAG AC AGAAAACCCT 720 
ACCTGTGGAT TAAATGGTCT CCACCTACCC TGATTGACTT AAAAACTGGT TGG TTCA CGC 780 
TCCTGTATGA AATTCGATTA AAACCCGAGA AAGCAGCTGA GTGGGAGATC CATTTTGCTG 840 
GGCAGCAAAC AG AGTTTAAG ATTCTCAGCC TACATCCAGG ACAGAAATAC CTTGTCCAGG 900 
TTCGCTGCAA ACCAGACCAT GGATACTGGA GTGCATGGAG TCCAGCGACC TT CATTC AQA 960 
TACCTAGTGA CTTCACCATG AATGATACAA CCGTGTGGAT CTCTGTGGCT GTCCTTTCTG 1020 
CTGTCATCTG TTTGATTATT GTCTGGGCAG TGGCTTTGAA GGGCTATAGC ATGGTGACCT 1080 
GCATCTTTCC GCCAGTTCCT GGGCCAAAAA TAAAAGGA7T TOA TGCTC AT CTGTTGGAGA 1 140 
AGGGCAAGTC TGAAGAACTA CTGAGTGCCT TGGGATGCCA AGACTTTCCT CCCACTTCTG 1200 
ACTATGAGGA CTTGCTGGTG GAGTATTTAG AAGTAGATGA TAGTG AGGAC CAGCATCTAA 1260 
TGTCAGTCCA TTCAAAAGAA CACCCAAGTC AAGGTATGAA ACCCACATAC CTGGATCCTG 1320 
ACACTGACTC AGGCCGGGGG AGCTGTGACA GCCCTTCCCT TTTGTCTGAA AAGTGTGAGG 1380 
AACCCCAGGC CAATCCCTCC ACATTCTATG ATCCTGAGGT CATTGAGAAG CCAG AGAATC 1440 
CTGAAACAAC CCACACCTGG GACCCCCAGT GCATAAGCAT GGAAGGCAAA ATCCCCTATT 1 500 
TTCATGCTGG TGGATCCAAA TGTTCAACAT GGCCCTTACC ACAGCCCAGC CAGCACAACC 1560 
CCAGATCCTC TTACCACAAT ATTACTGATG TGTGTGAGCT GGCTGTGGGC CCTGCAGGTG 1620 
CACCGGCCAC TCTGTTGAAT GAAGCAGGTA AAGATGCTTT AAAATCCTCT CAAACCATTA 1680 
AGTCTAGAGA AGAGGG AAAG GCAACCCAGC AGAGGGAGGT AGAAAGCTTC CATTCTGAGA 1740 
CTGACCAGG A TACGCCCTGG CTGCTGCCCC AGGAGAAAAC CCCCTTTOGC TCCGCTAAAC 1 800 
CCTTGGATTA TGTGGAGATT CACAAGGTCA ACAAAGATGG TGCATTATCA TTGCTACCAA 1860 
AACAGAGAGA GAACAGCGGC AAGCCCAAGA AGCCCGGGAC TCCTGAGAAC AATAAGGAGT 1920 
ATGCCAAGGT GTCCGGGGTC ATGGATAACA ACATCCTGGT GTTGGTGCCA GATCCACATG 1980 
CTAAAAACGT GGCTTGCTTT G AAGAATCAG CCAAAGAGGC CCCACCATCA CTTGAACAGA 2040 
ATCAAGCTGA GAAAGCCCTG GCCAACTTCA CTGCAACATC AAGCAAGTGC AGGCTCCAGC 2100 
TGGGTGGTTT GGATTACCTG G ATCCCGCAT GTTTTACACA CTCCTTTCAC TGATAGCTTG 2160 
ACTAATGGAA TGATTGGTTA AAATGTGATT TTTCTTCAGG TAACACTACA GAGTACGTGA 2220 
AATGCTCAAG AATGTAGTCA G ACTG ACACT ACTAAAGCTC CCAGCTCCTT TCATGCTCCA 2280 
TTTTTAACCA CTTGCCTCTT TCTCCAGCAG CTGATTCCAG AACAAATCAT TATGTTTCCT 2340 
AACTGTGATT TGTAG ATTTA CTTTTTGCTG TTAGTTATAA AACTATGTGT TCAATG AAAT 2400 
AAAAGCACAC TGCTTAGTAT TCTTG AGGGA CAATGCCAAT AGGTATATCC TCTGGAAAAG 2460 
GCTTTCATGA TTTGGCATGG GACAGACGGA AATGAAATTG TCAAAATTGT TTACCATAGA 2520 
AAGATGACAA AAGAAAATTT TCCACATAGG AAAATGCCAT GAAAATTGCT TTTGAAAAAC 2580 
AACTGCATAA CCTTTACACT CCTCGTCCAT TTTATTAGG A TTACCCAA AT ATA ACCATTT 2640 
AAAGAAAG AA TGCATTCCAG AACAAATTGT TTACATAAGT TCCTATACCT TACTGACACA 2700 
TTGCTG ATAT GCAAGTAAGA AAT 



Seq ID NO: 40 Protein sequence:
Protein Accession #: NP_000940.1

1 11 21 31 41 51 

I I I I I I

MKENVASATV FTLLLFLNTC LLNGQLPPGK PEIFKCRSPN KETFTCWWRP GTDGGLPTNY 60 
SLTYHREGET LMHECPDYIT GGPNSCHFGK QYTSMWRTYI MMVNATNQMG SSFSDELYVD 120 
VTYIVQPDPP LELAVEVKQP EDRKPYLWCC WSPPTL1DLK TGWFTLLYEI RLKPEKAAEW 180 
EIHFAGQQTE FKILSLHPGQ KYLVQVRCKP DHGYWSAWSP ATFIQIPSDF TMNDTTVWIS 240 
VAVLS A VICL HVWAVALKG YSMVTCIFPP VPGPKKGFD AHLLEKGKSE ELLSALGCQD 300 
FPPTSDYEDL LVEYLEVDDS EDQHLMSVHS KEHPSQGMKP TYLDPDTDSG RGSCDSPSLL 360 
SEKCEEPQAN PSTFYDPEVI EKPENPETTH TWDPQCISME GKJPYFHAGG SKCSTWPLPQ 420 
PSQHNPRSSY HN1TDVCELA VGPAGAPATL LNEAGKDALK SSQTIKSREE GKATQQREVE 480 
SFHSETDQDT PWLLPQEKTP FGSAKPLDYV EIHKVNKDGA LSLLPKQREN SGKPKKPGTP 540 
ENNKEYAKVS GVMDNNILVL VPDPHAKNVA CFEESAKEAP PSLEQNQAEX ALANFTATSS 600 
KCRLQLGGLD YLDP ACFTHS FH 

Seq ID NO: 41 DNA sequence 

Nucleic Acid Accession #: none found, Eos cloned sequence 

Coding sequence: 1-1572 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51
I I I I 1 I 

ATGACCCAAA ATAAATTAAA GCTTTGTTCC AAAGCCAATG TGTATACTGA AGTGCCTGAT 60 
GGAGGATGGG GCTGGGCGGT AGCTGTTTCA TTTTTCTTCG TTGAAGTCTT CACCTACGGC 120 
ATCATCA AG A CATTTGGTGT CTTCTTTAAT GACTTAATGG ACAGT TTTA A TGAATCCAAT 1 80 
AGCAGG ATCT CATGGATAAT CTCAATCTGT GTGTTTGTCT TAACATTTTC AGCTCCCCTC 240 
GCCACAGTCC TGAGCAATCG TTTCGGACAC CGTCTGGTAG TGATGTTGGG GGGGCTACTT 300 
GTCAGCACCG GGATGGTGGC CGCCTCCTTC TCACAAGAGG TTTCTCATAT GTACGTCGCC 360 
ATCGGCATCA TCTCTGGTCT GGGATACTGC TTTAGTTTTC TCCCAACTGT AACCATCCTA 420 
TCACAATATT TTGGCAAAAG ACGTTCCATA GTCACTGCAG TTGCTTCCAC AGGAGAATGT 480 
TTCGCTGTGT TTGCTTTCGC ACCAGCAATC ATGGCTCTGA AGGAGCGCAT TGGCTGG AGA 540 
TACAGCCTCC TCTTCGTGGG CCTACTACAG TTAAACATTG TCATCTTCGG AGCACTGCTC 600 
AGACCCATCT TTATCAGAGG ACCAGCOTCA CCGAAAATAG TCATCCAGGA AAATCGGAAA 660 
GAAGCGCAGT ATATGCTTGA AAATG AGAAA ACACGAACCT CAATAGACTC CATTGACTCA 720 
GGAGTAG AAC TAACTACCTC ACCTAAAAAT GTGCCTACTC ACACTAACCT GGAACTGGAG 780 
CCGAAGGCCG ACATGCAGCA GGTCCTGGTG AAGACCAGCC CCAGGCCAAG CGAAAAGAAA 840 
GCCCCGCTAT TAG ACTTCTC CATTTTGAAA GAG AA AAGTT TTATTTGTTA TGCATTATTT 900 
GGTCTCTTTG CAACACTGGG ATTCTTTGCA OCTTCCTTGT ACATCATTCC TCTGGGCATT 960 
AGTCTGGGCA TTGACCAGGA CCGCGCTGCT TTTTTATTAT CTACGATGGC CATTGCAGAA 1020 
GTTTTCGGAA GGATCGGAGC TG OTTI 1GTC CTCAACAGGG AGCCCATTC G TAA GATTTAC 1080 
ATTGAGCTCA TCTGCGTCAT CTTATTGACT GTGTCTCTGT TTGCCTTTAC TTTTGCTACT 1 140 
GAATTCTGGG GTCTAATGTC ATGCAGCATA TTTTTTGGGT TTATGGTTGG AACAATAGGA 1200 
GGG ACTCACA TTCCACTG CT TGCTGAGGAT GATGTCGTGG GCATTGAGAA GATGTCTTCT 1260 
GCAGCTGGGG TCTACATCTT CATTCAGAGC ATAGCAGGAC TGGCTGGACC aCCCCTTGCA 1320 
GGTTTGTTGG TGGACCAAAG TAAGATCTAC AGCAGGGCCT TCTACTCCTG CGCAGCTGGC 1380 
ATGGCCCTGG CTGCTGTGTG CCTCGCCCTG GTGAGACCGT GTAAGATGGG ACTGTGCCAG 1440 
CATCATCACT CAGGTGAAAC AAAGGTAGTG AGCCATCGTG GGAAGACTTT ACAGGACATA 1500 
CCTGAAGACT TTCTGGAAAT GGATCTTGCA AAAAATGAGC ACAGAGTTCA CGTGCAAATG 1560 
GAGCCGGTA TGA 

Seq ID NO: 42 Protein sequence:

Protein Accession #: none found, Eos cloned sequence 

1 11 21 31 41 51 
I I I I I I 

MTQNKUaCS KANVYTEVPD GGWGWAVAVS FFFVEVFTYG IIKTFGVFFN DLMDSFNESN 60 
SRISWIISIC VFVLTFSAPL ATVLSNRFGH RLWMLGGLL VSTGMVAASF SQEVSHMYVA 120 
IGIISGLGYC FSFLPTVTIL SQYFGKRRSI VTAVASTGEC FAVFAFAPAI MALKERIGWR 1 80 
YSLLFVGLLQ LNIVIFGALL RPIFIRGPAS PKIVIQENRK EAQYMLENEK TRTSIDSIDS 240 
GVELTTSPKN VPTHTNLELE PKADMQQVLV KTSPRPSEKK APLLDFSILK EKSFICYALF 300 
GLFATLGFFA PSLYIIPLGI SLGIDQDRAA FLLSTMAIAE VFGRIGAGFV LNREPIRK1Y 360 
IELICVILLT VSLFAFTFAT EFWGLMSCSI FFGFMVGTIG GTFUPLLAED DWGIEKMSS 420 
AAGVYIFIQS IAGLAGPPLA GLLVDQSKIY SRAFYSCAAG MALAAVCLAL VRPCKMGLCQ 480 
HHHSGETKVV SHRGKTLQDI PEDFLEMDLA KNEHRVHVQM EPV 

Seq ID NO: 43 DNA sequence 

Nucleic Acid Accession #: FGENESH predicted ORF

Coding sequence: 1-1749 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

ATGCTGTCTG GCTTCTTGAT GAGTCCCAGT ACCCAGCACA GAGCACAGTA CACTCCCGGA 60
GGAAAGAAAC TTCCGTGGGA GGCTTCCATC GGTGCGCACA CCTCCCGAGG GCGAGGCAGC 120 
GACCGGGAG A GGG AGAGCCG GCCGGAGGCT GCCGGGCTCC TGTGGG ACCG CGCTGCAGCC 1 80 
GGGGAGGCGG AGAAGGGGA A CCGGGGCGAG CCGCCCGCCT GGATCCGCGC CCAGCAGCAG 240 
CCGCGGCCGC CGCCAGCTGG GCAGGCTCCC GGGACTGCGG CTGGGGGCGC GCAGG ACCCT 300 
CGCCTGCGTC CTGGACGTTC CCGGGGGAGG GTCCGGTTGC CAGTGAAACC TCCAGAGGCT 360 
TCCGGACGAC AGCCCCGGGG GCCTTCTGAC TGCATCCCGA GATTTCCATC AGCGAGTGCA 420
ACTCATAAGG CAGTCCCTAA GGGGACCGGG CCACCGGCTG AGGACGGGGA TGGCTTAGGA 480 
GCTCCTGGAC CTAGGGCCCG GCGTCGTCGC CTCCTGGGCG TCGCGGCAGA GGGGAGTGGC 540 
CCGCGCGGAA AGCGCCGCGG GACAGTCAGT GACGAGGCCC GGGGGTCGCC GGGGCCACGA 600 
CTTCTCGGAG ACCGTCCTGC GCTCTCTOGA GACGCGCTGT CCGCGCCCAG GGTGGTGCCA 660 
TGTGGGGCGC TCGCCGCTCG TCCGTCTCCT GATCCTGGAA CGCCGCTTCG CTCCTGCAGC 720 
TGCTGCTGGC TGCGCTGCTG GCGGCGGGGG CGAGGGCCCA GCGGCGAGTA CTGCCACGGC 780 
TGGCTGG ACG CGCAGGGCGT CTGGCGCATC GGCTTCCAGT GTCCCGAGCG CTTCG ACGGC 840 
GGCGACGCCA CCATCTGCTG CGGCAGCTGC GCGTTGCGCT ACTGCTGCTC CAGCGCCGAG 900 
GCGCGCCTGG ACCAGGGCGG CTGCG ACAAT G ACCGCCAGC AGGGCGCTGG CGAGCCTGGC 960 
CGGGCGGACA AAGACGGGCC CCGACGGCTC GGCAGGGCTT CATGTCTTAG GGGTACCCAA 1020 
GGAGACGGCG AGGGTGCGCC CqCACCCGTG AGGGCCTGGC AGCGGTGCTC CCCTGAAGGC 1080 
TCCCCGAAAG G AAGGCAGCT CCTCAGGGCT TTCCCGGGGC TGCTGCCCCG TGCCAGACGC 1 140 
CGCGGATTCC CATCTTCTCC ACGCGGCGGC CCCTCTCCCC TGCAGCGGCC CGCCTTGCCC 1200 
ATCTACGTGC CGTTCCTCAT TGTTGGCTCC GTGTTTGTCG CCTTTATCAT CTTGGGGTCC 1260 
CTGGTGGCAG CCTGTTGCTO CAGATGTCTC CGGCCTAAGC AGGATCCCCA GCAGAGCCGA 1320 
GCCCCAGGGG GTAACCGCTT GATGGAGACC ATCCCCATGA TCCCCAGTGC CAGCACCTCC 1380 
CGGOGGTCGT CCTCACGCCA GTCCAGCACA GCTGCCAGTT CCAGCTCCAG CGCCAACTCC 1440 
GGGGCCCGGG CGCCCCCAAC AAGGTCACAG ACCAACTGTT GCTTGCCGGA AGGGACCATG 1500 
AACAACGTGT ATGTCAACAT GCCCACGAAT TTCTCTGTGC TGAACTGTCA GCAGGCCACC 1560 
CAGATTGTGC CACATCAAGG GCAGTATCTG CATCCCCCAT ACGTGGGGTA CACGGTGCAG 1620 
CACGACTCTG TGCCCATGAC AGCTGTGCCA CCTTTCATGG ACGGCCTGCA GCCTGGCTAC 1680 
AGGCAGATTC AGTCCCCCTT CCCTCACACG AACAGTGAAC AGAAGATGTA CCCAGCGGTG 1740 
ACTOTATAAC CGAGAGTCAC TGGTGGGTTC CTTTACTGAA GGGAG ACGAA GGCAGGGGTG 1 800 
GATTCTCGAG GTGGAAGTCC GCACATGTCG GTGGTATTTA TGGCACGATT CCTTTGGATQ 1860 
GCTTCATTTG CCCCCAGACT GTATGAAAAC ATCTCCGAAT TAGCATTTCT GGATATGTTT 1920 
CATCCAGGGT ATCATTGATT TATGATGGAA AACCGGCCTC AGCTOGAGAT GACTGTGATG 1980 
TTGCTGATGG GTGTATAACA AATGCTTGAG TCCGAAGTGC CCTTGAGATA TGGTTGACGA 2040 
AAGAATTTTA TAAACTGATA AATTAAGGAT TTTTATTATG TTGTTATTAT TATTTCTTTT 2100 
TTGTTGTTGA CTGCACAGGA TCAAAATGCC TGTTATCTCC CTTTTACTGG G ACTTTTTTT 2160 
III 11 11 111 1 111111 1 AA TCAGACAGGG TCTTGCTCTG TTGCCCAGGC TGGAGTGCAG 2220 
TGGTGCGATC TCGGCTCACT GCAACTTCAG CCTCCTGOAT TCAGGCAACA CTCCTGCCTC 2280
AGCCTCCCAC GTGGCTGGGA TTACAGGTGC CTGCCCCCAT GGCTAATTTT TTGTATTTTT 2340 
TGTAGAGATG GGGTTTCACC ATGTTGGCTG GGCTGGTCTC ACTCTCCTGA CCTCAAGCAA 2400 
TCTGCCTGTC TCAGCCTCCC AAAGTGCTGG GATTACAGGC GTGAGCCACC GCCOCCAGCC 2460 
TGAGCCTTTT TTTTTTTCTA ATGCATCCAA GGTTAAGGGG AAGACGCAAA TAACAGGACT 2520 
ATTCTAAAAG GAAACCTGTT TGAACTCTGT GAG ATCAGTC ATCAGTCTCA GTATTCCACA 2580 
GGCACACCTT AATT TCATT G TAAA AAG ATA f ATATATTTT GTCTATTTTT GTGCTTTTGG 2640 
GGGCCTATTT TGTGCTTTTT TACCTTATGT AGAGATCTTA TTACAAAGTG ATTTTCTACA 2700 
TTAAAAAGAG ACTGAAATAA ATTGTATAGT TACTTAACTA ATGAAGACAT TTCAG AACTC 2760 
TGGGATGATT TTAATCTTGA AGTAGTAGGT GGTATAGTCA TAAAACCATT CATCCCCTTC 2820 
TTGATTGTAT CTTAATTTTC TGGCTTTAAG GTGACATCTG AGAGGTAATG CATTCTTTTT 2880
TATATTGAAA TCATAAACTA TCACCCGCTG CTTCTCTGAG TTACTTTTAA TTTTGCCTTG 2940 
TGGTTATGGT TTGGCGTTTC CTTCTGTTTG GTTTTCAGAG CCCCATGTCT ATATAGTCCT 3000 
GAGTGCAAGT AATTACTATA CTTGTAAATG AAG ATCAGTA TTTCTGCCTA GATCTGATAA 3060 
AAAAATTTTC TTGTCTTAGT TATAAAAATT CAAAGAAATG TGTTACAAAG ATACTTAGTA 3120 
TAGCTCCTCA GCCATAACCT GAGACTTGGG ATGAAATTTA AACCAGATAC GATTTACTTT 3180
GCAGATCATA AGGCTTTTTA TACTCTTGTT ATCAAAATGG CTTATTTTTC AGGCACTAAG 3240 
G ATTGTTAAG AGAAAAGCTT TTCAACGAAG GATTGCCTTT CTTCTCCCAC ACTGTTCTTG 3300 
ATTTCCTCTC TCTTTCAGGC CTCAACAGGC ACTGTATTCA TTGCCAATGT TCCAAATTAT 3360 
CAAATTCAAG TGAATTTATT TGTGTGTTCT TTACTTATAT AAAAA AAGAT AACTTTAAGG 3420 
ATGTGCAAGT ACATTTCCAA CTGCTAGCAC AACCAGTATT TTGTAATTAA ACAAATCGCT 3480 
GTATGGTATG GTCTTCTACA CATTTATGTC TATAGATATC TATCGATCAT CTTTCTATTC 3540 
TGTTTCATGA CTGAATAATG TAAAACCAGT GTTGGCAATT GGTATCATCA ATGATACTCA 3600 
TTTTTTAATA ACCAAAGGCA GGGGAAAATC ATTTTACTTA TTAATAAATA TTTTATGATG 3660 
TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 44 Protein sequence: 

Protein Accession #: FGENESH predicted 

1 11 21 31 41 51
I I I I I I 

MLSGFLMSPS TQHRAQYTPG GKKLPWEASI GAHTSRGRGS DRERESRPEA AGLLWDRAAA 60 
GEAEKGNRGE PPAWIRAQQQ PRPPPAGQAP GTAAGGAQDP RLRPGRSRGR VRLPVKPPEA 120 
SGRQPRGPSD CIPRFPSASA THKAVPKGTG PPAEDGDGLG APGPRARRRR LLGVAAEGSG 180 
PRGKRRGTVS DEARGSPGPR LLGDRPALSG DALSAPRVVP CGALAARPSP HPGTPLRSCS 240 
CCWLRCWRRG RGPSGEYCHG WLDAQGVWRI GFQCPERFDG GDATICCGSC ALRYCCSSAE 300 
ARLDQGGCDN DRQQGAGEPG RADKDGPRRL GRASCLRGTQ GDGEGAPPPV RAWQRCSPEG 360 
SPKGRQLLRA FPGLLPRARR RGFPSSPRGG PSPLQRPALP IYVPFLIVGS VFVAFIILGS 420 
LVAACCCRCL RPKQDPQQSR APGGNRLMET IPMIPSASTS RGSSSRQSST AASSSSSANS 480 
G ARAPPTRSQ TNCCLPEGTM NNVYVNMPTN FSVLNCQQAT QIVPHQGQYL HPP YVGYTVQ 540 
HDSVPMTAVP PFMDGLQPGY RQIQSPFPHT NSEQKM YPAV TV 

Seq ID NO: 45 DNA sequence 

Nucleic Acid Accession #: NM_002285

Coding sequence: 55-3738 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51
I I I I I I

GGCCGAGCCT CGGCGGCGGC GGTAGCGGCG GCGGCG ACGC TGACACCTCC CACCATGGAC 60 
AGCTTCGACT TAGCCCTGCT CCAGGAATGG GACCTCGAGT CACTGTGTGT CTATGAACCA 120 
GATAGAAATG CATTACGGAG GAAAGAACGA GAAAGAAGAA ATCAAGAAAC TCAACAGGAT 180 
GATGGCACGT TTAATTCTAG TTACTCTCTC TTCAGTGAGC CCTACAAGAC TAACAAGGGG 240 
G ATG AACTCT CCAACCGG AT CCAGAACACT TTAGGCAATT ATGATGAAAT GAAAGACTTT 300 
TTAACTGATA GAACCAATCA GAGTCATCTC GTTGGAGTTC CCAAACCTGG GGTTCCTCAG 360 
ACTCCTGTGA ACAAGATCGA TGAACATTTT GTTGCAGATT CAAG AGOCCA GAACCAGCCC 420 
TCGTCTATCT GTAGCACTAC AACTTCCACA CCAGCAGCTG TCCCCGTGCA GCAGAGTAAA 480 
AGAGGCACTA TGGGCTGGCA GAAGGCTGGG CACCCACCCT CTGACGGCCA ACAGAGAGCA 540 
ACACAACAGG GCTCTCTCAG GACCTTGCTT GGAGATGGTG TTGGCAGACA GCAGCCTCGG 600 
GCCAAACAAG TGTGCAATGT GGAGGTGGGC CTTCAGACCC AGGAGAGGCC ACCTGCCATG 660 
GCGGCCAAGC ACAGCAGCAG CGGACACTGT GTTCAGAACT TTCCTCCATC CCTAGCTTCA 720 
AAACCCAGCC TGGTCCAGCA G AAACCGACC GCGTATGTGA GGCCAATGGA CGGCCAAG AT 780 
CAGGCCOCTG ATG AGTCTCC TAAGCTGAAG TCGTCTTCGG AAACCAGCGT GCACTGCACA 840 
TCATACAGGG GAGTCCCTGC CAGCAAGCCG GAGCCTGCCA GAGCCAAGGC CAAGCTCTCC 900 
AAGTTCAGCA TCCCCAAGCA GGGGGAGGAG AGTAGATCTG GAGAAACCAA CAGCTGTGTT 960 
GAAGAAATAA TCCGGGAGAT GACCTGGCTT CCACCACTTT CTGCTATTCA AGCACCTGGC 1020 
AAAGTGGAAC CAACCAAATT TCCATTTCCA AATAAGGACT CTCAGCTTGT ATCCTCTGGA 1080 
CACAATAATC CAAAGAAAGG TGATGCAGAG CCAGAGAGTC CAGACAATGG CACATCGAAT 1140 
ACATCAATGC TGGAAGATGA CCTTAAGCTA AGCAGTGATG AAGAGGAGAA TGAACAGCAG 1200 
GCAGCTCAGA GAACGGCTCT CCGCGCTCTC TCTGACAGCG CCGTGGTCCA GCAGCCCAAC 1260 
TGCAGAACCT CGGTGCCTTC CAGCAAGGGC AGCAGCAGCA GCAGCAGCAG CGGCACGAGC 1320 
AGCTCCTCCA GCGACTGAGA GAGCAGCTCC GGATCTGACT CGGAGACCGA GAGCAGCTCC 1380 
AGCGAGAGTG AGGGCAGCAA GCCCCCCCAC TTCTCCAGCC CCGAGGCTGA ACCGGCATCC 1440 
TCTAACAAGT GGCAGCTGGA TAAATGGCTA AACAAAGTTA ATCCCCACAA GCCTCCTATT 1500 
CTGATCCAAA ATGAAAGCCA CGGGTCAGAG AGCAATCAGT ACTACAACCC GGTGAAAGAG 1560 
GACGTCCAGG ACTGTGGGAA AGTCCCCGAC GTTTGCCAGC CCAGCCTGAG AGAGAAGGAG 1620
ATCAAGAGCA CTTGCAAGGA GGAGCAAAGG CCAAGGACAG CCAACAAGGC CCCTGGGAGT 1680 
AAAGGCGTGA AGCAGAAGTC CCCGCCCGCG GCCGTGGCCG TGGCGGTOAG CGCAGCCGCC 1740 
CCGCCACCCG CAGTGCCCTG TGCGCCCGCG GAGAACGCGC CCGCGCCTGC CCGGAGGTCC 1800 
GCGGGCAAGA AGCCCACCAG GCGCACCGAG AGGACCTCAG CCGGGGACGG CGCCAACTGC 1860 
CACCGGCCCG AGGAGCCCGC GGCCGCGGAC GCGCTGGGGA CGAGCGTGGT GGTCCCCCCG 1920 
GAGCCCACCA AAACCAGGCC CTGTGGCAAC AACAGAGCGA GCCACCGCAA GGAGCTGCGC 1980 
TCCTCCGTG A CCTGCGAGAA GCGCCGCACG CGGGGGCTAA GCAGGATCGT CCCCAAATCC 2040 
AAGGAGTTCA TTGAGACAGA GTCGTCATCT TCATCCTCCT CCTCGGACTC CGACCTGGAG 2100 
TCCGAGCAGG AGGAGTACCC TCTGTCCAAA GCACAGACCG TGGCTGCCTC TGCCTCCTCC 2160 
GGGAATGATC AG AGGCTG AA GGAGGCCGCT GCCAACGGGG GCAGTGGTCC TAGGGCCCCT 2220 
GTAGGCTCCA TCAACGCCAG GACCACCAGT G ACATCGCCA AGG AGCTGG A GGAGCAGTTC 2280 
TACACACTGG TCCCCTTTGG CCGGAACGAA CTTCTCTCCC CTCTAAAGGA CAGTGATGAG 2340 
ATCAGGTCTC TCTGGGTCAA AATCGACCTG ACCCTCCTGT CCAGGATCCC AGAACACCTG 2400 
CCCCAGG AGC CAGGGGTATT GAGCGCCCCT GCCACCAAGG ACTCTGAG AG COCACCGCCC 2460 
AGCCACACCT CGGACACACC TGCAGAAAAG GCTTTGCCAA AATCCAAGAG GAAACGCAAG 2520 
TGTGACAACG AAGACGACTA CAGGGAGATC AAGAAGTCCC AGGGAGAGAA AGACAGCTCT 2580 
TCAAGACTGG CCACCTCCAC CAGTAATACT TTGTCTGCAA ACCACTGCAA CATGAACATC 2640 
AACAGTGTGG CAATACCA AT AAATAA AAAT G AAAAAATGC TTCGGTCGCC CATCTCACCC 2700 
CTCTCTGATG CATCTAAACA CAAATACACC AGCG AGGACT TAACTTCTTC CAGCCGACCT 2760 
AATGGCAACA GTTTGTTTAC TTCAGCCTCT TCCAGCAAAA AGCCTAAGGC CGACAGCCAG 2820 
CTGCAGCCTC ACGGCGGAGA CCTCACGAAA GCAGCTCACA ACAATTCTGA AAACATTCCC 2880 
CTCCACAAGT CACGGCCGCA GACGAAGCCG TGGTCTCCAG GCTCCAACGG CCACAGGGAC 2940 
TGCAAGAGGC AG AAACTTGT CTTCGATGAT ATGCCTCGCA GTGCCGATTA TTTTATGCAA 3000 
GAAGCTAAAC GAATGAAGCA TAAAGCAGAT GCAATGGTGG AAAAGTTTGG AAAGGCTTTG 3060 
AACTATGCTG AAGCAGCATT GTCGTTTATC GAGTGTGGAA ATGCAATGGA ACAAGGCCCC 3120 
ATGGAATCCA AATCTCCTTA TTACCTGATG TATTCAGAAA CAGTAGAGCT CATCAGGTAT 3180 
GCTATGAGAC TAAAAACCCA CTCAGGCCCC A ATGCCACAC CAGAAGACAA ACAACTGGCT 3240 
GCATTATGTT ACCGATGCCT GGCCCTCCTG TACTGGCGG A TGTTTCGACT CAAAAGGGAC 3300 
CACGCTGTAA AGTATTCAAA AGCACTAATC GACTATTTCA AGAACTCATC TAAAGCCGCC 3360 
CAAGCCCCAT CTCCGTGGGG GGCCAGTGGA AAGAGCACTG GAACCCCATC CCCCATTTCT 3420 
CCCA ACCCCT TTCCCGGCAG CTCCGTGGGG TCTCAGGGCA GCCTCTCCAA CGCCAGCGCC 3480 
CTGTCCCCGT CG ACCATOGT CAGCATCCCA CAGCGCATCC ACCAGATGGC GGCCAACCAC 3540 
GTCAGCATCA CCAACAGCAT CCTGCACAGC TACGACTACT GGGAGATGGC CGACAACCTG 3600 
GCCAAGGAAA ACCGAGAATT CTTCAACGAC CTGGATCTGC TCATGGGGCC GGTCACCCTG 3660 
CACAGCAGCA TGGAGCACCT GGTCCAGTAC TCCCAACAGG GCCTGCACTG GCTGCGGAAC 3720 
AGCG CCCACC TGJCATAGGG ACCTCACCCT GGGGCCAGAG TGGGCTCTGG TCTCCACAGA 3780 
TGGCTCAACG TTTTTGGACA CTGTGCTACT GAAACTCCCA GCCACAGCAT TTATAGACTG 3840 
CGGTGAACAT TTCCTCA 

Seq ID NO: 46 Protein sequence:
Protein Accession #: NP_002276

1 11 21 31 41 51 
I I I I I I 

MDSFDLALLQ EWDLESLCVY EPDRNALRRK ERERRNQETQ QDDGTFNSSY SLFSEPYKTN 60 
KGDEL SNR1Q NTLGNYDEMK DFLTDRTNQS HLVG VPKPGV PQTPVNKIDE HFVADSRAQN 120 
QPSSICSTTT STPAAVPVQQ SKRGTMGWQK AGHPPSDG QQ RATQQGSLRT IXGDG VGRQQ 1 80 
PRAKQVCNVE VGLQTQERPP AMAAKHSSSG HCVQNFPPSL ASKPSLVQQK PTAYVRPMDG 240 
QDQAPDESPK LKSSSETS VH GTSYRG VPAS KPEPARAKAK LSKFSIPKQO EESRSGETNS 300
CVEE1IREMT WLPPLSAIQA PQKVEPTKFP FPNKDSQLVS SGHNNPKKGD AEPESPDNGT 360 
SNTSMLEDDL KLSSDEEENE QQA AQRTALR ALSDSA WQQ PNCRTSVPSS KGSSSSSSSG 420 
TSSSSSDSES SSGSDSETES SSSESEGSKP PHFSSPEAEP ASSNKWQLDK WLNKVNPHKP 480 
P1L1QNESHG SESNQYYNPV KEDVQDCGKV PDVCQPSLRE KEDCSTCKEE QRPRTANKAP 540 
GSKGVKQKSP PA AVAVAVSA A APPPAVPCA PAENAPAPAR RSAGKKPTRR TERTSAGDGA 600 
NCHRPEEPAA ADALGTSVW PPEPTKTRPC GNNRASHRKE LRSSVTCEKR RTRGLSRTVP 660 
KSKEFIETES SSSSSSSDSD LESEQEEYPL SKAQTVAASA SSGNDQRLKE AAANGGSGPR 720 
APVGSINART TSDIAKELEE QFYTLVPFGR NELLSPLKDS DEIRSLWVKI DLTLLSRIPE 780 
HLPQEPGVLS APATKDSESA PPSHTSDTPA EKALPKSKRK RKCDNEDDYR EIKKSQOEKD 840 
SSSRLATSTS NTLSANHCNM NINSVAIPIN KNEKMLRSPI SPLSDASKHK YTSEDLTSSS 900 
RPNGNSLFTS ASSSKKPKAD SQLQPHGGDL TKAAHNNSEN IPLHKSRPQT KPWSPGSNGH 960 
RDCKRQKLVF DDMPRSADYF MQEAKRMKHK ADAMVEKFGK ALMYAEAALS FIECGNAMEQ 1020 
GPMESKSPYY LMYSETVELI RYAMRLKTHS GPNATPEDKQ LAALCYRCLA LLYWRMFRLK 1080 
RDHAVKYSKA LIDYFKNSSK AAQAPSPWGA SGKSTGTPSP ISPNPFPGSS VGSQGSLSNA 1 140 
SALSPSTIVS IPQRIHQM AA NHVSITNSIL HSYDYWEMAD NLAKENREFF NDLDLLMGPV 1200 
TLHSSMEHLV QYSQQGLHWL RNSAHLS 

Seq ID NO: 47 DNA sequence

Nucleic Acid Accession #: NM_033151

Coding sequence: 351-4499 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

ACTGGGATAA AGCAAGAAGA CTGATTTTAT G AGCAGGGGT TTGATACATC AAAGG AGATT 60 
GCCCAGGATC AAGGGTGCGG TGTTGGGGGT GGGTTGGGGA GGGTGGTTAG AGAAGGTTTC 120 
ACTAAGTG AT TTGGGCCTGA GGCCTGAGAA GATGTTTAAA AAGAGGGATC AAGCACAGGC 1 80 
TAAGG AG AGG AAAGAGCAGG CACCCAAACC TCTGCATGGC CCCAATATGC TCCCTGCAGG 240 
GTAGTGCCCC CTCTTCTGGC TGCTCAAGGC GAGATCTAAG CTTCTTCTAA CTCCTGCTGT 300 
CTTTTCATAT TCTCTG ATTC TGGGAAACGA AG AATTGGCA GGAACTGAAA ATG ACTAGGA 360 
AGAGGACATA CTGGGTGCCC AACTCTTCTG GTGGCCTCGT GAATCGTGGC ATCGACATAG 420 
GCGATGACAT GGTTTCAGGA CTTATTTATA AAACCTATAC TCTCCAAGAT GGCCCCTGGA 480 
GTCAGCAAGA GAGAAATCCT GAGGCTCCAG GGAGGGCAGC TGTCCCACCG TGGGGGAAGT 540 
ATGATGCTGC CTTGAGAACC ATGATTCCCT TCCGTCCCAA GCCGAGGTTT CCTGCCCCCC 600 
AGCCCCTGGA CAATGCTGGC CTGTTCTCCT ACCTCACCGT GTCATGGCTC ACCCCGCTCA 660 
TGATCCAAAG CTTACGGAGT CGCTTAGATG AGAACACCAT CCCTCCACTG TCAGTCCATG 720 
ATGCCTCAGA CAAAAATGTC CAAAGGCTTC ACCGCCTTTG GGAAGAAGAA GTCTCAAGGC 780 
GAGGGATTGA AAAAGCTTCA GTGCTTCTGG TGATGCTGAG GTTCCAGAGA ACAAGGTTGA 840 
TTTTCGATGC ACTTCTGGGC ATCTGCTTCT GCATTGCCAG TGTACTCGGG CCAATATTGA 900 
TTATACCAAA GATCCTGGAA TATTCAGAAG AGCAGTTGGG GAATGTTGTC CATGGAGTGG 960 
GACTCTGCTT TGCC Cmi ICTCTCCGAAT GTGTGAAGTC TCTGAGTTTC TCCTCCAGTT 1020 
GGATCATCAA CCAACGCACA GCCATCAGGT TCCGAGCAGC TGTTTCXrTCC TTTGCCTTTG 1080 
AGAAGCTCAT CCAATTTAAG TCTGTAATAC ACATCACCTC AGG AG AGG CC ATCAGCTTCT 1 140 
TCACCGGTGA TGTAAACTAC CTGTTTGAAG GGGTGTGCTA TOGACCCCTA GTACTGATGA 1200 
CCTGCGCATC GCTGGTCATC TGCAGCATTT CTTCCTACTT CATTATTGGA TACACTGCAT 1260 
TTATTGCCAT CTTATGCTAT CTCCTGGTTT TCCCACTGGC GGTATTCATG ACAAGAATGG 1320 
CTGTGAAGGC TC AG CATC AC ACATCTGAGG TCAGCGACCA GCGCATCCGT GTGACCAGTG 1380 
AAGTTCTCAC TTGCATTAAG CTGATTAAAA TGTACACATG GGAGAAACCA TTTGCAAAAA 1440 
TCATTGAAGA CCTAAGAAGG AAGGAAAGGA AACTATTGGA GAAGTGCGGG CTTGTCCAGA 1500 
GCCTGACAAG TATAACCTTG TTCATCATCC CCACAGTGGC CACAGCGGTC TGGGTTCTCA 1560 
TCCACACATC CTTAAAGCTG AAACTCACAG CGTCAATGGC CTTCAGCATG CTGGCCTCCT 1620 
TG AATCTCCT TCGGCTGTCA GTGTTCTTTG TGCCTATTGC AGTCAAAGGT CT CACG AATT 1680 
CCAAGTCTGC AGTGATGAGG TTCAAG AAGT TTTTCCTCCA GG AG AGCCCT GTTTTCTATG 1 740 
TCCAGACATT ACAAGACCCC AGCAAAGCTC TGGTCTTTGA GGAGGCCACC TTGTCATGGC 1800 
AACAGACCTG TCCCGGGATC GTCAATGGGG CACTGGAGCT GGAGAGGAAC GGGCATGCTT 1860
CTGAGGGGAT GACCAGGCCT AGAGATGCCC TCGGGCCAGA GGAAGAAGGG AACAGCCTGG 1920 
GCCCAGAGTT GCACAAGATC AACCTGGTGG TGTCCAAGGG GATGATGTTA GGGGTCTGCG 1980 
GCAACACGGG GAGTGGTAAG AGCAGCCTGT TGTCAGCCAT CCTGGAGG AG ATGCACTTGC 2040 
TCGAGGGCTC GGTGGGGGTG CAGGGAAGCC TGGCCTATGT CCCCCAGCAG GCCTGGATCG 2100 
TCAGCGGGAA CATCAGGGAG AACATCCTCA TGGGAGGCGC ATATGACAAG GCCCGATACC 2160 
TCCAGGTGCT CCACTGCTGC TCCCTGAATC GGGACCTGGA ACTTCTGCCC TTTGG AG ACA 2220 
TGACAGAGAT TGGAGAGCGG GGCCTCAACC TCTCTGGGGG GCAGAAACAG AGGATCAGCC 2280 
TGGCCCGCGC CGTCTATTCC GACCGTCAG A TCTACCTGCT GG ACGACCCC CTGTCTGCTG 2340 
TGG ACGCCCA CGTGGGGAAG CACATTTTTG AGGAGTGCAT T AAGA AGACA CTCAGGGGGA 2400 
AGACGGTCGT CCTGGTGACC CACCAGCTGC AGTACTTAGA ATTTTGTGGC CAGATCATTT 2460 
TGTTGGAAAA TGGGAAAATC TGTGAAAATG GAACTCACAG TGAGTTAATG CAGAAAAAGG 2520 
GGAAATATGC CCAACTTATC CAGAAGATGC ACA AGG A AG C CACTTCGGAC ATGTTGCAGG 2580 
ACACAGCAAA GATAGCAGAG AAGCCAAAGG TAGAAAGTCA GGCTCTGGCC ACCTCCCTGG 2640 
AAGAGTCTCT CAACGGAAAT GCTGTGCCGG AGCATCAGCT CACACAGG AO GAGGAGATGG 2700 
AAGAAGGCTC CTTGAGTTGG AGGGTCTACC ACCACTACAT CCAGGCAGCT GGAGGTTACA 2760 
TGGTCTCTTG CATAATTTTC TTCTTCGTGG TGCTGATCGT CTTCTTA ACG ATCTTCAGCT 2820 
TCTGGTGGCT GAGCTACTGG TTGGAGCAGG GCTCGGGGAC CAATAGCAGC CGAGAGAGCA 2880 
ATGG AACCAT GGCAGACCTG GGCAACATTG CAGACAATCC TCAACTGTCC TTCTACCAGC 2940 
TGGTGTACGG GCTCAACGCC CTGCTCCTCA TCTGTGTG GG GGTCTGCTCC TCAGGGATTT 3000 
TCACCAAAGT CACGAGGAAG GCATCCACGG CCCTGCACAA CAA OCTCT TC AAC AAGGT TT 3060 
TCCGCTGCCC CATGAGTTTC TTTGACACCA TCCCAATAGG CCGGCTTTTG AACTGCTTCG 3120 
CAGGGGACTT GGAACAGCTG GACCAGCTCT TGCCCATCTT TTCAGAGCAG TTCCTGGTCC 3180
TGTCCTTAAT GGTG ATCGCC GTCCTGTTGA TTGTCAGTGT GCTGTCTCCA TATATCCTGT 3240 
TAAtGGGAGC CATAATCATG GTTATTTGCT TCATTTATTA TATGATGTTC AAGAAGGCCA 3300 
TCGGTGTGTT CAAGAGACTG GAGAACTATA GCCGGTCTCC TTTATTCTCC CACATCCTCA 3360 
ATTCTCTGCA AGGCCTG AGC TCCATCCATG TCTATGGAAA AACTGAAGAC TTCATCAGCC 3420 
AGTTTAAG AG GCTGACTGAT GCGCAG AATA ACTAOCTGCT GTTGTTTCTA TCTTCCACAC 3480 
GATGGATGGC ATTG AGGCTG G AG ATCATG A CCAACCTTGT GACCTTGGCT GTTGCCCTGT 3540 
TCGTGGCTTT TGGCATTTCC TCCACCCCCT ACTCCTTTAA AGTCATGGCT GTCAACATCG 3600 
TGCTGCAGCT GGCGTCCAGC TTCCAGGCCA CTGCCCGGAT TGGCTTGGAG ACAGAGGCAC 3660 
AGTTCACGGC TGTAG AGAGG ATACTGCAGT ACATG AAGAT GTGTGTCTCG GAAGCTCCTT 3720 
TACACATGGA AGGCACAAGT TGTCCCCAGG GGTGGCCACA GCATGGGGAA ATCATATTTC 3780 
AGGATTATCA CATGAAATAC AGAGACAACA CACCCACCGT OCTTCACGGC ATCAACCTGA 3840 
CCATCCGCGG CCACGAAGTG GTGGGCATCG TGGGAAGGAC GGGCTCTGGG AAGTCGTCCT 3900 
TGGGCATGGC TCTCTTCCGC CTGGTGGAGC CCATGGCAGG CCGGATTCTC ATTGACGGCG 3960 
TGGACATTTG CAGCATCGOC CTGGAGGACT TGCX3GTCCAA GCTCTCAGTG ATCCCTCAAO 4020 
ATCCAGTGCT GCTCTCAGGA ACCATCAGAT TCAACGTAGA TCCCTTTGAC CGTCACACTG 4080 
ACCAGCAGAT CTGGGATGCC TTGGAGAGGA CATTCCTGAC CAAGGCCATC TCAAAGTTCC 4140 
CCAAAAAGCT GCATACAGAT GTGGTGGAAA ACGGTGGAAA CTTCTCTGTG GGGGAGAGGC 4200 
AGCTGCTCTG CATTGCCAGG GCTGTGCTTC GCAACTCCAA GATCATCCTT ATCGATGAAG 4260 
CCACAGCCTC CATTGACATG GAGACAGACA CCCTGATCCA GCGCACAATC CGTGAAGCCT 4320 
TCCAGGGCTG CACCGTGCTC GTCATTGCCC ACCGTGTCAC CACTGTGCTG AACTGTGACC 4380 
ACATCCTGGT TATGGGCAAT GGGAAGOTGG TAGAATTTGA TCGGCCGGAG GTACTGCGGA 4440 
AGAAGCCTGG GTCATTGTTC GCAGCCCTCA TGGCCACAGC CACTTCTTCA CTGAGA3£AG 4500 
GAGATGTGGA GACTTCATQG AGGCTGGCAG CTGAGCTCAG AGGTTCACAC AGGTGCAGCT 4560 
TCGAGGCCCA CAGTCTGCGA CCTTCTTGTT TGGAGATGAG AACTTCTCCT GGAAGCAGGG 4620 
GTAAATGTAG GGGGGGTGGG GATTGCTGGA TGGAAACCCT GGAATAGGCT ACTTGATGGC 4680 
TCTCAAGACC TTAGAAOCCC AG AACCATCT AAG ACATGGG ATTCAGTG AT CATGTGQTTC 4740 
TCCTTTTAAC TTACATGCTG AATA ATTTTA TAATAAGGTA AAAGCTTATA GTTTTCTGAT 4800 
CTGTGTTAGA AGTGTTGCAA ATGCTGTACT GACTTTGTAA AATATAAAAC TAAGGAAAAC 4860 
TC 

Seq ID NO: 48 Protein sequence: 
Protein Accession #: NP_149163.2

MTRKRTYWVPNSSGGLVNRGIDIGDDMVSGLIYKTYTLQIXjPWSQQERNPEAPGRAAVPPW 

M IPFRPKPRFPAPQPLDN AG LFS YLTVS WLTPI^IQSIJISRLDENTIPPLSVHDASDKNVQRLHRLWEEE 

VSRRGIEXASVLLVMLRFQRTTUJffDALl^ICFCIASVUjPIL^ 

l^ecvkslsfssswiinqrtairfra a vssfafekliqfksvihitsg baisfftgd vnylfeg vcy g pl 

vlitcaslvicsissyhigytafiaiixtyllvf^ 

likmytoekpfakiiedlrrkerkllekcglvqsltsltljtiptvatavw 

msimlrlsvffvpiavkgltnsksavmrfkkfflqe^pvfyvqtujdpska^ 

wgalelfjwghasegmtrprdalgpeeegnslgpelhkinlw^ 

mhlxegsvgvqgsl^yvpqqawivsgnirenimggaydkarylqvlhccsln^ 

glnlsggqkqrisij^rawsdrqiyllddpl^vdahvgkhifeecikktlrgkwvlvto 

qiiixengkicengthseiaiqkkgkyaqliqkmhkeatsdmlqdtakiaekpkvesqalatsl^ 

avpehqltqeeemeegslswrvyhhyiqaaggymvscotff^vliwltifsfwwi^ywl 

resngtmadlgniadkpqi^fyqlwglnaixlicvgvcssgiftkvtrkastalhnki^ 

fdtipigrllncfagdlequxjllpifseqflvi^lmviavi^ 

kkaigvfkrlekr'srspljshilnslqglssihvygkt^ 

eimtmvtlavalfvafgisstpysfkvmavnivixji^ssfq 

eaplhmegtscpqgwpqhgehfqdyhmkyrdntptvlhgi^^ 

lvepmagpjudgvdicsigledlrsklsvipqdpvu^gtiriw^ 

SKFPKKLHTDVVEhraGNFSVGERQLLCIARAVUW 
VlAHRVrrVLNCDHILVMGNGKWEFDRPEVUlKKPGSLFAAL^ 

Seq ID NO: 49 DNA sequence

Nucleic Acid Accession #: NM_033419

Coding sequence: 18-980 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

CGAGCCAGGG AGAAAGOATO GCCGGCCTGG CGGCGCGGTT GGTCCTGCTA GCTGQQGCAG 60 
CGGCGCTGGC GAGCGGCTCC CAGGGCGACC GTGAGCCGGT GTACCGCGAC TGCGTACTGC 120 
AGTGCGAAG A GCAGAACTGC TCTGGGGGCG CTCTGAATCA CTTCCGCTCC CGCCAGCCAA 1 80 
TCTACATG AG TCTAGCAGGC TGG ACCTGTC GGGACGACTG TAAGTATGAG TGTATGTGGG 240 
TCACCGTTGG GCTCTACCTC CAGGAAGGTC ACAAAGTGCC TCAGTTCCAT GGCAAGTGGC 300 
CCTTCTCCCG GTTCCTGTTC TTTCAAG AGC CGGC ATCGGC CGTGGCCTCG TTTCTCAATG 360 
GCCTGGCCAG CCTGGTGATG CTCTGCCGCT ACCGCACCTT CGTGCCAGCC TCCTCCCCCA 420 
TGTACCACAC CTGTGTGGCC TTCGCCTGGO TGTCCCTCAA TGCATGGTTC TGGTCCACAG 480 
TTTTCCACAC CAGGGACACT GACCTCACAG AGAAAATGGA CTACTTCTGT GCCTCCACTG 540 
TCATCCTACA CTCAATCTAC CTGTGCTGCG TCAGGACCGT GGGGCTGCAG CACCCAGCTG 600 
TGGTCAGTGC CTTCCGGGCT CTCCTGCTGC TCATGCTGAC CGTGCACGTC TCCTACCTGA 660 
GCCTCATCCG CTTCG ACTAT GGCTACAACC TGGTGGCCAA CGTGGCTATT GGCCTGGTCA 720 
ACGTGGTGTG GTGGCTGGCC TGGTGCCTGT GGAACCAGCG GCGGCTGCCT CACGTGCGCA 780 
AGTGCGTGGT GGTGGTCTTG CTGCTGCAGG GGCTGTCCCT GCTCGAGCTG CTTGACTTCC 840 
CACCGCTCTT CTGGGTCCTG GATGCCCATG CCATCTGGCA CATCAGCACC ATCCCTGTCC 900 
ACGTCCTCTT TTTCAGCTTT CTGG AAGATG ACAGCCTGTA CCTGCTG AAG GAATCAGAGG 960 
ACAAGTTCAA GCTGG ACTGA AGACCTTGGA GCGAGTCTGC CCCAGTGGGG ATCCTGCCCC 1020
CGCCCTGCTG GCCTCCCTTC TCCCCTCAAC CCTTGAGATG ATTTTCTCTT TTCAACTTCT 1080 
TGAACTTGGA CATG A AGGAT GTGGGCCCAG AATCATGTGG CCAGCCCACC CCCTGTTGGC 1 140 
CCTCACCAGC CTTGGAGTCT GTTCTAGGGA AGGCCTCCCA GCATCTGGGA CTCGAGAGTG 1200 
GGCAGCCCCT CTACCTCCTG GAGCTGAACT GGGGTGGAAC TGAGTGTGCT CTTAGCTCTA 1260 
CCGGGAGGAC AGCTGCCTGT TTCCTCCCCA TCAGCCTCCT CCCCACATCC CCAGCTGCCT 1320 
GGCTGGGTCC TGAAGCCCTC TGTCTACCTG GGAGACCAGG GACCACAGGC CTTAGGGATA 1380 
CAGGGGGTCC CCTTCTGTTA CCACCCCCCA CCCTCCTCCA GGACACTACT AGGTGGTGCT 1440
GGATGCTTGT TCTTTGGCCA GCCAAGGTTC ACGGCGATTC TCCCCATGGG ATCTTGAGGG 1500 
ACCAAGCTGC TGGGATTGGG AAGGAGTTTC ACCCTGACCA TTGCCCTAGC CAGGTTCCCA 1560
GGAGGCCTCA CCATACTCCC TTTCAGGG(X AGGGCTCCAG CAAGCCCAGG GCAAGGATCC 1620 
TGTGCTGCTG TCTGGTTGAG AGCCTGCCAC CGTQTGTCGG QAGTGTGGGC CAGGCTGAGT 1680 
GCATAGGTGA CAGGGCCGTG AGCATGGGCC TGGGTGTGTG TGAGCTCAGG CCTAGGTGCG 1740 
CAGTGTGGAG ACGGGTGTTG TCGGGGAAGA GGTGTGGCTT CAAAGTGTGT GTGTGCAGGG 1800 
GGTGGGTGTG TTAGCGTGGG TTAGGGGAAC GTGTGTGCGC GTGCTGGTGG GCATGTGAGA 1860 
TGAGTGACTG CCGGTGAATG TGTCCACAGT TGAGAGGTTG GAGCAGGATG AGGGAATCCT 1920 
GTCACCATCA ATAATCACTT GTGGAGCGCC AGCTCTGCCC AAGGCGCCAC CTGGGCGGAC 1980 
AGCCAGGAGC TCTCCATGGC CAGGCTGCCT GTGTGCATGT TCCCTGTCTG GTGCCCCTTT 2040 
GCCCGCCTCC TGCAAACCTC ACAGGGTCCC CACACAACAG TGCCCTCCAG AAGCAGCCCC 2100 
TCGGAGGCAG AGGAAGGAAA ATGGGGATGG CTGGGGCTCT CTCCATCCTC CTTTTCTCCT 2160 
TGCCTTCGCA TGGCTGGCCT TCCCCTCCAA AACCTCCATT CCCCTGCTGC CAGCCCCTTT 2220 
GCCATAGCCT GATTTTGGGG AGGAGGAAGG GGCGATTTGA GGGAGAAGGG GAGAAAGCTT 2280
ATGGCTGGGT CTGOTTTCTT CCCTTCCCAG AGGGTCTTAC TGTTCCAGGG TGGCCCCAGG 2340 
GCAGGCAGGG GCCACACTAT GCCTGCGCCC TGOTAAAGGT GACCCCTGCC ATTTACCAGG 2400 
AGCCCTGGCA TGTTCCTGCC CCACAGGAAT AGAATGGAGG GAGCTCCAGA AACTTTCCAT 2460 
CCCAAAGGCA GTCTCCGTGG TTGAAGCAOA CTGGATTTTT GCTCTGCCCC TGACCCCTTG 2520 
TCCCTCTTTG AGGGAGGGGA GCTATGCTAG GACTCCAACC TCAGGGACTC GGGTGGCCTG 2580
CGCTAGCTTC TTTTGATACT GAAAACTTTT AAGGTGGGAG GGTGGCAAGG GATGTGCTTA 2640
ATAAATCAAT TCCAAGCCTC AAAAAAAAAA AAAAAAAAAA AA

Seq ID NO: 50 Protein sequence:
Protein Accession #: NP_219487.1

1 11 21 31 41 51

I I I I I I 

MKDVGPESCG QPTPCWPSPA LESVLGKASQ HLGLESGQPL YLLELNWGGT ECALSSTGRT 60 
AACFLPISLL PTSPAAWLGP EALCLPGRPG TTGLRDTGGP LLLPPPTLLQ DTTRWCWMLV 120 
LWPAKVHGDS PHGILRDQAA GIGKEFHPDH CPSQVPRRPH HTPFQGQGSS KPRARILCCC 180
LVESLPPCVG SVGQAECIGD RAVSMGLGVC ELRPRCAVWR RVLSGKRCGF KVCVCRGWVC 

Seq ID NO: 51 DNA sequence

Nucleic Acid Accession #: XM_059098.1

Coding sequence: 178-618 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 
I I I I I I 

GATGTACACT CTGAAGTGAG CACATTCCTG TTGGCAGGAC ATGACACCTT GGCAGCAAGC 60 
ATCTCCTGGA TCCTTTACTG CCTGGCTCTG AACCCTGAGC ATCAAGAGAG ATGCCGGGAG 120
GAGGTCAGGG GCATCCTGGG GGATGGGTCT TCTATCACTT GGGACCAGCT GGQTOAGATG 180 
TCGTACACCA CAATGTGCAT CAAGGAGACG TGCCGATTGA TTCCTGCAGT CCCGTCCATT 240
TCCAGAGATC TCAGCAAGCC ACTTACCTTC CCAGATGGAT GCACATTGCC TGCAGGOATC 300
ACCGTGGTTC TTAGTATTTG GGGTCTTCAC CACAACCCTG CTGTCTGGAA AAACCCAAAG 360 
GTCTTTGACC CCTTGAGGTT CTCTCAGGAG AATTCTGATC AGAGACACCC CTATGCCTAC 420
TTACCATTCT CAGCTGGATC AAGGAACTGC ATTGGGCAGG AGTTTGCCAT GATTGAGTTA 480
AAGGTAACCA TTGCCTTGAT TCTGCTCCAC TTCAGAGTGA CTCCAGACCC CACCAGGCCT 540 
CTTACTTTCC CCAACCATTT TATCCTCAAG CCCAAGAATG GGATGTATTT GCACCTGAAG 600
AAACTCTCTG AATGTTAGAT CTCAGGQTAC AATGATTAAA CGTACTTTGT TTTTCOAAGT 660
TAAATTTACA GCTAATGATC CAAGCAGATA GAAAGGGATC AATGTATGGT GGGAGOATTG 720 
GAGGTTGGTG GGATAGGGGT CTCTGTGAAG AGATCCAAAA TCATTTCTAG GTACACAGTG 780
TGTCAGCTAG ATCTGTTTCT ATATAACTTT GGGAGATTTT CAGATCTTTT CTGTTAAACT 840 
TTCACTACTA TTAATGCTGT ATACACCAAT AGACTTTCAT ATATTTTCTG TTGTTTTTAA 900 
AATAGTTTTC AGAATTATGC AAGTAATAAG TGCATGTATG CTCACTGTCA AAAATTCCCA 960
ACACTAGAAA ATCATGTAGA ATAAAAATTT TAAATCTCAC TTCACTTAGC CGACATTCCA 1020 
TGCCCTGACC AATCCTACTG CTTTTCCTAA AAACAGAATA ATTTGGTGTG CATTCTTTCA 1080 
GACTTTTTCC TATACATTTT ATATGTAGAA ATGTAGCAAT GTATTTGTAT AGATGTGATC 1140
ATTCCTATAT TGTTATTGAT TTTTTTCACT TAATAAAAAT TCACCTTATT CCTT 

Seq ID NO: 52 Protein sequence:
Protein Accession #: XP_059098.1

1 11 21 31 41 51 
I I I I I I

MSYTTMCIKE TCRLIPAVPS ISRDLSKPLT FPDGCTLPAG ITVVLSIWGL HHNPAVWKNP 60
KVFDPLRFSQ ENSDQRHPYA YLPFSAGSRN CIGQEFAMIE LKVTIALILL HFRVTPDPTR 120 
PLTFPNHFIL KPKNGMYLHL KKLSEC 
Seq ID NO: 53 DNA sequence

Nucleic Acid Accession #: NM_030916 

Coding sequence: 1-1533 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

ATGCCCCTGT CCCTGGGAGC CGAGATGTGG GGGCCTGAGG CCTGGCTGCT GCTGCTGCTA 60 
CTGCTGGCAT CATTTACAGG CCGGTGCCCC GCGGGTGAGC TGGAGA(XTC AGACGTGGTA 120 
ACTGTGGTGC TGGGCCAGGA CGCAAAACTG CCCTGCTTCT ACCGAGGGGA CTCCGGCGAG 180 
CAAGTGGGGC AAGTGGCATG GGCTCGGGTG GACGCGGGCG AAGGCGCCCA GGAACTAGCG 240 
CTACTGCACT CCAAATACGG GCTTCATGTG AGCCCGGCTT ACGAGGGCCG CGTGGAGCAG 300 
CCGCCGCCCC CACGCAACCC CCTGGACGGC TCAGTGCTCC TGCGCAACGC AGTGCAGGCG 360 
GATGAGGGCG AGTACGAGTG CCGGGTCAGC ACCTTCCCCG CCGGCAGCTT CCAGGCGCGG 420 
CTGCGGCTCC GAGTGCTGGT GCCTCCCCTG CCCTCACTGA ATCCTGGTCC AGCACTAGAA 480 
GAGGGCCAGG GCCTGACCCT GGCAGCCTCC TGCACAGCTG AGGGCAGCCC AGCCCCCAGC 540 
GTGACCTGGG ACACGGAGGT CAAAGGCACA ACGTCCAGCC GTTCCTTCAA GCACTCCCGC 600 
TCTGCTGCCG TCACCTCAGA GTTCCACTTG GTGCCTAGCC GCAGCATGAA TGGGCAGCCA 660
CTGACTTGTG TGGTGTCCCA TCCTGGCCTG CTCCAGGACC AAAGGATCAC CCACATCCTC 720
CACGTGTCCT TCCTTGCTGA GGCCTCTGTG AGGGGCCTTG AAGACCAAAA TCTGTGGCAC 780
ATTGGCAGAG AAGGAGCTAT GCTCAAGTGC CTGAGTGAAG GGCAGCCCCC TCCCTCATAC 840
AACTGGACAC GGCTGGATGG GCCTCTGCCC AGTGGGGTAC GAGTGGATGG GGACACTTTG 900 
GGCTTTCCCC CACTGACCAC TGAGCACAGC GGCATCTACG TCTGCCATGT CAGCAATGAG 960 
TTCTCCTCAA GGGATTCTCA GGTCACTGTG GATGTTCTTG ACCCCCAGGA AGACTCTGGG 1020 
AAGCAGGTGG ACCTAGTGTC AGCCTCGGTG GTGGTGGTGG GTGTGATOGC CGCACTCTTG 1080 
TTCTGCCTTC TGGTGGTGGT GGTGGTGCTC ATGTCCCGAT ACCATCGGCG CAAGGCCCAG 1140
CAGATGACCC AGAAATATGA GGAGGAGCTG ACCCTGACCA GGGAGAACTC CATCCGGAGG 1200
CTGCATTCCC ATCACACGGA CCCCAGGAGC CAGCCGGAGG AGAGTGTAGG GCTGAGAGCC 1260 
GAGGGCCACC CTGATAGTCT CAAGGACAAC AGTAGCTGCT CTGTGATGAG TGAAGAGCCC 1320 
GAGGGCCGCA GTTACTCCAC GCTGACCACO GTGAGGGAGA TAGAAACACA GACTGAACTG 1380 
CTGTCTCCAG GCTCTGGGCG GGCCGAGGAG GAGGAAGATC AGGATGAAGG CATCAAACAG 1440 
GCCATGAACC ATTTTGTTCA GGAGAATGGG ACCCTACGGG CCAAGCCCAC GGGCAATGGC 1500 
ATCTACATCA ATGGGCGGGG ACACCTGGTC TGA 

Seq ID NO: 54 Protein sequence: 
Protein Accession #: NP_112178.1

1 11 21 31 41 51
I I I I I I 

MPLSLGAEMW GPEAWLLLLL LLASFTGRCP AGELETSDVV TVVLGQDAKL PCFYRGDSGE 60
QVGQVAWARV DAGEGAQELA LLHSKYGLHV SPAYEGRVEQ PPPPRNPLDG SVLLRNAVQA 120 
DEGEYECRVS TFPAGSFQAR LRLRVLVPPL PSLNPGPALE EGQGLTLAAS CTAEGSPAPS 180 
VTWDTEVKGT TSSRSFKHSR SAAVTSEFHL VPSRSMNGQP LTCVVSHPGL LQDQRITHIL 240
HVSFLAEASV RGLEDQNLWH IGREGAMLKC LSEGQPPPSY NWTRLDGPLP SGVRVDGDTL 300
GFPPLTTEHS GIYVCHVSNE FSSRDSQVTV DVLDPQEDSG KQVDLVSASV VVVGVIAALL 360
FCLLVVVVVL MSRYHRRKAQ QMTQKYEEEL TLTRENSIRR LHSHHTDPRS QPEESVGLRA 420
EGHPDSLKDN SSCSVMSEEP EGRSYSTLTT VREIETQTEL LSPGSGRAEE EEDQDEGIKQ 480
AMNHFVQENG TLRAKPTGNG IYINGRGHLV 

Seq ID NO: 55 DNA sequence 

Nucleic Acid Accession #: AF007170.1

Coding sequence: 73-1725 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 
I I I I I I 

AAGGAGGCGG CCTCCGGGAA AAGCGACCGC AGGACTCCTG AGAGCAGCCT CCATGAGGCC 60
CTGGACCAGT GCATGACCGC CCTGGACCTC TTCCTCACCA ACCAGTTCTC AGAAGCACTC 120 
AGCTACCTCA AGCCCAGAAC CAAGGAAAGC ATGTACCACT CACTGACATA TGCCACCATC 180 
CTGGAGATGC AGGCCATGAT GACCTTTGAC CCTCAGGACA TCCTGCTTGC CGGCAACATG 240 
ATGAAGGAGG CACAGATGCT GTGTCAGAGG CACCGGAGGA AGTCTTCTGT AACAGATTCC 300 
TTCAGCAGCC TGGTGAACCG CCCCACGCTG GGCCAATTCA CTGAAGAAGA AATCCACGCT 360
GAGGTCTGCT ATGCAGAGTG CCTGCTGCAG CGAGCAGCCC TGACCTTCCT GCAGGACGAG 420 
AACATGGTGA GCTTCATCAA AGGCGGCATC AAAGTTCGAA ACAGCTACCA GACCTACAAG 480
GAGCTGGACA GCCTTGTTCA GTCCTCACAA TACTGCAAGG GTGAGAACCA CCCGCACTTT 540 
GAAGGAGGAG TGAAGCTTGG TGTAGGGGCC TTCAACCTGA CACTGTCCAT GCTTCCTACT 600
AGGATCCTGA GGCTGTTGGA GTTTGTGGGG TTTTCAGGAA ACAAGGACTA TGGGCTGCTG 660 
CAGCTGGAGG AGGGAGCGTC AGGGCACAGC TTCCGCTCTG TGCTCTGTGT CATGCTCCTG 720
CTGTGCTACC ACACCTTCCT CACCTTCGTG CTCGGTACTG GGAACGTCAA CATCGAGGAG 780 
GCCGAGAAGC TCTTGAAGCC CTACCTGAAC CGGTACCCTA AGGGTGCCAT CTTCCTGTTC 840
TTTGCAGGGA GGATTGAAGT CATTAAAGGC AACATTGATG CAGCCATCCG GCGTTTCGAG 900 
GAGTGCTGTG AGGCCCAGCA GCACTGGAAG CAGTTTCACC ACATGTGCTA CTGGGAGCTG 960 
ATGTGGTGCT TCACCTACAA GGGCCAGTGG AAGATGTCCT ACTTCTACGC CGACCTGCTC 1020
AGCAAGGAGA ACTGCTGGTC CAAGGCCACC TACATTTACA TGAAGGCCGC CTACCTCAGC 1080 
ATGTTTGGGA AGGAGGACCA CAAGCCGTTC GGGGACGACG AAGTGGAATT ATTTCGAGCT 1140
GTGCCAGGCC TGAAGCTCAA GATTGCTGGG AAATCTCTAC CCACAGAGAA GTTTGCCATC 1200 
CGGAAGTCCC GGCGCTACTT CTCCTCCAAC CCTATCTCGC TGCCAGTGCC TGCTCTGGAA 1260 
ATGATGTACA TCTGGAACGG CTACGCCGTG ATTGGGAAGC AGCCGAAACT CACGGATGGG 1320
ATACTTGAGA TTATCACTAA GGCTGAAGAG ATGCTGGAGA AAGGCCCAGA GAACGAGTAC 1380 
TCAGTGGATG ACGAGTGCTT GGTGAAATTG TTGAAAGGCC TGTGTCTGAA ATACCTGGGC 1440
CGTGTCCAGG AGGCCGAGGA GAATTTTAGG AGCATCTCTG CCAATGAAAA GAAGATTAAA 1500 
TATGACCACT ACTTGATCCC AAACGCCCTG CTGGAGCTGG CCCTGCTGCT TATGGAGCAA 1560 
GACAGAAACG AAGAGGCCAT CAAACTTTTG GAATCTGCCA AGCAAAACTA CAAGAATTAC 1620 
TCCATGGAGT CAAGGACACA CTTTCGAATC CAGGCAGCCA CACTCCAAGC CAAGTCTTCC 1680 
CTAGAGAACA GCAGCAGATC CATGGTCTCA TCAGTGTCCT TOTAGCTTTO TGCAGCAGTT 1740
CCGGGCTGGA AGACAGAGAC AGCTGGACAG AGCTCCTGAA AACATTTCAA AATACCCCCT 1800
CCCCCTGCCC TGCCCTGCCT TTGGGGTCCA CCGGCACTCC AGTTGGATGG CACAACATAG 1860 
TGTATCCGTG CAGAAGCCGA GCTGGCATTT TCACCAGTGT AGCCAAGGGC CTTTGCCAAG 1920 
GGCAGAGCAG GTGGAGCCCT CTGCCTGCCC TATCACACAT ACGGGTACTT GCTTTTCACT 1980
GTGATGTTTA AGAGAATGTA TGAACAGTTT ACATTTTCCT TAGAAATACA TTGATGGGAT 2040
CACAGTTGGC TTTAAAAACC AACAACAATC AACCACCTGT AAGTCTTTGT CTTCAOCTAT 2100 
TATCATCTGG AGGTAAATCT CTTTATATGA TGATGCCAAA GGGCAAATTG CTTTTCAAAT 2160 
TCAGCAAGTT CTCAGCTTGT GTGACGGAAG GTCCTTCAGA GGACCTGAGG AATGCCTGGG 2220 
AOAGGCTAAG CCTCAGGCTT CAATGCTTCT GGGGTTGGGC ATGAGGATGT ACACAGACAC 2280 
CCACTACCTT ACTACTCACA CTTCATTTCA CTCCTTTTGT AAATTTCCAA TTTAAAAATC 2340 
AAGCACGTCT TTTTAGTGAG ATAAAATCTG AGCTCTTCTG TAGAAAAATC AATCTCTACC 2400 
AGTAGAAAAT GCCAGGGCTT GATGGAAGAG CTGTGTAGCC CTTTCTATGC CAAAGCCAGG 2460 
AAATTTGGGG GGCAGGAGGA GGTTCTCAGA ATCCAGTCTG TATCTTTGCT GTATGCCAAA 2520 
CTGAAACCAC TGGGAATAAT TTATGAAACA TAAAAATCTT CTGTACTTCA CTCCAAGGTA 2580
CATTTGCTTA CTGACAGCAT TTTTGTTAAA ACTGTTATTC TTGAAAAAAA AAAAAAAAAA 2640



Seq ID NO: 56 Protein sequence:
Protein Accession #: AAC39582.1

1 11 21 31 41 51 

I I I I I I 

MTALDLFLTN QFSEALSYLK PRTKESMYHS LTYATILEMQ AMMTFDPQDI LLAGNMMKEA 60 
QMLCQRHRRK SSVTDSFSSL VNRPTLGQFT EEEIHAEVCY AECLLQRAAL TFLQDENMVS 120 
FIKGGIKVRN SYQTYKELDS LVQSSQYCKG ENHPHFEGGV KLGVGAFNLT LSMLPTRILR 180
LLEFVGFSGN KDYGLLQLEE GASGHSFRSV LCVMLLLCYH TFLTFVLGTG NVNIEEAEKL 240 
LKPYLNRYPK GAIFLFFAGR IEVIKGNIDA AIRRFEECCE AQQHWKQFHH MCYWELMWCF 300
TYKGQWKMSY FYADLLSKEN CWSKATYIYM KAAYLSMFGK EDHKPFGDDE VELFRAVPGL 360 
KLKIAGKSLP TEKFAIRKSR RYFSSNPISL PVPALEMMYI WNGYAVIGKQ PKLTDGILEI 420
ITKAEEMLEK GPENEYSVDD ECLVKLLKGL CLKYLGRVQE AEENFRSISA NEKKIKYDHY 480 
LIPNALLELA LLLMEQDRNE EAIKLLESAK QNYKNYSMES RTHFRIQAAT LQAKSSLENS 540
SRSMVSSVSL 

Seq ID NO: 57 DNA sequence 

Nucleic Acid Accession #: NM_006670.1

Coding sequence: 1-927 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I

ATGCCTGGGG GGTGCTCCCG GGGCCCCGCC GCCGGGGACG GGCGTCTGCG GCTGGCGCGA 60
CTAGCGCTGG TACTCCTGGG CTGGGTCTCC TCGTCTTCTC CCACCTCCTC GGCATCCTCC 120
TTCTCCTCCT CGGCGCCGTT CCTGGCTTCC GCCGTGTCCG CCCAGCCCCC GCTGCCGGAC 180
CAGTGCCCCG CGCTGTGCGA GTGCTCCGAG GCAGCGCGCA CAGTCAAGTG CGTTAACCGC 240 
AATCTGACCG AGGTGCCCAC GGACCTGCCC GCCTACGTGC GCAACCTCTT CCTTACCGGC 300 
AACCAGCTGG CCAGCAACCA CTTCCTTTAC CTGCCGCGGG ATGTGCTGGC CCAACTGCCC 360 
AGCCTCAGGC ACCTGGACTT AAGTAATAAT TCGCTGGTGA GCCTGACCTA CGTGTCCTTC 420 
CGCAACCTGA CACATCTAGA AAGCCTCCAC CTGGAGGACA ATGCCCTCAA GGTCCTTCAC 480
AATGGCACCC TGGCTGAGTT GCAAGGTCTA CCCCACATTA GGGTTTTCCT GGACAACAAT 540 
CCCTGGGTCT GCGACTGCCA CATGGCAGAC ATGGTGACCT GGCTCAAGGA AACAGAGGTA 600
GTGCAGGGCA AAGACCGGCT CACCTGTGCA TATCCGGAAA AAATGAGGAA TCGGGTCCTC 660 
TTGGAACTCA ACAGTGCTGA CCTGGACTGT GACCCGATTC TTCCCCCATC CCTGCAAACC 720
TCTTATGTCT TCCTGGGTAT TGTTTTAGCC CTGATAGGCG CTATTTTCCT OCTGGTTTTG 780
TATTTGAACC GCAAGGGGAT AAAAAAGTGG ATGCATAACA TCAGAGATGC CTGCAGGGAT 840
CACATGGAAG GGTATCATTA CAGATATGAA ATCAATGCGG ACCCCAGATT AACAAACCTC 900
AGTTCTAACT CGGATGTCCT CGAGTGA

Seq ID NO: 58 Protein sequence 
Protein Accession #: NP_006661.1



1 11 21 31 41 51 
I I I I I I 

MPGGCSRGPA AGDGRLRLAR LALVLLGWVS SSSPTSSASS FSSSAPFLAS AVSAQPPLPD 60 
QCPALCECSE AARTVKCVNR NLTEVPTDLP AYVRNLFLTG NQLASNHFLY LPRDVLAQLP 120 
SLRHLDLSNN SLVSLTYVSF RNLTHLESLH LEDNALKVLH NGTLAELQGL PHIRVFLDNN 180
PWVCDCHMAD MVTWLKETEV VQGKDRLTCA YPEKMRNRVL LELNSADLDC DPILPPSLQT 240
SYVFLGIVLA LIGAIFLLVL YLNRKGIKKW MHNIRDACRD HMEGYHYRYE INADPRLTNL 300
SSNSDVLE 



Seq ID NO: 59 DNA sequence 

Nucleic Acid Accession #: NM_024022

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

ATGGGGGAAA ATGATCCGCC TGCTGTTGAA GCCCCCTTCT CATTCCGATC GCTTTTTGGC 60
CTTGATGATT TGAAAATAAG TCCTGTTGCA CCAGATGCAG ATGCTGTTGC TGCACAGATC 120 
CTGTCACTGC TGCCATTGAA GTTTTTTCCA ATCATCGTCA TTGGGATCAT TGCATTGATA 180
TTAGCACTGG CCATTGGTCT GGGCATCCAC TTCGACTGCT CAGGGAAGTA CAGATGTCGC 240
TCATCCTTTA AGTGTATCGA GCTGATAGCT CGATGTGACG GAGTCTCGGA TTGCAAAGAC 300
GGGGAGGACG AGTACCGCTG TGTCCGGGTG GGTGGTCAGA ATGCCGTGCT CCAGGTGTTC 360
ACAGCTGCTT CGTGGAAGAC CATGTGCTCC GATGACTGGA AGGGTCACTA CGCAAATGTT 420
GCCTGTGCCC AACTGGGTTT CCCAAGCTAT GTGAGTTCAG ATAACCTCAG AGTGAGCTCG 480
CTGGAGGGGC AGTTCCGGGA GGAGTTTGTG TCCATCGATC ACCTCTTGCC AGATGACAAG 540
GTGACTGCAT TACACCACTC AGTATATGTG AGGGAGGGAT GTGCCTCTGG CCACGTGGTT 600 
ACCTTGCAGT GCACAGCCTG TGGTCATAGA AGGGGCTACA GCTCACGCAT CGTGGGTGGA 660 
AACATGTCCT TGCTCTCGCA GTGGCCCTGG CAGGCCAGCC TTCAGTTCCA GGGCTACCAC 720
CTGTGCGGGG GCTCTGTCAT CACGCCCCTG TGGATCATCA CTGCTGCACA CTGTGTTTAT 780
GACTTGTACC TCCCCAAGTC ATGGACCATC CAGGTGGGTC TAGTTTCCCT GTTGGACAAT 840 
CCAGCCCCAT CCCACTTGGT GGAGAAGATT GTCTACCACA GCAAGTACAA GCCAAAGAGG 900 
CTGGGCAATG ACATCGCCCT TATGAAGCTG GCCGGGCCAC TCACGTTCAA TGAAATGATC 960
CAGCCTGTGT GCCTGCCCAA CTCTGAAGAG AACTTCCCCG ATGGAAAAGT GTGCTGGACG 1020 
TCAGGATGGG GGGCCACAGA GGATGGAGGT GACGCCTCCC CTGTCCTGAA CCACGCGGCC 1080 
GTCCCTTTGA TTTCCAACAA GATCTGCAAC CACAGGGACG TGTACGGTGG CATCATCTCC 1140 
CCCTCCATGC TCTGCGCGGG CTACCTGACG GGTGGCGTGG ACAGCTGCCA GGGGGACAGC 1200
GGGGGGCCCC TGGTGTGTCA AGAGAGOAGO CTGTGGAAGT TAGTGGGAGC GACCAGCTTT 1260
GGCATCGGCT GCGCAGAGGT GAACAAGCCT GGGGTGTACA CCCGTGTCAC CTCCTTCCTG 1320 
GACTGGATCC ACGAGCAGAT GGAGAGAGAC CTAAAAACCT GA

Seq ID NO: 60 Protein sequence 
Protein Accession #: NP_076927 

1 11 21 31 41 51
I I I I I I

MGENDPPAVE APFSFRSLFG LDDLKISPVA PDADAVAAQI LSLLPLKFFP IIVIGIIALI 60
LALAIGLGIH FDCSGKYRCR SSFKCIELIA RCDGVSDCKD GEDEYRCVRV GGQNAVLQVF 120
TAASWKTMCS DDWKGHYANV ACAQLGFPSY VSSDNLRVSS LEGQFREEFV SIDHLLPDDK 180
VTALHHSVYV REGCASGHVV TLQCTACGHR RGYSSRIVGG NMSLLSQWPW QASLQFQGYH 240
LCGGSVITPL WIITAAHCVY DLYLPKSWTI QVGLVSLLDN PAPSHLVEKI VYHSKYKPKR 300
LGNDIALMKL AGPLTFNEMI QPVCLPNSEE NFPDGKVCWT SGWGATEDGG DASPVLNHAA 360 
VPLISNKICN HRDVYGGIIS PSMLCAGYLT GGVDSCQGDS GGPLVCQERR LWKLVGATSF 420
GIGCAEVNKP GVYTRVTSFL DWIHEQMERD LKT 

Seq ID NO: 61 DNA sequence 

Nucleic Acid Accession #: NM_006475

Coding sequence: 28-2538 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51

I I I I I I

AACAGAACTG CAACGGAGAG ACTCAAGATG ATTCCCTTTT TACCCATGTT TTCTCTACTA 60
TTGCTGCTTA TTGTTAACCC TATAAACGCC AACAATCATT ATGACAAGAT CTTGGCTCAT 120
AGTCGTATCA GGGGTCGGGA CCAAGGCCCA AATGTCTGTG CCCTTCAACA GATTTTGGGC 180 
ACCAAAAAGA AATACTTCAG CACTTGTAAG AACTGGTATA AAAAGTCCAT CTGTGGACAG 240 
AAAACGACTG TTTTATATGA ATGTTGCCCT GGTTATATGA GAATGGAAGG AATGAAAGGC 300 
TGCCCAGCAG TTTTGCCCAT TGACCATGTT TATGGCACTC TGGGCATCGT GGGAGCCACC 360 
ACAACGCAGC GCTATTCTGA CGCCTCAAAA CTGAGGGAGG AGATCGAGGG AAAGGGATCC 420 
TTCACTTACT TTGCACCGAG TAATGAGGCT TGGGACAACT TGGATTCTGA TATCCGTAGA 480 
GGTTTGGAGA GCAACGTGAA TGTTGAATTA CTGAATGCTT TACATAGTCA CATGATTAAT 540 
AAGAGAATGT TGACCAAGGA CTTAAAAAAT GGCATGATTA TTCCTTCAAT GTATAACAAT 600
TTGGGGCTTT TCATTAACCA TTATCCTAAT GGGGTTGTCA CTGTTAATTG TGCTCGAATC 660 
ATCCATGGGA ACCAGATTGC AACAAATGGT GTTGTCCATG TCATTGACCG TGTGCTTACA 720
CAAATTGGTA CCTCAATTCA AGACTTCATT GAAGCAGAAG ATGACCTTTC ATCTTTTAGA 780 
GCAGCTGCCA TCACATCGGA CATATTGGAG GCCCTTGGAA GAGACGGTCA CTTCACACTC 840 
TTTGCTCCCA CCAATGAGGC TTTTGAGAAA CTTCCACGAG GTGTCCTAGA AAGGTTCATG 900
GGAGACAAAG TGGCTTCCGA AGCTCTTATG AAGTACCACA TCTTAAATAC TCTCCAGTGT 960
TCTGAGTCTA TTATGGGAGG AGCAGTCTTT GAGACGCTGG AAGGAAATAC AATTGAGATA 1020 
GGATGTGACG GTGACAGTAT AACAGTAAAT GGAATCAAAA TGGTGAACAA AAAGGATATT 1080 
GTGACAAATA ATGGTGTGAT CCATTTGATT GATCAGGTCC TAATTCCTGA TTCTGCCAAA 1140
CAAGTTATTG AGCTGGCTGG AAAACAGCAA ACCACCTTCA CGGATCTTGT GGCCCAATTA 1200 
GGCTTGGCAT CTGCTCTGAG GCCAGATGGA GAATACACTT TGCTGGCACC TGTGAATAAT 1260 
GCATTTTCTG ATGATACTCT CAGCATGGTT CAGCGCCTCC TTAAATTAAT TCTGCAGAAT 1320
CACATATTGA AAGTAAAAGT TGGCCTTAAT GAGCTTTACA ACGGGCAAAT ACTGGAAACC 1380 
ATCGGAGGCA AACAGCTCAG AGTCTTCGTA TATCGTACAG CTGTCTGCAT TGAAAATTCA 1440 
TGCATGGAGA AAGGGAGTAA GCAAGGGAGA AACGGTGCGA TTCACATATT CCGCGAGATC 1500
ATCAAGCCAG CAGAGAAATC CCTCCATGAA AAGTTAAAAC AAGATAAGCG CTTTAGCACC 1560 
TTCCTCAGCC TACTTGAAGC TGCAGACTTG AAAGAGCTCC TGACACAACC TGGAGACTGG 1620 
ACATTATTTG TGCCAACCAA TGATGCTTTT AAGGGAATGA CTAGTGAAGA AAAAGAAATT 1680
CTGATACGGG ACAAAAATGC TCTTCAAAAC ATCATTCTTT ATCACCTGAC ACCAGGAGTT 1740
TTCATTGGAA AAGGATTTGA ACCTGGTGTT ACTAACATTT TAAAGACCAC ACAAGGAAGC 1800
AAAATCTTTC TGAAAGAAGT AAATGATACA CTTCTGGTGA ATGAATTGAA ATCAAAAGAA 1860 
TCTGACATCA TGACAACAAA TGGTGTAATT CATGTTGTAG ATAAACTCCT CTATCCAGCA 1920 
GACACACCTG TTGGAAATGA TCAACTGCTG GAAATACTTA ATAAATTAAT CAAATACATC 1980 
CAAATTAAGT TTGTTCGTGG TAGCACCTTC AAAGAAATCC CCGTGACTGT CTATACAACT 2040
AAAATTATAA CCAAAGTTGT GGAACCAAAA ATTAAAGTGA TTGAAGGCAG TCTTCAGCCT 2100
ATTATCAAAA CTGAAGGACC CACACTAACA AAAGTCAAAA TTGAAGGTGA ACCTGAATTC 2160 
AGACTGATTA AAGAAGGTGA AACAATAACT GAAGTGATCC ATGGAGAGCC AATTATTAAA 2220
AAATACACCA AAATCATTGA TGGAGTGCCT GTGGAAATAA CTGAAAAAGA GACACGAGAA 2280 
GAACGAATCA TTACAGGTCC TGAAATAAAA TACACTAGGA TTTCTACTGG AGGTGGAGAA 2340 
ACAGAAGAAA CTCTGAAGAA ATTGTTACAA GAAGAGGTCA CCAAGGTCAC CAAATTCATT 2400
GAAGGTGGTG ATGGTCATTT ATTTGAAGAT GAAGAAATTA AAAGACTGCT TCAGGGAGAC 2460 
ACACCCGTGA GGAAGTTGCA AGCCAACAAA AAAGTTCAAG GTTCTAGAAG ACGATTAAGG 2520 
GAAGGTCGTT CTCAGTGAAA ATCCAAAAAC CAGAAAAAAA TGTTTATACA ACCCTAAGTC 2580
AATAACCTGA CCTTAGAAAA TTGTGAGAGC CAAGTTGACT TCAGGAACTG AAACATCAGC 2640
ACAAAGAAGC AATCATCAAA TAATTCTGAA CACAAATTTA ATATTTTTTT TTCTGAATGA 2700
GAAACATGAG GGAAATTGTG GAGTTAGCCT CCTGTGGTAA AGGAATTGAA GAAAATATAA 2760 
CACCTTACAC CCTTTTTCAT CTTGACATTA AAAGTTCTGG CTAACTTTGG AATCCATTAG 2820 
AGAAAAATCC TTGTCACCAG ATTCATTACA ATTCAAATCG AAGAGTTGTG AACTGTTATC 2880
CCATTGAAAA GACCGAGCCT TGTATGTATG TTATGGATAC ATAAAATGCA CGCAAGCCAT 2940 
TATCTCTCCA TGGGAAGCTA AGTTATAAAA ATAGGTGCTT GGTGTACAAA ACTTTTTATA 3000
TCAAAAGGCT TTGCACATTT CTATATGAGT GGGTTTACTG GTAAATTATG TTATTTTTTA 3060 
CAACTAATTT TGTACTCTCA GAATGTTTOT CATATGCTTC TTGCAATGCA TATTTTTTAA 3120 
TCTCAAACGT TTCAATAAAA CCATTTTTCA GATATAAAGA GAATTACTTC AAATTGAGTA 3180 
ATTCAGAAAA ACTCAAGATT TAAGTTAAAA AGTGGTTTGG ACTTGGGAA 

Seq ID NO: 62 Protein sequence: 
Protein Accession #: NP_006466

1 11 21 31 41 51 
I I I I I I 

MIPFLPMFSL LLLLIVNPIN ANNHYDKILA HSRIRGRDQG PNVCALQQIL GTKKKYFSTC 60
KNWYKKSICG QKTTVLYECC PGYMRMEGMK GCPAVLPIDH VYGTLGIVGA TTTQRYSDAS 120 
KLREEIEGKG SFTYFAPSNE AWDNLDSDIR RGLESNVNVE LLNALHSHMI NKRMLTKDLK 180
NGMIIPSMYN NLGLFINHYP NGVVTVNCAR IIHGNQIATN GVVHVIDRVL TQIGTSIQDF 240
IEAEDDLSSF RAAAITSDIL EALGRDGHFT LFAPTNEAFE KLPRGVLERF MGDKVASEAL 300
MKYHILNTLQ CSESIMGGAV FETLEGNTIE IGCDGDSITV NGIKMVNKKD IVTNNGVIHL 360 
IDQVLIPDSA KQVIELAGKQ QTTFTDLVAQ LGLASALRPD GEYTLLAPVN NAFSDDTLSM 420 
VQRLLKLILQ NHILKVKVGL NELYNGQILE TIGGKQLRVF VYRTAVCIEN SCMEKGSKQG 480
RNGAIHIFRE IIKPAEKSLH EKLKQDKRFS TFLSLLEAAD LKELLTQPGD WTLFVPTNDA 540
FKGMTSEEKE ILIRDKNALQ NIILYHLTPG VFIGKGFEPG VTNILKTTQG SKIFLKEVND 600
TLLVNELKSK ESDIMTTNGV IHVVDKLLYP ADTPVGNDQL LEILNKLIKY IQIKFVRGCT 660
FKEIPVTVYT TKIITKVVEP KSKVIEGSLQ PIIKTEGPTL TKVKIEGEPE FRLIKEGETI 720
TEVIHGEPII KKYTKIIDGV PVEITEKETR EERIITGPEI KYTRISTGGG ETEETLKKLL 780
QEEVTKVTKF IEGGDGHLFE DEEIKRLLQG DTPVRKLQAN KKVQGSRRRL REGRSQ

Seq ID NO: 63 DNA sequence 

Nucleic Acid Accession #: NM_020974 

Coding sequence: 81-3080 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51

I I I I I I 

GGCGTCCGCG CACACCTCCC CGCGCCGCCG CCGCCACCGC CCGCACTCCG CCGCCTCTGC 60 
CCGCAACCGC TGAGCCATCC ATGGGGGTCG CGGGCCGCAA CCGTCCCGGG GCGGCCTGGG 120
CGGTGCTGCT GCTGCTGCTG CTGCTGCCGC CACTGCTGCT GCTGGCGGGG GCCGTCCCGC 180 
CGGGTCGGGG CCGTGCCGCG GGGCCGCAGG AGGATGTAGA TGAGTGTGCC CAAGGGCTAG 240 
ATGACTGCCA TGCCGACGCC CTGTGTCAGA ACACACCCAC CTCCTACAAG TGCTCCTGCA 300 
AGCCTGGCTA CCAAGGGGAA GGCAGGCAGT GTGAGGACAT GGATGAATGT GGAAATGAGC 360
TCAATGGAGG CTGTGTCCAT GACTGTTTGA ATATTCCAGG CAATTATCGT TGCACTTGTT 420
TTGATGGCTT CATGTTGGCT CATGACGGTC ATAATTGTCT TGATGTGGAC GAGTGOCTGG 480
AGAACAATGG CGGCTGCCAG CATACCTGTG TCAACGTCAT GGGGAGCTAT GAGTGCTGCT 540 
GCAAGGAGGG GTTTTTCCTG AGTGACAATC AGCACACCTG CATTCACCGC TCGGAAGAGG 600 
GCCTGAGCTG CATGAATAAG GATCACGGCT GTAGTCACAT CTGCAAGGAG GCCCCAAGGG 660
GCAGCGTCGC CTGTGAGTGC AGGCCTGGTT TTGAGCTGGC CAAGAACCAG AGAGACTGCA 720 
TCTTGACCTG TAACCATGGG AACGGTGGGT GCCAGCACTC CTGTGACGAT ACAGCCGATG 780 
GCCCAGAGTG CAGCTGCCAT CCACAGTACA AGATGCACAC AGATGGGAGG AGCTGCCTTG 840 
AGCGAGAGGA CACTGTCCTG GAGGTGACAG AGAGCAACAC CACATCAGTG GTGGATGGGG 900
ATAAACGGGT GAAACGGCGG CTGCTCATGG AAACGTGTGC TGTCAACAAT GGAGGCTGTG 960 
ACCGCACCTG TAAGGATACT TCGACAGGTG TCCACTGCAG TTGTCCTGTT GGATTCACTC 1020 
TCCAGTTGGA TGGGAAGACA TGTAAAGATA TTGATGAGTG CCAGACCCGC AATGGAGGTT 1080 
GTGATCATTT CTGCAAAAAC ATCGTGGGCA GTTTTGACTG CGGCTGCAAG AAAGGATTTA 1140
AATTATTAAC AGATGAGAAG TCTTGCCAAG ATGTGGATGA GTGCTCTTTG GATAGGACCT 1200 
GTGACCACAG CTGCATCAAC CACCCTGGCA CATTTGCTTG TGCTTGCAAC CGAGGGTACA 1260 
CCCTGTATGG CTTCACCCAC TGTGGAGACA CCAATGAGTG CAGCATCAAC AACGGAGGCT 1320 
GTCAGCAGGT CTGTGTGAAC ACAGTGGGCA GCTATGAATG CCAGTGCCAC CCTGGGTACA 1380 
AGCTCCACTG GAATAAAAAA GACTGTGTGG AAGTGAAGGG GCTCCTGCCC ACAAGTGTGT 1440 
CACCCCGTGT GTCCCTGCAC TGCGGTAAGA GTGGTGGAGG AGACGGGTGC TTCCTCAOAT 1500 
GTCACTCTGG CATTCACCTC TCTTCAGATG TCACCACCAT CAGGACAAGT GTAACCTTTA 1560
AGCTAAATGA AGGCAAGTGT AGTTTGAAAA ATGCTGAGCT GTTTCCCGAG GGTCTGCGAC 1620
CAGCACTACC AGAGAAGCAC AGCTCAGTAA AAGAGAGCTT CCGCTACGTA AACCTTACAT 1680 
GCAGCTCTGG CAAGCAAGTC CCAGGAGCCC CTGGCCGACC AAGCACCCCT AAGGAAATGT 1740 
TTATCACTGT TGAGTTTGAG CTTGAAACTA ACCAAAAGGA GGTGACAGCT TCTTGTGACC 1800
TGAGCTGCAT CGTAAAGCGA ACCGAGAAGC GGCTCCGTAA AGCCATCGGC ACGCTCAGAA 1860
AGGCCGTCCA CAGGGAGCAG TTTCACCTCC AGCTCTCAGG CATGAACCTC GACGTGGCTA 1920 
AAAAGCCTCC CAGAACATCT GAACGCCAGG CAGAGTCCTG TGGAGTGGGC CAGGGTCATG 1980 
CAGAAAACCA ATGTGTCAGT TGCAGGGCTG GGACCTATTA TGATGGAGCA CGAGAACGCT 2040 
GCATTTTATG TCCAAATGGA ACCTTCCAAA ATGAGGAAGG ACAAATGACT TGTGAACCAT 2100 
GCCCAAGACC AGGAAATTCT GGGGCCCTGA AGACCCCAGA AGCTTGGAAT ATGTCTGAAT 2160 
GTGGAGGTCT GTGTCAACCT GGTGAATATT CTGCAGATGG CTTTGCACCT TGCCAGCTCT 2220 
GTGCCCTGGG CACGTTCCAG CCTGAAGCTG GTCGAACTTC CTGCTTCCCC TGTGGAGGAG 2280 
GCCTTGCCAC CAAACATCAG GGAGCTACTT CCTTTCAGGA CTGTGAAACC AGAGTTCAAT 2340 
GTTCACCTGG ACATTTCTAC AACACCACCA CTCAOCGATG TATTCGTTGC CCAGTGGGAA 2400 
CATACCAGCC TGAATTTGGA AAAAATAATT GTGTTTCTTG CCCAGGAAAT ACTACGACTG 2460
ACTTTGATGG CTCCACAAAC ATAACCCAGT GTAAAAACAG AAGATGTGGA GGGGAGCTGG 2520 
GAGATTTCAC TGGGTACATT GAATCCCCAA ACTACCCAGG CAATTACCCA GCCAACACCG 2580 
AGTGTACGTG GACCATCAAC CCACCCCCCA AGCGCCGCAT CCTGATCGTG GTCCCTGAGA 2640
TCTTCCTGCC CATAGAGGAC GACTGTGGGG ACTATCTGGT GATGCGGAAA ACCTCTTCAT 2700
CCAATTCTGT GACAACATAT GAAACCTGCC AGACCTACGA ACGCCCCATC GCCTTCACCT 2760
CCAGGTCAAA GAAGCTGTGG ATTCAGTTCA AGTCCAATGA AGGGAACAGC GCTAGAGGGT 2820
TCCAGGTCCC ATACGTGACA TATGATGAGG ACTACCAGGA ACTCATTGAA GACATAGTTC 2880
GAGATGGCAG GCTCTATGCA TCTGAGAACC ATCAGGAAAT ACTTAAGGAT AAGAAACTTA 2940
TCAAGGCTCT GTTTGATGTC CTGGCCCATC CCCAGAACTA TTTCAAGTAC ACAGCCCAGG 3000 
AGTCCCGAGA GATGTTTCCA AGATCGTTCA TCCGATTGCT ACGTTCCAAA GTGTCCAGGT 3060 
TTTTGAGACC TTACAAATGA CTCAGCCCAC GTGCCACTCA ATACAAATGT TCTGCTATAG 3120 
GGTTGGTGGG ACAGAGCTGT CTTCCTTCTG CATGTCAGCA CAGTCGGGTA TTGCTGCCTC 3180 
CCGTATCAGT GACTCATTAG AGTTCAATTT TTATAGATAA TACAGATATT TTGGTAAATT 3240 
GAACTTGGTT TTTCTTTCCC AGCATCGTGG ATGTAGACTG AGAATGGCTT TGAGTGGCAT 3300 
CAGCTTCTCA CTGCTGTGGG CGGATGTCTT GGATAGATCA CGGGCTGGCT GAGCTGGACT 3360 
TTGGTCAGCC TAGGTGAGAC TCACCTGTCC TTCTGGGGTC TTACTCCTCC TCAAGGAGTC 3420 
TGTAGTGGAA AGGAGGCCAC AGAATAAGCT GCTTATTCTG AAACTTCAGC TTCCTCTAGC 3480 
CCGGCCCTCT CTAAGGGAGC CCTCTGCACT CGTGTGCAGG CTCTGACCAG GCAGAACAGG 3540 
CAAGAGGGGA GGGAAGGAGA CCCCTGCAGG CTCCCTCCAC CCACCTTGAG ACCTGGGAGG 3600
ACTCAGTTTC TCCACAGCCT TCTCCAGCCT GTGTGATACA AGTTTGATCC CAGGAACTTG 3660 
AGTTCTAAGC AGTGCTCGTG AAAAAAAAAA GCAGAAAGAA TTAGAAATAA ATAAAAACTA 3720 
AGCACTTCTG GAGACAT 

Seq ID NO: 64 Protein sequence:
Protein Accession #: NP_066025.1

1 11 21 31 41 51 

I I I I I I

MGVAGRNRPG AAWAVLLLLL LLPPLLLLAG AVPPGRGRAA GPQEDVDECA QGLDDCHADA 60 
LCQNTPTSYK CSCKPGYQGE GRQCEDIDEC GNELNGGCVH DCLNIPGNYR CTCFDGFMLA 120 
HDGHNCLDVD ECLENKGGCQ HTCVNVMGSY ECCCKEGFFL SDNQHTCIHR SEEGLSCMNK 180
DHGCSHICKE APRGSVACEC RPGFELAKNQ RDCILTCNHG NGGCQHSCDD TADGPECSCH 240 
PQYKMHTDGR SCLEREDTVL EVTESNTTSV VDGDKRVKRR LLMETCAVNN GGCDRTCKDT 300 
STGVHCSCPV GFTLQLDGKT CKDIDECQTR NGGCDHFCKN IVGSFDCGCK KGFKLLTDEK 360 
SCQDVDECSL DRTCDHSCIN HPGTFACACN RGYTLYGFTH CGDTNECSIN NGGCQQVCVN 420 
TVGSYECQCH PGYKLHWNKK DCVEVKGLLP TSVSPRVSLH CGKSGGGDGC FLRCHSGIHL 480 
SSDVTTIRTS VTFKLNEGKC SLKNAELFPE GLRPALPEKH SSVKESFRYV NLTCSSGKQV 540 
PGAPGRPSTP KEMFITVEFE LETNQKEVTA SCDLSCIVKR TEKRLRKAIR TLRKAVHREQ 600 
FHLQLSGMNL DVAKKPPRTS ERQAESCGVG QGHAENQCVS CRAGTYYDGA RERCILCPNG 660 
TFQNEEGQMT CEPCPRPGNS GALKTPEAWN MSECGGLCQP GEYSADGFAP CQLCALGTFQ 720 
PEAGRTSCFP CGGGLATKHQ GATSFQDCET RVQCSPGHFY NTTTHRCIRC PVGTYQPEFG 780 
KNNCVSCPGN TTTDFDGSTN ITQCKNRRCG GELGDFTGYI ESPNYPGNYP ANTECTWTIN 840
PPPKRRILIV VPEIFLPIED DCGDYLVMRK TSSSNSVTTY ETCQTYERPI AFTSRSKKLW 900 
IQFKSNEGNS ARGFQVPYVT YDEDYQELIE DIVRDGRLYA SENHQEILKD KKLIKALFDV 960
LAHPQNYFKY TAQESREMFP RSFIRLLRSK VSRFLRPYK 

Seq ID NO: 65 DNA sequence 

Nucleic Acid Accession #: NM_007210

Coding sequence: 1-1869 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I I I I
ATGAGGCTCC TCCGCAGACG CCACATGCCC CTGCGCCTGG CCATGGTGGG CTGCGCCTTT 60
GTGCTCTTCC TCTTCCTCCT GCATAGGGAT GTGAGCAGCA GAGAGGAGGC CACAGAGAAG 120 
CCGTGGCTGA AGTCCCTGGT GAGCCGGAAG GATCACGTCC TGGACCTCAT GCTGGAGGCC 180
ATGAACAACC TTAGAGATTC AATGCCCAAG CTCCAAATCA GGGCTCCAGA AGCCCAGCAG 240
ACTCTGTTCT CCATAAACCA GTCCTGCCTC CCTGGGTTCT ATACCCCAGC TGAACTGAAG 300 
CCCTTCTGGG AACGGCCACC ACAGGACCCC AATGCCCCTG GGGCAGATGG AAAAGCATTT 360 
CAOAAGAGCA AGTGGACCCC CCTGGAGACC CAGGAAAAGG AAGAAGGCTA TAAGAAGCAC 420 
TGTTTCAATG CCTTTGCCAG CGACCGGATC TCCCTGCAGA GGTCCCTGGG GCCAGACACC 480
CGACCACCTG AGTGTGTGGA CCAGAAGTTC CGGCGCTGCC CCCCACTGGC CACCACCAGC 540 
GTGATCATTG TGTTCCACAA CGAAGCCTGG TCCACACTGC TGCGAACAGT GTACAGCGTC 600
CTACACACCA CCCCTGCCAT CTTGCTCAAG GAGATCATAC TGGTGGATGA TGCCAGCACA 660 
GAGGAGCACC TAAAGGAGAA GCTGGAGCAG TACGTGAAGC AGCTGCAGGT GGTGAGGGTG 720 
GTGCGGCAGG AGGAGOGGAA GGGGTTGATC ACCGCCCGGC TGCTGGGGGC CAGCGTGGCA 780 
CAGGCGGAGG TGCTCACGTT CCTGGATGCC CACTGTGAGT GCTTCCACGG CTGGCTGGAG 840 
CCCCTCCTGG CTCGAATCGC TGAGGACAAG ACAGTGGTGG TGAGCCCAGA CATCGTCACC 900 
ATCGACCTTA ATACTTTTGA GTTCGCCAAG CCCGTCCAGA GGGGCAGAGT CCATAGCCGA 960 
GGCAACTTTG ACTGGAGCCT GACCTTCGGC TGGGAAACAC TTCCTCCACA TGAGAAGCAG 1020 
AGGCGCAAGG ATGAAACATA CCCCATCAAA TCCCCGACGT TTGCTGGTGG CCTCTTCTCC 1080 
ATCCCCAAGT CCTACTTTGA GCACATCGGT ACCTATGATA ATCAGATGGA GATCTGGGGA 1140
GGGGAGAACG TGGAAATGTC CTTCCGGGTG TGGCAGTGTG GGGGCCAGCT GGAGATCATC 1200 
CCCTGCTCTG TCGTAGGCCA TGTGTTCCGG ACCAAGAGCC CCCACACCTT CCCCAAGGGC 1260 
ACTAGTGTCA TTGCTCGCAA TCAAGTGCGC CTGGCAGAGG TCTGGATGGA CAGCTACAAG 1320 
AAGATTTTCT ATAGGAGAAA TCTGCAGGCA GCAAAGATGG CCCAAGAGAA ATCCTTCGGT 1380 
GACATTTCGG AACGACTGCA GCTGAGGGAA CAACTGCACT GTCACAACTT TTCCTGGTAC 1440 
CTGCACAATG TCTACCCAGA GATGTTTGTT CCTGACCTGA CGCCCACCTT CTATGGTGCC 1500
ATCAAGAACC TCGGCACCAA CCAATGCCTG GATGTGGGTG AGAACAACCG CGGGGGGAAG 1560 
CCCCTCATCA TGTACTCCTG CCACGGCCTT GGCGGCAACC AGTACTTTGA GTACACAACT 1620 
CAGAGGGACC TTCGCCACAA CATCGCAAAG CAGCTGTGTC TACATGTCAG CAAGGGTGCT 1680 
CTGGGCCTTG GGAGCTGTCA CTTCACTGGC AAGAATAGCC AGGTCCCCAA GGACGAGGAA 1740 
TGGGAATTGG CCCAGGATCA GCTCATCAGG AACTCAGGAT CTGGTACCTG CCTGACATCC 1800 
CAGGACAAAA AGCCAGCCAT GGCCCCCTGC AATCCCAGTG ACCOCCATCA GTTGTGGCTC 1860 
TTTGTCTAGG ACCCAGATCA TCCCCAGAGA GAGCCCCCAC AAGCTCCTCA GGAAACAGGA 1920
TTGCTGATGT CTGGGAACCT GATCACCAGC TTCTCTGGAG GCCGTAAAGA TGGATTTCTA 1980
AACCCACTGG GTGGCAAGGC AGGACCTTCC TAATCCTTGC AACAACATTG GGCCCATTTT 2040 
CTTTCCTTCA CACCGATGGA AGAGACCATT AGGACATATA TTTAGCCTAG CGTTTTCCTG 2100 
TTCTAGAAAT AGAGGCTCCC AAAGTAGGGA AGGCAGCTGG GGGAGGGTTC AGGGCAGCAA 2160 
TGCTGAGTTC AAGAAAAGTA CTTCAGGCTG GGCACAGTGG CTCATGCCTG AAATCCTAGC 2220 
ACTTTGGGAA GACAATGTOG GAGAATGGCT TGAGCCCAGG AGTTCAAGAC CGGCCTGAGC 2280
AACATAGTGA GGATCCCATC TCTACGCCCA CCCTCCCCCC GGCAAAAAAA AAAGCTGGGT 2340 
ATGGTGGCTT ATGCCTGTAG TCGCAGCTAC TCAGAAGGCT GAGGTGGGAG GATTGCTTGT 2400
TCCCCGGAGG TTGAAGCTAC AGTGAGCCTT GATTGTGTCA CTGCACTCCA GCCTGGGCAA 2460 
CAGGTAAGAC TCTGTCTCAA AAAAAAAACA AAAAAGAAGA AGAAAAGTAC TTCTACAGCC 2520
ATGTCCTATT CCTTGATCAT CCAAAGCACC TGCAGAGTCC AGTGAAATGA TATATTCTGG 2580
CTGGGCACAG TGGCTCACAC CTGTAATCCT AGCACTTTGG GAGGCCAAGG CAGGTGGATC 2640 
ACCTGAGGTC AGAAGTTTGA AACCAGCCTG GACTACATGG TGAAACTCCA TCTCTACTAA 2700 
AAGTACAAAA ATTAGCTGGG CATGATGGCA CGCACCTGCA GTCCCAGCTA CTTGGGAGGC 2760 
TGAGGCAGGA GAATCACTCG AAOCCAGGAG GCAGAGGTTG CAGTGAGCCA AGACAGCACC 2820 
ATTGCACCCC AQCCTGAGCA ACAAGAGCGA AACTCCATCT CAGGAAAAAA AAAAAAAAAA 2880 
A 

Seq ID NO: 66 Protein sequence:
Protein Accession #: NP_009141 

1 11 21 31 41 51 

I I I I I I

MRLLRRRHMP LRLAMVGCAF VLFLFLLHRD VSSREEATEK PWLKSLVSRK DHVLDLMLEA 60
MNNLRDSMPK LQIRAPEAQQ TLFSINQSCL PGFYTPAELK PFWERPPQDP NAPGADGKAF 120
QKSKWTPLET QEKEEGYKKH CFNAFASDRI SLQRSLGPDT RPPECVDQKF RRCPPLATTS 180
VIIVFHNEAW STLLRTVYSV LHTTPAILLK EIILVDDAST EEHLKEKLEQ YVKQLQVVRV 240
VRQEERKGLI TARLLGASVA QAEVLTFLDA HCECFHGWLE PLLARIAEDK TVVVSPDIVT 300
IDLNTFEFAK PVQRGRVHSR GNFDWSLTFG WETLPPHEKQ RRKDETYPIK SPTFAGGLFS 360 
IPKSYFEHIG TYDNQMEIWG GENVEMSFRV WQCGGQLEII PCSVVGHVFR TKSPHTFPKG 420
TSVIARNQVR LAEVWMDSYK KIFYRRNLQA AKMAQEKSFG DISERLQLRE QLHCHNFSWY 480 
LHNVYPEMFV PDLTPTFYGA IKNLGTNQCL DVGENNRGGK PLIMYSCHGL GGNQYFEYTT 540
QRDLRHNIAK QLCLHVSKGA LGLGSCHFTG KNSQVPKDEE WELAQDQLIR NSGSGTCLTS 600 
QDKKPAMAPC NPSDPHQLWL FV 

Seq ID NO: 67 DNA sequence 

Nucleic Acid Accession #: NM_014112

Coding sequence: 600-4484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I

TTCCTCCGCG AAGGCTCCTT TGATATTAAT AGTGTTGGTG TCTTGAAACT GACGTAATGC 60 
GCGGAGACTG AGGTCCTGAC AAGCGATAAC ATTTCTGATA AAGACCCGAT CTTACTGCAA 120 
TCTCTAGCGT CCTCTTTTTT GGTGCTGCTG GTTTCTCCAG ACCTCGCGTC CTCTCGATTG 180 
CTCTCTCGCC TTCCTATTTC TTTTTTTTTT TTTTAAACAA AAAACAACAC CCCCTCCCCT 240 
CTCCCACCCG GCACCGGGCA CATCCTTGCT CTATTTCCTT TCTCTTTCTC TCTCTCTCTC 300 
TCTCTTTTTT AATAAGGGTG GGGGAGGGAA AGGGGGGGGA GGCAGGAAAG ACCTTTTTCT 360
CTCCCCCCCG CAATAATCCA AGATCAACTC TGCAAACAAC AGAAGACGGT TCATGGCTTT 420 
GGCCGCCGCG CCACCATCTT TCGGGCTGCC OAGGGTGTTC TTGACGATTA ATCAACAOAT 480 
GTACAGATCA GCTCTCAAAA TGTCTTCTGT GTCTTCTGAG CGTCTTCTAA GACAATTGCA 540 
TTAGCCTCCT GCTAGTTGAC TAATAGAATT AATAATTGTA AAAAGCACTC TAAAGCCACA 600 
TGCCTTATGA AGTCAATGCT GGGTATGATT TTACAAATAT GGTCCGGAAA AAGAACCCCC 660
CTCTGAGAAA CGTTGCAAGT GAAGGCGAGG GCCAGATCCT GGAGCCTATA GGTACAGAAA 720 
GCAAGGTATC TGG AAAGAAC AAAG AATTCT CTGCAGATCA GATGTCAGAA AATACGG ATC 780 
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AGAGTGATGC TGCAGAACTA AATCATAAGG AGG AACATAG CTTGCATGTT CA AG ATCCAT 840 
CTTCTAGCAG TAAGAAGGAC TTGAAAAGCG CAGTTCTGAG TGAGAAGGCT GGCTTCAATT 900 
ATGAAAGCCC CAGTAAGGGA GGAAACTTTC CCTCCTTTCC GCATGATGAG GTGACAGACA 960 
GAAATATGTT GGCTTTCTCA TTTCCAGCTG CTGGGGGAGT CTGTGAGCCC TTGAAGTCTC 1020 
CGCAAAGAGC AGAGGCAGAT GACCCTCAAG ATATGGCCTG CACCCCCTCA GGGGACTCAC 1080 
TGGAGACAAA GGAAGATCAG AAGATGTCAC CAAAGGCTAC AGAGGAAACA GGGCAAGCAC 1140
AGAGTGGTCA AGCCAATTGT CAAGGTTTGA GCCCAGTTTC AGTGGCCTCA AAAAACCCAC 1200 
AAGTGCCTTC AGATGGGGGT GTAAGACTGA ATAAATCCAA AACTGACTTA CTGGTGAATG 1260 
ACAACCCAGA CCCGGCACCT CTGTCTCCAG AGCTTCAGGA CTTTAAATGC AATATCTGTG 1320
GATATGGTTA CTACGGCAAC GACCCCACAG ATCTGATTAA GCACTTCCGA AAGTATCACT 1380 
TAGGACTGCA TAACCGCACC AGGCAAGATG CTGAGCTGGA CAGCAAAATC TTGGCCCTTC 1440 
ATAACATGGT GCAGTTCAGC CATTCCAAAG ACTTCCAGAA GGTCAACCGT TCTGTGTTTT 1500
CTGGTGTGCT GCAGGACATC AATTCTTCAA GGCCTGTTTT ACTAAATGGG ACCTATGATG 1560
TGCAGGTGAC TTCAGGTGGA ACATTCATTG GCATTGGACG GAAAACACCA GATTGCCAAG 1620 
GGAACACCAA GTATTTCCGC TGTAAATTCT GCAATTTCAC TTATATGGGC AACTCATCCA 1680 
CCGAATTAGA ACAACATTTT CTTCAGACTC ACCCAAACAA AATAAAAGCT TCTCTCCCCT 1740
CCTCTGAGGT TGCAAAACCT TCAGAGAAAA ACTCTAACAA GTCCATCCCT GCACTTCAAT 1800
CCAGTGATTC TGGAGACTTG GGAAAATGGC AGGACAAGAT AACAGTCAAA GCAGGAGATG 1860
ACACTCCTGT TGGGTACTCA GTGCCCATAA AGCCCCTCGA TTCCTCTAGA CAAAATGGTA 1920 
CAGAGGCCAC CAGTTACTAC TGGTGTAAAT TTTGTAGTTT CAGCTGTGAG TCATCTAGCT 1980 
CACTTAAACT GCTAGAACAT TATGGCAAGC AGCACGGAGC AGTGCAGTCA GGCGGOCTTA 2040 
ATCCAGAGTT AAATGATAAG CTTTCCAGGG GCTCTGTCAT TAATCAGAAT GATCTAGCCA 2100 
AAAGTTCAGA AGGAGAGACA ATGACCAAGA CAGACAAGAG CTCGAGTGGG GCTAAAAAGA 2160 
AGGACTTCTC CAGCAAGGGA GCCGAGGATA ATATGGTAAC GAGCTATAAT TGTCAGTTCT 2220 
GTGACTTCCG ATATTCCAAA AGCCATGGCC CTGATGTAAT TGTAGTGGGG CCACTTCTCC 2280 
GTCATTATCA ACAGCTCCAT AACATTCACA AGTGTACCAT TAAACACTGT CCATTCTGTC 2340 
CCAGAGGACT TTGCAGCCCA GAAAAGCACC TTGGAGAAAT TACTTATCCG TTTGCTTGTA 2400
GAAAAAGTAA TTGTTCCCAC TGTGCACTCT TGCTTCTGCA CTTGTCTCCT GGGGCGGCTG 2460 
GAAGCTCGCG AGTCAAACAT CAGTGCCATC AGTGTTCATT CACCACCCCT GACGTAGATG 2520
TACTCCTCTT TCACTATGAA AGTGTGCATG AGTCCCAAGC ATCGGATGTC AAACAAGAAG 2580 
CAAATCACCT GCAAGGATCG GATGGGCAGC AGTCTGTCAA GGAAAGCAAA GAACACTCAT 2640 
GTACCAAATG TGATTTTATT ACCCAAGTGG AAGAAGAGAT TTCCCGACAC TACAGGAGAG 2700
CACACAGCTG CTACAAATGC CGTCAGTGCA GTTTTACAGC TGCCGATACT CAGTCACTAC 2760 
TGGAGCACTT CAACACTGTT CACTGCCAGG AACAGGACAT CACTACAGCC AACGGCGAAG 2820 
AGGACGGTCA TGCCATATCC ACCATCAAAG AGGAGCCCAA AATTGACTTC AGGGTCTACA 2880
ATCTGCTAAC TCCAGACTCT AAAATGGGAG AGCCAGTTTC TGAGAGTGTG GTGAAGAGAG 2940 
AGAAGCTGGA AGAGAAGGAC GGGCTCAAAG AGAAAGTTTG GACCGAGAGT TCCAGTGATG 3000 
ACCTTCGCAA TGTGACTTGG AGAGGGGCAG ACATCCTGCG GGGGAGTCCG TCATACACCC 3060
AAGCAAGCCT GGGGCTGCTG ACGCCTGTGT CTGGCACCCA AGAGCAGACA AAGACTCTAA 3120 
GGGATAGTCC CAATGTGGAG GCCGCCCATC TGGCGCGACC TATTTATGGC TTGGCTGTGG 3180
AAACCAAGGG ATTCCTGCAG GGGGCGCCAG CTGGCGGAGA GAAGTCTGGG GCCCTCCCCC 3240 
AGCAGTATCC TGCATCGGGA GAAAACAAGT CCAAGGATGA ATCCCAGTCC CTGTTACGGA 3300 
GGCGTAGAGG CTCCGGTGTT TTTTGTGCCA ATTGCCTOAC CACAAAGACC TCTCTCTGGC 3360 
GAAAGAATGC AAATGGCGGA TATGTATGCA ACGCGTGTGG CCTCTACCAG AAGCTTCACT 3420 
CGACTCCCAG GCCTTTAAAC ATCATTAAAC AAAACAACGG TGAGCAGATT ATTAGGAGGA 3480 
GAACAAGAAA GCGCCTTAAC CCAGAGGCAC TTCAGGCTGA GCAGCTCAAC AAACAGCAGA 3540 
GGGGCAGCAA TGAGGAGCAA GTCAATGGAA GCCCGTTAGA GAGGAGGTCA GAAGATCATC 3600
TAACTGAAAG TCACCAGAGA GAAATTCCAC TCCCCAGCCT AAGTAAATAC GAAGCCCAGG 3660
GTTCATTGAC TAAAAGCCAT TCTGCTCAGC AGCCAGTCCT GGTCAGCCAA ACTCTGGATA 3720 
TTCACAAAAG GATGCAACCT TTGCACATTC AGATAAAAAG TCCTCAGGAA AGTACTGGAG 3780 
ATCCAGGAAA TAGTTCATCC GTATCTGAAG GGAAAGGAAG TTCTGAGAGA GGCAGTCCTA 3840 
TAGAAAAGTA CATGAGACCT GCGAAACACC CAAATTATTC ACCACCAGGC AGCCCTATTG 3900 
AAAAGTACCA GTACCCACTT TTTGGACTTC CCTTTGTACA TAATGACTTC CAGAGTGAAG 3960
CTGATTGGCT GCGGTTCTGG AGTAAATATA AGCTCTCCGT TCCTGGGAAT CCGCACTACT 4020 
TGAGTCACGT GCCTGGCCTA CCAAATCCTT GCCAAAACTA TGTGCCTTAT CCCACCTTCA 4080
ATCTGCCTCC TCA' M FTTCA GCTGTTGGAT CAGACAATGA CATTCCTCTA GATTTGGCGA 4140 
TCAAGCATTC CAGACCTGGG CCAACTGCAA ACGGTGCCTC CAAGGAGAAA ACGAAGGCAC 4200 
CACCAAATGT AAAAAATGAA GGTCCCTTGA ATGTAGTAAA AACAGAGAAA GTTGATAGAA 4260
GTACTCAAGA TGAACTTTCA ACAAAATGTG TGCACTGTGG CATTGTCTTT CTGGATGAAG 4320
TGATGTATGC TTTGCATATG AGTTGCCATG GTGACAGTGG ACCTTTCCAG TGCAGCATAT 4380
GCCAGCATCT TTGCACGGAC AAATATGACT TCACAACACA TATCCAGAGG GGCCTGCATA 4440 
GGAACAATGC ACAAGTGGAA AAAAATGGAA AACCTAAAGA GTAAAACCTT AGCACTTAGC 4500 
ACAATTAAAT AGAAATAGGT TTTCTTGATG GGAATTCAAT AGCTTGTAAT GTCTTATGAA 4560
GACCTATTAA AA AAATACTT CATAGAGCCT GCCTTATOCA ACATGAAATT CCCTTCTTTT 4620 
GTTATTCTTT CM 1 1 G ATG A GTAGGTTACC AAGATTAAAA AGTGAGATAA ATGGTCAATG 4680 
AGAAAGAATG GAAGATGGTA AACAATCACT TTTTAAAACC TGTTAAGTCA AAACCATCTT 4740
GGCTAATATG TACTGGGGAA ATAATCCATA AGAGATATCA CCAGACTAGA ATTAATATAT 4800 
TTATAAAGAA AGAGACCAAA ACTGTCTAGA ATTTGAAAGG GTTTACATAT TATTATACTA 4860
AAGCAGTACT GGACTGGCCA TTGGACCATT TGTTCCAAAA CCCATAAATT GTTGCCTAAA 4920
TTTATAATGA TCATGAAACC CTAGGCAGAG GAGGAGAAAT TGAAGGTCCA GGGCAATGAA 4980
AGAAAAATGG CGCCCTCTCA ATTTAGTCTT CTCTCATTGG CCATGTTTCA GATTTTGACC 5040
TAGAAATGCG AGCTGTGGTT AGGCTTGGTT AGAGTGCAGC AAGCAACATG ACAGATGGTG 5100
GCACGCTGTT TTTACCCAGC CCTGCCTGTA CATACACATG CACACCCTCT CTGATATTTT 5160
TGTCCTTTAG ATGTTCAAAT ACTCAGTAGT CCTTTTGTTT GCGGTTTAGA TTCATT TTOT 5220 
CCACACATGT ACCCATTTTA AAAAACAATG TCCTCGATGC TTCTGTAGTG ATTTCATTTT 5280
AGCCAGGTAT TTCTTTCTTG TGTGTGATGA ACCAGTATGG ATTTGCTTTT CTAAGCCTCC 5340 
TGTTGGTTAC TAATCTCACT TGGCACATTA TAACTAAAGG AATCCCCTCA ATTCAAAAGC 5400 
ATAGATGGAT ACAAATGTCA GACCGTGGGT TTAATTTGTT TAGAACACAT GGCATTTCTT 5460
CACAAGGTAA CCTGCTGTAT TTATTTATTT TCTTTTGGTT AAATATAATT TCCAAACTTT 5520
GTGGTCAGGC AGCGTCTAAG GTTACGTTAC CACAGACTGA CAGTTGGTAT ATGTACCAGC 5580 
CAATCCCTTC ATTAAATGTA TACAGATTTA GTTAAGTAGC ATTAAATAGG ATTCTTAGAA 5640 
GTATGTCCTC ATAGAACTTT TAATACTTAA GGCTTTGTAA AAACTATCCA TGAAGGGAAA 5700 
GCTCCTCAGC ATAACTGCTC AGGGAAATAG GGCTAAATAA CTGAACATTA AATAATTGGT 5760 
TAAAGGTGCT GTTAGTCGAG CCTCAATGCT TGCTACAAGG ATGTATGTAC AAGGACTGAC 5820
TTTAATAATT TGCATTATAT TGTCCCAACC AGTAGTTTAT TTTTTGCCAC GGAGATGTAG 5880
AAGATATTAC AAGCTACTGG ATGCACTGTC AGATTAACTT ATTTCATTAA AGAAGTTGGG 5940
AGAACAAATA GGAAAAAAAA AACTTATTTT TCTAGTAAAT ATTAATGTAT TACATTTCAA 6000
ATAATGGTGC CTGACATATT GAATAATTAT TTTCTACAGT GTACGTATGC AACAAAGATA 6060 
TTCCATCATG CATTAGAGTC AGTTCTGGCT CTGCCTAGCT GTTTACATTT GCAAATGTAG 6120
CAAACAAGGT AATGAAGCAA CTATTTCTAT TGCAGTAGAT ATCCTTTTGT GTGTGTGTGT 6180 
GTGCATTAAA GTTGTAAACG GTAACATGAA ACAAATGAAA GTTCTTGCTA TAATGGTATG 6240 
GAAAACAAGA AGGAAATGAA AATATTTTTA TGCCTACTTA GGAAAAAAAG GGTAGCACTT 6300 
ATTCATTCCA AGTACTTTTT TTTTTTTAAT TTTTAAGCTC TTAACTCACA TTGTTATGCT 6360 
TAAGATGATA AACATATATC CTCTTTTTAT TGCTTTGTCT ATGTTTCATA TGAAACATTT 6420
CAGAAATTAT TTTGATAAGT GTTGCTGGAA TCTGCAACGC TGATTTTTTT TTGCATTCTG 6480
TAGTCGCATT TGCACTCCAT TTTTACATTA ATTCGCAGTT GCTTTGTATC ATTGTTTTGT 6540 
TTGGGTTTTG TTTCTTTTTC ACAGTGCCGG GTCTTCGTTT CTTAAAGTTG GATGGCAGGT 6600
AGAGTTCAAC CAGTTCGTGA CTGTTGTAGC GAATGAAGTT AAAAAAATGT CTTTCTGATG 6660 
TTGTGTTGTC ATTTTCATTT TTOCATTTTT TTGTTTGCAT ATTAAAAAAA GAGAAAAGAG 6720
AAAGCAAGAG ACAGAAATCA GGACTAAGTC CTCTGCTTCA GTTTCATTGT TAACGGGCCT 6780 
TATTCTGATC TCACCTGTCG CGTAGCTCTA ATATTCACAT AAACTGAAAT AAAGAAGTGG 6840 
AATGAGGAGC TTTGACATTC AAATTATGTG ATGTAATTTA TCTTCCTTAG GAATTTTGAT 6900
GGATGCATCT CAAAATGTAT AGCCAGACTT GAGAGGTGAC AATTAAAGAT CTAAAAAAGA 6960 
GAGGAGATTC CCCCAAACAA CAATATTTAA TTTTCTTAGT AAAAAGAATA ACAGAATGCA 7020 
TCGTGGCAAT OCTTAAGCAA CATTATCTAT GTGGACTGCT TAAATCAGCA AAACACCAGA 7080 
AGTTTGGTTA ACTTGGGCAA TATGACAAGT ATTACTTTTT GGGCAAAACT ACTCATTAAQ 7140 
CAATTTCTCT AGTGTGTCGG ACACAAATAG GTTCTTTATT TTTGGCATGT ATGCCTTTTT 7200 
ATTTTCATTC AATTTTTTTT TTTTCTCAGA CAGACATAGT AGTATCAACT AGCATTGGAA 7260
AATACATATC ACTATTCTTG GAATATTTAT GGTCAGTCTA CTTTTTAGTA AAATATTTTT 7320
GGATAGCGTT GACACGATAG ATCTTATTCC ATACTTCTTT ATTATTGATA ATTTTATTTT 7380
CATTTTTTGC TTTCATTATT ATACATATTT TGGTGGAGAA GAGGTTGGGC n i l I lG AAA 7440 
OAGACAAAA A TTTATTATAA CACTAA ACAC TCCTTTTTTG ACATATTAAA GCC TTTAT TC 7500 
CATCTCTCAA GATATATTAT AAAATTTATT TTTTTAATTT AAGATTTCTG AATTATTTTA 7560
TCTTAAATTG TGATTTTAAA CGAGCTATTA TGGTACGGAA CTTTTTTTAA TGAGGAATTT 7620 
CATGATGATT TAGOAATTTT CTCTCTTGGA AA AGGCTTCC CCTGTG ATG A AAATGATGTG 7680 
CCAGCTAAAA TTGTGTGCCA TTTAAAAACT GAAAATATTT TAAAATTATT TGTCTATATT 7740 
CTAAATTGAG CTTTOG ATCA AACTTTAGGC CAGGACCAGC TCATGCGTTC TCATTCTTCC 7800 
TTTTCTCACT CTTTCTCTCA TCACTCACCT CTGTATTCAT TCTGTTGTTT GGGATAGAAA 7860
AATCATAAAG AGCCAACCCA TCTCAGAACG TTGTGGATTG AGAGAGACAC TACATGACTC 7920 
CAAGTATATG AGAAAAGGAC AGAGCTCTAA TTGATAACTC TGTAGTTCAA AAGGAAAAGA 7980
GTATGCCCAA TTCTCTCTAC ATGACATATT GAGATTTTTT TTAATCAACT TTTAAGATAG 8040
TGATGTTCTG TTCTAAACTG TTCTGTTTTA GTGAAGGTAG ATTTTTATAA AACAAGCATG 8100
GGGATTCTTT TCTAAGGTAA TATTAATGAG AAGGGAAAAA AGTATCTTTA ACAGCTCTTT 8160
GTTGAAGCCT GTGGTAGCAC ATTATGTTTA TAATTGCACA TGTGCACATA ATCTATTATG 8220
ATCCAATGCA AATACAGCTC CAAAAATATT AAATGTATAT ATATTTTAAA ATGCCTGAGG 8280 
AAATACATTT TTCTTAATAA ACTGAAGAGT CTCAGTATGG CTATTAAAAT AATTATTAGC 8340
CTCCTGTTGT GTGGCTGCAA AACATCACAA AGTGACCGGT CTTGAGACCT GTGAACTGCT 8400
GCCCTGTTTA GTAAATAAAA TTAATGCATT TCTAGAGGGG GAATATCTGC CATCCAGTGG 8460
TGGAAATGTG GAGTAAAGAA GCTGGTGGTC TGCTTCTGTG CTGTATGCCA GCCTTTTGCC 8520
TTAAGTTGAG AGGAGGTCAA CTTTAGCTAC TGTCTTTGGT TTGAGAGCCA TGGCAAAAAA 8580
AAAAAAAGAA AAAAAGATCA AGTCGTCTTT GGTGAGCCAG TAAGGTGAAA GCTTGCTGAC 8640
TGTCCAAGGC ACAAGAGAAA ATTGAGGAAT TGAAATGCAA CCTGAGTATC AAACTAAATA 8700
TTCTAATCAA AGGTAGGTAC TGTTAGGTGG AATTCTATCA OCAG GCAAC T GCAAATGAGA 8760 
AGAAGATAGA AGGACGCCCG TCGGGACTTT GGAQGGCATT GTTA TTTTCC CAAAGAAAGA 8820 
CGGCCAAGGG CAGAGGCATG GATTCTTTGC AGAGCACTTC CTTTTGGTTT TTCAGTACTG 8880
TTTCATAGAC AGTGGGCTCA CATGTTCCTG ATAGTGCTGC AGTTGCTTAG AAAGCATCCC 8940 
AGTTAATTGC AGTAATTAGA ACTTCTGGAA TATGCTAGGG CAGAAGTATG TCAAGTATGT 9000 
CACATGAAGA AAATGTGAAA TTCAAGAGTA ATCCACACGT GAGAAACTAG ACAATGTACA 9060
TTCATGTGTT CTCTTGAAAG GAAAGGGAGA GCTGTAAGCT TCACTCTGTC CTACACCGGA 9120
GAAAAGCAGG AATAACTTTA CCGTGGAAAT AATGTTTAGC TTTTATCAGA GAAAATTGTC 9180
CTTCTAG AGC ATAGAGTCCC AAAACTCAAT TCTGGTTTTC OCCTGTTTTT TTTTTTTTTT 9240 
TTTTTCCCAA CATATGAACT GCAGCATATC ACTTTTTCTT TTTGTGCCTC AGGTTCCTCA 9300 
CCTGTAAAAT TGAAAAATAT ATGTATTAAT AATATTATTA ATAATAATAA TGGTAATGTA 9360
GTACTTGTTT GTAAAGCACT TTGAGATCCT TGGTTGAAAG GCACCATAGG AGTGCCAAGT 9420
ATTATTATGT GGCCAAGGGG GTTATTTAAA CTGTCAGTTC CCAAAGGCCA GGAAAGGTTG 9480
GGGTCATTTT TCTTAAAGAC GAGCTGTAAA TATCAACTAG GCAGCCAATA GTGTTGACTA 9540
TGAAGATGCA AAACTATTAC TAGGCTGATA AAATCATAGT TTCTTAATGG CTACCAATAA 9600
GGCAAATATC ACAATAATAA ACGCCAAATT CCTTAGGGCG GACTATTTGA CAACCACATG 9660 
GAAAACTTTG GGGGAGGCAT GAGGGGGOAA CATCTCAAAA TGCCAATGTA AAATTTAACT 9720 
TACAGCAATA TTCACCAGCA GAAAATGTCT TTCATATGGA ATGATTTCAT GTTGCTAAGA 9780
AAAAGAATTC AATTTGTAGT CCTGATTTGA ATACTAGAAT GTTGGCTATA ATAGTTCTGT 9840
TCTTACAACA CATGAAATTT TTTCGTTTTA TTTTATTTTG TTTTCATAGT GCATGTTCAT 9900
TTCTACTCAC AAACATGTTC TTGGTGTATT TCTTATGCAA ACAATCTTCA GGCAGCAAAG 9960
ATGTCTGTTA CATCTAAACT TGAATAATAA AGTTTTACCA CCAGTTACAC A

Seq ID NO: 68 Protein sequence:
Protein Accession #: NP_054831

1 11 21 31 41 51 

MPYEVNAGYD FTNMVRKKNP PLRNVASEGE GQILEPIGTE SKVSGKNKEF SADQMSENTD 60
QSDAAELNHK EEHSLHVQDP SSSSKKDLKS AVLSEKAGFN YESPSKGGNF PSFPHDEVTD 120
RNMLAFSFPA AGGVCEPLKS PQRAEADDPQ DMACTPSGDS LETKEDQKMS PKATEETGQA 180
QSGQANCQGL SPVSVASKNP QVPSDGGVRL NKSKTDLLVN DNPDPAPLSP ELQDFKCNIC 240
GYGYYGNDPT DLIKHFRKYH LGLHNRTRQD AELDSKILAL HNMVQFSHSK DFQKVNRSVF 300
SGVLQDINSS RPVLLNGTYD VQVTSGGTFI GIGRKTPDCQ GNTKYFRCKF CNFTYMGNSS 360 
TELEQHFLQT HPNKIKASLP SSEVAKPSEK NSNKSIPALQ SSDSGDLGKW QDKITVKAGD 420
DTPVGYSVPI KPLDSSRQNG TEATSYYWCK FCSFSCESSS SLKLLEHYGK QHGAVQSGGL 480 
NPELNDKLSR GSVINQNDLA KSSEGETMTK TDKSSSGAKK KDFSSKGAED NMVTSYNCQF 540
CDFRYSKSHG PDVIVVGPLL RHYQQLHNIH KCTIKHCPFC PRGLCSPEKH LGEITYPFAC 600
RKSNCSHCAL LLLHLSPGAA GSSRVKHQCH QCSFTTPDVD VLLFHYESVH ESQASDVKQE 660 
ANHLQGSDGQ QSVKESKEHS CTKCDFITQV EEEISRHYRR AHSCYKCRQC SFTAADTQSL 720 
LEHFNTVHCQ EQDITTANGE EDGHAISTIK EEPKIDFRVY NLLTPDSKMG EPVSESVVKR 780
EKLEEKDGLK EKVWTESSSD DLRNVTWRGA DILRGSPSYT QASLGLLTPV SGTQEQTKTL 840 
RDSPNVEAAH LARPIYGLAV ETKGFLQGAP AGGEKSGALP QQYPASGENK SKDESQSLLR 900 
RRRGSGVFCA NCLTTKTSLW RKNANGGYVC NACGLYQKLH STPRPLNIIK QNNGEQIIRR 960
RTRKRLNPEA LQAEQLNKQQ RGSNEEQVNG SPLERRSEDH LTESHQREIP LPSLSKYEAQ 1020 
GSLTKSHSAQ QPVLVSQTLD IHKRMQPLHI QIKSPQESTG DPGNSSSVSE GKGSSERGSP 1080
IEKYMRPAKH PNYSPPGSPI EKYQYPLFGL PFVHNDFQSE ADWLRFWSKY KLSVPGNPHY 1140
LSHVPGLPNP CQNYVPYPTF NLPPHFSAVG SDNDIPLDLA IKHSRPGPTA NGASKEKTKA 1200 
PPNVKNEGPL NVVKTEKVDR STQDELSTKC VHCGIVFLDE VMYALHMSCH GDSGPFQCSI 1260
CQHLCTDKYD FTTHIQRGLH RNNAQVEKNG KPKE 

Seq ID NO: 69 DNA sequence 

Nucleic Acid Accession #: XM_073879

Coding sequence: 1-387 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

ATGGGGTTTG GAGACCAGGG AACGGTGGAA GGGAGCCTAG GAACGTCGAA AAAACCACCT 60
GAAGTGAAAA TGTTTGGAGC CAGTCAAGGT TTGCTGACAA TGGAAACAAA CCAGTCCCTG 120
GCACAAGGCA CAGGCTGCTC AGTAGTAAAG GTAGACACTG TTCTCTTCGA GAGTTTATAC 180
CACTGCGGCT TTGAACACGG GAGCGTGATG CACTGCCTTG GGGATGATCA CCCCCAGGAA 240 
GACAGGAAGG CTCACTTCTC TGCCCCAGTT GCAGCCATCG CCTCTCCAGC ACCGACTCCT 300 
GTCTGTCCTG CAGACCACTC AACACAGAGC ATCTGCCAGT TTCTACAGCA CTGCAGGCAG 360 
AACACTCACT TGCAGGCTGC TAACTAA 

Seq ID NO: 70 Protein sequence:
Protein Accession #: XP_073879

1 11 21 31 41 51 

MGFGDQGTVE GSLGTSKKPP EVKMFGASQG LLTMETNQSL AQGTGCSVVK VDTVLFESLY 60
HCGFEHGSVM HCLGDDHPQE DRKAHFSAPV AAIASPAPTP VCPAHHSTQS ICQFLQHCRQ 120 
NTHLQAAN 

Seq ID NO: 71 DNA sequence

Nucleic Acid Accession #: AB033064

Coding sequence: 826-1986 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

GGGGAACAGC AAATTCAGTC ACAGACAATC CTCCACTCGG TCAAGAGCCA CTTTTCTCTT 60 
CCTGCCTTGC CCCCCCGCAG GGGGTAAGGA ACTGAGCGTT TAATCTTTAG CCGGTTGGCT 120 
ACCAGCTAAA ATTCTACTTA TCTTAGTTTC TAGTGGATAG CTTTCTTATT TTGCCCATGT 180 
TTTCTTAGAA TCCCTGTTTA ATATACTTTT GTCAGTAGTA GTATCTAGGA GTAGCAGGGA 240 
GAGTGACAAT AAATTAGCCC CTTCTTTTTT CCCTTGTCAT TCAGGCCCCT TTTCCTCTCC 300
AGAGGGAAAT TACCAGTAAA CTCTTCTAAA TCTTCCACCC CTTCTCAGTC ATACTGTGAA 360 
GAAACACACT AAAGTGGACA TTATTTGACC AGTGAACACG AACCCAGCTT CAGGCATTGG 420
TTTGTTGTGG CACATGGAGA AACATCTCTT TTAAAATATC TCCCAATTAC CCTTTTCACA 480
ATTTGTATCC ACCTAGGATT TGCTGCTGGG GTAAGTCACT AGATTTATTT CTCAAAGCTC 540
CCCTCTCTAT GAGCTGAAAG ACTGACCAAC CATGAACACT AGTAGGGGAT GGGGAAAGGG 600 
GACAGAGCAG AGCCAGTTGT TCCACACTTT GGGAAGCAGG AGTAGCTTTT ATCATCTTCC 660
TCTGGGGAGC AGGCATAGAG ACATAAACTG AGTGAAAATG GGTGGAGGAA GAACTTCTAT 720 
ACCCACGAAC AACATGTGAA GAGAGAGAAC CAAACATAAA GTAAGGAGGG TAGACGTTAC 780
ATCCAAGAGG AAATAATCCA GGCAAGGAAG CACAAGCTGA TCAAGATGTG TAGTTCTGTG 840
GCTGCCAAGT TGTGGTTTTT GACAGATCGT CGCATCAGGG AAGACTATCC TCAAAAAGAG 900
ATTTTACGAG CATTGAAGGC CAAATGTTGT GAGGAGGAAC TGGACTTTAG GGCTGTGGTG 960
ATGGATGAGG TGGTGCTGAC AATCGAGCAA GGAAACCTGG GTCTGCGGAT CAATGGAGAG 1020
CTAATCACTG CCTACCCACA AGTGGTGGTA GTCAGAGTAC CAACCCCTTG GGTGCAAAGT 1080 
GATAGTGACA TCACTGTTTT GCGCCATCTA GAGAAGATGG GATGTCGGTT AATGAACCGA 1140
CCTCAAGCCA TCCTGAACTG CGTTAATAAG TTCTGGACAT TTCAAGAGTT GGCTGGCCAT 1200
GGTGTTCCTC TGCCGGATAC TTTCTCTTAT GGTGGOCACG AAAATTTTGC TAAAATGATT 1260 
GATGAGGGTG AAGTTCTGGA GTTCCCAATG GTAGTAAAGA ATACGCGGGG TCACAGAGGT 1320 
AAAGCTGTTT TCTTGGCTCG AGATAAGCAC CATTTGGCTG ATCTAAGCCA TCTTATTCGC 1380 
CATGAAGCGC CATACCTGTT CCAGAAGTAT GTTAAAGAGT CTCATGGACG GGATGTACGT 1440 
GTCATTGTCG TGGGAGGCCG TGTGGTTGGC ACCATGTTAC GTTGTTCAAC AGATGGGAGA 1500 
ATGCAAAGCA ACTGCTCATT AGGTGGTGTG GGGATGATGT GCTCATTGAG TGAACAAGGG 1560 
AAGCAGCTAG CTATCCAGGT GTCTAATATC CTGGGGATGG ATGTGTGTGG CATTGACCTG 1620 
CTGATGAAAG ATGACGGCTC CTTCTGCGTC TGTGAGGCCA ATGCAAATGT AGGTTTCATC 1680
GCCTTTGATA AGGCTTGTAA TCTAGATGTA GCTGGTATCA TAGCAGACTA TGCCGCCTCC 1740 
CTTCTACCCT CTGGCCGGCT CACCCGGCGT ATGTCCCTGC TCTCCGTGGT GTCCACTGCC 1800
AGTGAGACTA GTGAGCCGGA GCTGGGTCCC CCAGCCAGCA CTGCTGTTGA CAACATGAGT 1860 
GCAAGTTCCA GCTCTGTTGA CAGCGACCCT GAAAGCACGG AGCGAGAGCT GCTCACCAAG 1920 
CTCCCAGGGG GCCTGTTCAA CATGAACCAG CTGCTAGCCA ATGAAATCAA ACTACTGGTG 1980
GACTGACTCC ACTGGTAATT AACCAACAAA ACCCTTGTAA AACTTTCTTT CTTCTTTTCT 2040 
ATTTTTAAAA CCAACTTGCA ATGCTGTTCA TGGAGGATGC TCAGGAAGAT GAGAGAAAAT 2100 
TAGTAGGATT AGTTGGAGAG AGTGGGAGAT AGATGAGACC TCTGCTAGTA AGATGTTACT 2160 
TTCATTTACA AATCCTACAA ATAGAGAGGC AGAATAGGTG GGGTATAGAA AAATGTCAGG 2220 
CTCTCATAGT TACCCTTTTA AATTGCTAAA AAATGTGTAT GCTGATAGGC CATGAGGAAC 2280
AAATACTTTT TTTTTTTCAT GGTCCCTTGC TTTTGTTTTT GTACAAAAAA AAATGGTTTT 2340 
GCTACAAATA TCCAAGTAGC ATAACTTCAC ATTGTGTTGG AAGATTTGf C ATCAGTGAGG 2400 
AAAACATCTG CTTAAATTAC AGGAATTTTT GTATTATACA GCTCTGAAAA TTCTGCCATT 2460 
TCCTTATTAA CTAGCAGCTT TAGTTTGTAG TTTATGAAAT CTTGAGGGGC TCTTTTACTG 2520
GGATTTCTTA TTTTTTTOTT TTTTCCCGCT TAATTTGGTG GGAGGTCAAA TTGAATATAA 2580 
CCCAATAAAG GCTTCTTAAT GACAAAATTG GCATGTTTGC ATGATGAAAT GGAAATGAAC 2640
AGTATTGCAA TGTCCGGTAT ACAAAATAAC ATTAATTCAA TGTAGATAAA ATTACACTAG 2700 
TTTAAAATAT GTGGATTCAC TTGTATTTGT TAGTGTTTTA GTCTTTTTTG AAAGATGTGC 2760 
TCTGTTAATG TTGCTTTTTT ' lT I T 'l ' l 11 1 1 TAATACATGC TAGTCTA ACA TTTCCTGCTC 2820 
TATGCCTGCA TCTTTAACAA TGGCCAAAGT GAAOAAAATG CTACCTTTTT TGTTAACAAG 2880 
ACACTGACTT GAAACATGTA CATTTAAAGC CTTTTATTTT TTCCCTTTTT GTTTTGGTAG 2940 
TTGGGCATTT AAATAAGGAC AAGGAAAAAT ATTTTTGGGG GCAAATCAAG AGCCTATGAG 3000
TTCTAAGTAT AAAGCTGAAG TGATTTCGAA TGCCAGCGTT ATATATTTGC A Tl'I'l 1' CACA 3060 
TTTTACGAGG GAGTATATGT GTATGTGTGT GCACGCATGC ATGTGTATGT GTTTTGCTTT 3120
TTGTTTCCAT CAACTAATCA AAAAGGATAA TTTAGAAAAT GGAGCATGAT GGGAAACAGA 3180 
GTTTTTGACT TTAAAAAACA GATGAGTTGT TTTCATAAGT AGACTCCACT GGGGTAGAGG 3240
TATTCACCTT AAAACATAGG GTGAGTAGAT GCTTTTTTAG GCCTTTTTGT GTATATGTAC 3300 
GTTGTTTGTT TTTTTCCTTT TGTTTCTAGC CTGTTCAGTG TACAGTTTAT TCAAGGCTAC 3360 
ATGCTTTTCT TTAATGCTTC TGGCTATGCA TTTTCTCTTT TTACATATAG GATTTGGGAT 3420 
TGGGGGTGGG TTGGATGTTT TTGTTTGGGG ACTTATTTAG TAGTATTGAG TCTCTTATAG 3480
CCCTACTCTT AAGCCTTCAA TACTGTCCAC TCTTTATATT CCTTTACTTG CAGAATTTAT 3540
AAAAGCCCCC AAACTGCATA TAATATGAGC CTTTAAAACA TGGGTAAAAC TAATCCCATT 3600
GATGGGTTTG GATGGTATGT TAAGAAATGG AGATGCTGCA GAGCCCAACG TAATTTTTTA 3660 
AACAGCAAGT TTTCCATCTC CCTACGAATC CTCTGAAGCT TTTACCCAAG CCCTTTCTTG 3720 
CCTCTCCAGT GCTATTTTCC TTCAGATGGA CCTTAAACAT AATTTCTTGG ACACTACTAG 3780 
AGAGACTTCG AGGCAATAAT AAAAGATCAG TATTAACCAG CTATAACAGA GGTTTGATCA 3840 
TGCTTACTTG TACAGTTTTT CCCCCGTTTT AAAAAGGAAT GTAATAAAAT TTGTTTTTTC 3900
CATAGAATTA AATAATATTA AAATTGAGTG AAAGGTTGAT TGTTGATGAA TAGAATAGTA 3960
CCTCTCATCT GTGCAGTGTC TCATTTCACC TCAGAGAAAA GGATACATAA GAGGAGTTTG 4020
TAATTTATCT TAGGATATTC TAATTGCATT TAAAAGAACT TATCTTGCGC AGGGTAAATG 4080 
GGGGACTCAC ATACATATAT TAATACCTCT GACTCATTAA CAGAAAGAAA TACTTGGTAC 4140 
TTCTTTCGCT GAATGACCAT ACTGTGGAGG ATGCATACTA TTTGGTATAG AGAAATAAAT 4200
GAGGAAGAAA GAACTGCTTA ATTAAATTAT CATTCATATG TTCATATAGA GACCATCTGG 4260
TTGCCATGTG TATTATGACA CATACACTTT GAATAGTTAC ATATCACAAG TATGTAGTTC 4320 
ATGTTTGTGT TGGTGGGGTA AGGCATCAGG AAAAATGTAG TTAGTCTTTT CTTAACTTAT 4380
ACCAAATTAA CCAACTATAT TATAGGAAAT ATGTGAAATT AGTTCATTAG CTTTATTCAC 4440
TATTATGCAT TCACATGATA TTAAAACGTA CACTCACATG TTAGAATGAA AAGAGCAGTA 4500
GTTATCTTAG ATTTTAAAAA CATGGATATC TTCTTGAATT CCTTCAAGAT TGAGGTAGAG 4560
AATAAGAGCA AATCATTCTG GAAGTACCTT AAGGAAACAA ACAGCAGCAG ATATTTAGGT 4620 
TAAACTTATT TTCATAATTG TTTAATAACT TTTGTATAAT CTTCATTGCT ATTATGAGAG 4680 
AGAATGTATA TATCAAATAT GTGTAATGAT AAAATCTGAA TTGTAAAATT TTTGTATATT 4740
GTTAAAATTG TAATTCTAAA TTGTATTTCA AAAATGATTA TTTCTGATAT TGI 1 1 1 1 ATG 4800 
TCACCCATGA TGAAAACTGG ACTTTATATA TCTAAACATA CAAGTATGAA CTATTCTATT 4860 
TAAAATTTTT AATAGTTTTT TTCTTTTTTG GTGCCTATAA TTGATTGGTC ATTTCTGCTG 4920 
GCTTTTCTCC AATGAACATT GAAATCTTCC TGTATATGTT ACCAATAAGA AAACTACCCT 4980
GGAACAGTAG AAAAACCCAA CAAGAGACTT GGCATTCATC AAGCACATTA TCAGACTTTG 5040 
AGAACATATT GAAGGCATTG ACTTTGAAAA TCATCTCTTT TTCTCAAGAA GAAAGCAATG 5100
GAGAAGCAAA TTTGTTTCAT TCAGTGAATC CCCAGTTTGG GGCTTGTGGG GCTTAGAGAC 5160
ATTGTGAAAT CAAATCTTGT GTTATACTTT TCTCCTGGCT CACTTTTTTT GAGAAGGTTT 5220 
ATGGGCTATT TGGCTGGTGA GACACGATCC CCTCCTAAGA AAATGTAGGT GCTCAGACAG 5280 
GTAACCACTG CTGCTACTGT TTTTATTTGT TTGTTTGTTC AATTTTATTT AAGATTTGTT 5340 
TTTGTTGTAC TAGGATTTTA AAAAATGTAA TATATTGCAG GATTTATAAC CAG

Seq ID NO: 72 Protein sequence: 
Protein Accession #: BAA86552 



1 11 21 31 41 51 
| | | | | |
MCSSVAAKLW FLTDRRIRED YPQKEILRAL KAKCCEEELD FRAVVMDEVV LTIEQGNLGL 60
RINGELITAY PQVVVVRVPT PWVQSDSDIT VLRHLEKMGC RLMNRPQAIL NCVNKFWTFQ 120
ELAGHGVPLP DTFSYGGHEN FAKMIDEAEV LEFPMVVKNT RGHRGKAVFL ARDKHHLADL 180
SHLIRHEAPY LFQKYVKESH GRDVRVIVVG GRVVGTMLRC STDGRMQSNC SLGGVGMMCS 240
LSEQGKQLAI QVSNILGMDV CGIDLLMKDD GSFCVCEANA NVGFIAFDKA CNLDVAGIIA 300
DYAASLLPSG RLTRRMSLLS VVSTASETSE PELGPPASTA VDNMSASSSS VDSDPESTER 360
ELLTKLPGGL FNMNQLLANE IKLLVD

Seq ID NO: 73 DNA sequence 

Nucleic Acid Accession #: XM_040080.2

Coding sequence: 159-1104 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51
| | | | | |

CTGAGTGGGG GCGGGGACTG CTGGAGTTGC GGGGCCTGCC TGGGGTAGGG CGGGGCAGGA 60 
CAGCTTGGAG ATAGGGCCCG GAATTGCGGG CGTCACTCTG CTCCTGCGAC CTAGCCAGGC 120 
GTGAGGGAGT GACAGCAGCG CATTCGCGGG ACGAGAGCGA TGAGTGAGAA CGCCGCACCA 180
GGTCTGATCT CAGAGCTGAA GCTGGCTGTG CCCTGGGGCC ACATCGCAGC CAAAGCCTGG 240
GGCTCCCTGC AGGOCCCTCC AGTTCTCTGC CTGCACGGCT GGCTGGACAA TGCCAGCTCC 300 
TTCGACAGAC TCATCCCTCT TCTCCCGCAA GACTTTTATT ACGTTGCCAT GGATTTCGGA 360 
GGf CATGGGC TCTCGTOCCA TTACAGCCCA GGTGTCCCAT ATTACCTCCA G ACTTTTGTG 420 
AGTGAGATCC GAAGAGTTGT GGCAGCCTTG AAATGGAATC GATTCTCCAT TCTGGGCCAC 480
AGCTTCGGTG GCGTCGTGGG CGGAATGTTT TTCTGTACCT TCCCCGAGAT GGTGGATAAA 540
CTTATCTTGC TGGACACGCC GCTCTTTCTC CTGGAATCAG ATGAAATGGA GAACTTGCTG 600
ACCTACAAGC GGAGAGCCAT AGAGCACGTG CTGCAGGTAG AGGCCTCCCA GGAGCCCTCG 660 
CACGTGTTCA GCCTGAAOCA GCTGCTGCAG AGGTTACTGA AGAGCAATAG CCACTTGAGT 720 
GAGGAGTGCG GGGAGCTTCT CCTGCAAAGA GGAACCACGA AGGTGGCCAC AGGTCTGGTT 780 
CTGAACAGAG ACCAGAGGCT CGCCTGGGCA GAGAACAGCA TTGACTTCAT CAGCAGGGAG 840
CTGTGTGCGC ATTCCATCAG GAAGCTGCAG GCCCATGTCC TGTTGATCAA AGCAGTCCAC 900 
GGATATTTTG ATTCAAGACA GAATTACTCT GAGAAGGAGT CCCTGTCGTT CATGATAGAC 960
ACGATGAAAT CCACCCTCAA AGAGCAGTTC CAGTTTGTGG AAGTCCCAGG CAATCACTGT 1020
GTCCACATGA GCGAACCCCA GCACGTGGCC AGTATCATCA GCTCCTTCTT ACAGTGCACA 1080 
CACATGCTCC CAGCCCAGCT GTAGCTCTGG GCCTGGAACT ATGAAGACCT AGTGCTCCCA 1140
GACTCAACAC TGGGACTCTG AGTTCCTGAG CCCCACAACA AGGCCAGGGA TGGTGGGGAC 1200 
AGGCCTCACT AGTCTTGAGG CCCAGCCTAG GATGGTAGTC AGGGGAAGGA GCGAGATTCC 1260
AACTTCAACA TCTGTGACCT CAAGGGGGAG ACAGAGTCTG GGTTCCAGGG CTGCTTTCTC 1320 
CTGGCTAATA ATAAATATCC AGCCAGCTGG AGGAAGGAAG GGCAGGCTGG GCCCACCTAG 1380 
CCTTTCCCTG CTGCCCAACT GGATGGAAAA TAAAAGGTTC TTGTATTCTC A 

Seq ID NO: 74 Protein sequence: 
Protein Accession #: XP_040080.1

1 11 21 31 41 51 
| | | | | |

MSENAAPGLI SELKLAVPWG HIAAKAWGSL QGPPVLCLHG WLDNASSFDR LIPLLPQDFY 60 
YVAMDFGGHG LSSHYSPGVP YYLQTFVSEI RRVVAALKWN RFSILGHSFG GVVGGMFFCT 120
FPEMVDKLIL LDTPLFLLES DEMENLLTYK RRAIEHVLQV EASQEPSHVF SLKQLLQRLL 180
KSNSHLSEEC GELLLQRGTT KVATGLVLNR DQRLAWAENS IDFISRELCA HSIRKLQAHV 240
LLIKAVHGYF DSRQNYSEKE SLSFMIDTMK STLKEQFQFV EVPGNHCVHM SEPQHVASII 300
SSFLQCTHML PAQL

Seq ID NO: 75 DNA sequence 

Nucleic Acid Accession #: NM_005794

Coding sequence: 434-1276 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | |

GGTTCCCTTC CACGCTGTGA AGCTTTGTTC TTTTGGTCTT CATGATAAAT CTTGCTGCTG 60 
CTCACTCGTT GGGTCCGTGC CACCTTTAAG AGCTGTAACA CTCACCGCGA AGGTCTGCAA 120 
CTTCACTCCT GGGGCCAGCA AGACCACGAA TGCACCGAGA GGAATGAACA ACTCTGGACA 180
CACCATCTTT AAGAACCGTA ATACTCACCG CAAGGGTCTG CAACTTCATT CTTGAAGTCA 240
GTGAGGCCAA GAACCCATCA ATTCCGTACA CATTTTGGTG ACTTTGAAGA GACTGTCACC 300 
TATCACCAAG TGGTGAGACT ATTGCCAAGC AGTGAGACTA TTGCCAAGTG GTGAGACCAT 360
CACCAAGCGG TGAGACTATC ACCTATCGCC AAGTGGCCTG ATTCAGCAGG AAGCATCTCA 420 
GACACCAACC ACTATGCTGT CAGCAGTTGC CCGGGGCTAC CAGGGCTGGT TTCATCCCTG 480
TGCTAGGCTT TCTGTGAGGA TGAGCAGCAC CGGGATAGAC AGGAAGGGCG TCCTGGCTAA 540 
CCGGGTAGCC GTGGTCACGG GGTCCACCAG TGGGATCGGC TTTGCCATCG CCCGACGTCT 600 
GGCCCGGGAC GGGGCCCACG TGGTCATCAG CAGCCGGAAG CAGCAGAACG TGGACCGGGC 660
CATGGCCAAG CTGCAGGGGG AGGGGCTGAG TGTGGCGGGC ATTGTGTGCC ACGTGGGGAA 720 
GGCTGAGGAC CGGGAGCAGC TGGTGGCCAA GGCCCTGGAG CACTGTGGGG GCGTCGACTT 780 
CCTGGTGTGC AGCGCAGGGG TCAACCCTCT GGTAGGGAGC ACTCTGGGGA CCAGTGAGCA 840 
GATCTGGGAC AAGATCCTAA GTGTGAACGT GAAGTCCCCA GCCCTGCTGC TGAGCCAGTT 900
GCTGCCCTAC ATGGAGAACA GGAGGGGTGC TGTCATCCTG GTCTCTTCCA TTGCAGCTTA 960 
TAATCCAGTA GTGGCGCTGG GTGTCTACAA TGTCAGCAAG ACAGCGCTGC TGGGTCTCAC 1020 
TAGAACACTG GCATTGGAGC TGGCCCCCAA GGACATCCGG GTAAACTGCG TGGTTCCAGG 1080 
AATTATAAAA ACTGACTTCA GCAAAGTGTT TCATGGGAAT GAGTCTCTCT GGAAGAACTT 1140
CAAGGAACAT CATCAGCTGC AGAGGATTGG GGAGTCAGAG GACTGTGCAG GAATCGTGTC 1200
CTTCCTGTGC TCTCCAGATG CCAGCTACGT CAACGGGGAG AACATTGCGG TGGCAGGCTA 1260 
CTCCACTCGG CTCTGAGAGG AGTGGGGGCG GCTGCGTAGC TGTGGTCCCA GCCCAGGAGC 1320 
CTGAGGGGGT GTCTAGGTGA TCATTTGGAT CTGGAGCAGA GTCTGCCATT CTGCCAGACT 1380 
AGCAATTTGG GGGCTTACTC ATGCTAGGCT TGAGGAAGAA GAAAAACGCT TCGGCATTCT 1440 



Seq ID NO: 76 Protein sequence: 
Protein Accession #: NP_005785

1 11 21 31 41 51 
| | | | | |

MLSAVARGYQ GWFHPCARLS VRMSSTGIDR KGVLANRVAV VTGSTSGIGF AIARRLARDG 60 
AHVVISSRKQ QNVDRAMAKL QGEGLSVAGI VCHVGKAEDR EQLVAKALEH CGGVDFLVCS 120
AGVNPLVGST LGTSEQIWDK ILSVNVKSPA LLLSQLLPYM ENRRGAVILV SSIAAYNPVV 180
ALGVYNVSKT ALLGLTRTLA LELAPKDIRV NCVVPGIIKT DFSKVFHGNE SLWKNFKEHH 240
QLQRIGESED CAGIVSFLCS PDASYVNGEN IAVAGYSTRL 

Seq ID NO: 77 DNA sequence

Nucleic Acid Accession #: NM_002425 

Coding sequence: 26-1453 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| | | | | |

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 
AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120
TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180
AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 240
GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300
TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360
TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 
TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 
AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 
TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600
TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 
CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 
TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780
TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 
GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 
GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 960 
TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020
CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 
TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 1140
AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 
CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 
TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 
GCCTAAGGTT GATGCTGTAT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 
ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAGCTG 1440 
GTTACATTGC TAGGCGAOAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500
ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 1560
GAAGAAGATG AGCCTTGCAG ATATCTGCAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620
ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680
ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740
CTT 

Seq ID NO: 78 Protein sequence: 
Protein Accession #: NP_002416

1 11 21 31 41 51 

| | | | | |

MHLAFLVLLC LPVCSAYPLS GAAKEEDSNK DLAQQYLEKY YNLEKDVKQF RRKDSNLIVK 60
KIQGMQKFLG LEVTGKLDTD TLEVMRKPRC GVPDVGHFSS FPGMPKWRKT HLTYRIVNYT 120
PDLPRDAVDS AIEKALKVWE EVTPLTFSRL YEGEADIMIS FAVKEHGDFY SFDGPGHSLA 180
HAYPPGPGLY GDIHFDDDEK WTEDASGTNL FLVAAHELGH SLGLFHSANT EALMYPLYNS 240
FTELAQFRLS QDDVNGIQSL YGPPPASTEE PLVPTKSVPS GSEMPAKCDP ALSFDAISTL 300
RGEYLFFKDR YFWRRSHWNP EPEFHLISAF WPSLPSYLDA AYEVNSRDTV FIFKGNEFWA 360
IRGNEVQAGY PRGIHTLGFP PTIRKIDAAV SDKEKKKTYF FAADKYWRFD ENSQSMEQGF 420
PRLIADDFPG VEPKVDAVLQ AFGFFYFFSG SSQFEFDPNA RMVTHILKSN SWLHC

Seq ID NO: 79 DNA sequence 

Nucleic Acid Accession #: NM_000493.1

Coding sequence: 1-2043 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | |

ATGCTGCCAC AAATACCCTT TTTGCTOCTA GTATCCTTGA ACTTGGTTCA TGOAGTGTTT 60 
TACGCTGAAC GATACCAAAC GCCCACAGGC ATAAAAGGCC CACTACCCAA CACCAAGACA 120 
CAGTTCTTCA TTCCCTACAC CATAAAGAGT AAAGGTATAG CAGTAAGAGG AGAGCAAGGT 180
ACTCCTGGTC CACCAGGCCC TGCTGGACCT CGAGGGCACC CAGGTCCTTC TGGACCACCA 240
GGAAAACCAG GCTACGGAAG TCCTGGACTC CAAGGAGAGC CAGGGTTGCC AGGACCACCG 300
GGACCATCAG CTGTAGGGAA ACCAGGTGTG CCAGGACTCC CAGGAAAACC AGGAGAGAGA 360 
GGACCATATG GACCAAAAGG AGATGTTGGA CCAGCTGGCC TACCAGGACC CCGGGGCCCA 420 
CCAGGACCAC CTGGAATCCC TGGACCGGCT GGAATTTCTG TGCCAGGAAA ACCTGGACAA 480 
CAGGGACCCA CAGGAGCCCC AGGACCCAGG GGCTTTCCTG GAGAAAAGGG TGCACCAGGA 540 
GTCCCTGGTA TGAATGGACA GAAAGGGGAA ATGGGATATG GTGCTCCTGG TCGTCCAGGT 600 
GAGAGGGGTC TTCCAGGCCC TCAGGGTCCC ACAGGACCAT CTGGCCCTCC TGGAGTGGGA 660 
AAAAGAGGTG AAAATGGGGT TCCAGGACAG CCAGGCATCA AAGGTGATAG AGGTTTTCCG 720
GGAGAAATGG GACCAATTGG CCCACCAGGT CCCCAAGGCC CTCCTGGGGA ACGAGGGCCA 780 
GAAGGCATTG GAAAGCCAGG AGCTGCTGGA GCCCCAGGCC AGCCAGGGAT TCCAGGAACA 840
AAAGGTCTCC CTGGGGCTCC AGGAATAGCT GGGCCCCCAG GGCCTCCTGG CTTTGGGAAA 900 
CCAGGCTTGC CAGGCCTGAA GGGAGAAAGA GGACCTGCTG GCCTTCCTGG GGGTCCAGGT 960 
GCCAAAGGGG AACAAGGGCC AGCAGGTCTT CCTGGGAAGC CAGGTCTGAC TGGACCCCCT 1020 
GGGAATATGG GACCCCAAGG ACCAAAAGGC ATCCCGGGTA GCCATGGTCT CCCAGGCCCT 1080 
AAAGGTGAGA CAGGGCCAGC TGGGCCTGCA GGATACCCTG GGGCTAAGGG TGAAAGGGGT 1140
TCCCCTGGGT CAGATGGAAA ACCAGGGTAC CCAGGAAAAC CAGGTCTCGA TGGTCCTAAG 1200 
GGTAACCCAG GGTTACCAGG TCCAAAAGGT GATCCTGGAG TTGGAGGACC TCCTGGTCTC 1260 
CCAGGCCCTG TGGGCCCAGC AGGAGCAAAG GGAATGCCCG GACACAATGG AGAGGCTGGC 1320 
CCAAGAGGTG CCCCTGGAAT ACCAGGTACT AGAGGCCCTA TTGGGCCACC AGGCATTCCA 1380
GGATTCCCTG GGTCTAAAGG GGATCCAGGA AGTCCCGGTC CTCCTGGCCC AGCTGGCATA 1440 
GCAACTAAGG GCCTCAATGG ACCCACCGGG CCACCAGGGC CTCCAGGTCC AAGAGGCCCC 1500 
TCTGGAGAGC CTGGTCTTCC AGGGCCCCCT GGGCCTCCAG GCCCACCAGG TCAAGCAGTC 1560 
ATGCCTGAGG GTTTTATAAA GGCAGGCCAA AGGCCCAGTC TTTCTGGGAC CCCTCTTGTT 1620 
AGTGCCAACC AGGGGGTAAC AGGAATGCCT GTGTCTGCTT TTACTGTTAT TCTCTCCAAA 1680 
GCTTACCCAG CAATAGGAAC TCCCATACCA TTTGATAAAA TTTTGTATAA CAGGCAACAG 1740 
CATTATGACC CAAGGACTGG AATCTTTACT TGTCAGATAC CAGGAATATA CTATTTTTCA 1800 
TACCACGTGC ATGTGAAAGG GACTCATGTT TGGGTAGGCC TGTATAAGAA TGGCACCCCT 1860
GTAATGTACA CCTATGATGA ATACACCAAA GGCTACCTGG ATCAGGCTTC AGGGAGTGCC 1920
ATCATCGATC TCACAGAAAA TGACCAGGTG TGGCTCCAGC TTCCCAATGC CGAGTCAAAT 1980
GGCCTATACT CCTCTGAGTA TGTCCACTCC TCTTTCTCAG GATTCCTAGT GGCTCCAATG 2040 
TGAGTACACC CCACAGAGCT AATCTAAATC TTGTGCTAGA AAAAGCATTC TCTAACTCTA 2100
CCCCACCCTA CAAAATGCAT ATGGAGGTAG GCTGAAAAGA ATGTAATTTT TATTTTCTGA 2160 
AATACAGATT TGAGCTATCA GACCAACAAA CCTTCCCCCT GAAAAGTGAG CAGCAACGTA 2220
AAAACGTATG TGAAGCCTCT CTTGAATTTC TAGTTAGCAA TCTTAAGGCT CTTTAAGGTT 2280 
TTCTCCAATA TTAAAAAATA TCACCAAAGA AGTCCTGCTA TGTTAAAAAC AAACAACAAA 2340 
AAACAAAGCA ACAAAAAAAA AAATTAAAAA AAAAAACAGA AATAGAGCTC TAAGTTATGT 2400 
GAAATTTGAT TTGAGAAACT CGGCATTTCC TTTTTAAAAA AGCCTGTTTC TAACTATGAA 2460 
TATGAGAACT TCTAGGAAAC ATCCAGGAGG TATCATATAA CTTTGTAGAA CTTAAATACT 2520 
TGAATATTCA AATTTAAAAG ACACTGTATC CCCTAAAATA TTTCTGATGG TGCACTACTC 2580 
TGAGGCCTGT ATGGCCCCTT TCATCAATAT CTATTCAAAT ATACAGGTGC ATATATACTT 2640 
GTTAAAGCTC TTATATAAAA AAGCCCCAAA ATATTGAAGT TCATCTGAAA TGCAAGGTGC 2700 
TTTCATCAAT GAACCTTTTC AAAACTTTTC TATGATTGCA GAGAAGCTTT TTATATACCC 2760 
AGCATAACTT GGAAACAGGT ATCTGACCTA TTCTTATTTA GTTAACACAA GTGTGATTAA 2820 
TTTGATTTCT TTAATTCCTT ATTGAATCTT ATGTGATATG ATTTTCTGGA TTTACAGAAC 2880 
ATTAGCACAT GTACCTTGTG CCTCCCATTC AAGTGAAGTT ATAATTTACA CTGAGGGTTT 2940 
CAAAATTCGA CTAGAAGTGG AGATATATTA TTTATTTATG CACTGTACTG TATTTTTATA 3000 
TTGCTGTTTA AAACTTTTAA GCTGTGCCTC ACTTATTAAA GCACAAAATG TTTTACCTAC 3060 
TCCTTATTTA CGACACAATA AAATAACATC AATAGATTTT TAGGCTGAAT TAATTTGAAA 3120 
GCAGCAATTT GCTGTTCTCA ACCATTCTTT CAAGGCTTTT CATTCGACAC AATAAAATAA 3180 
CATCAATAGA TTTTTAGG 

Seq ID NO: 80 Protein sequence: 
Protein Accession #: NP_000484.1 



1 11 21 31 41 51 

I I I I I I 

MLPQIPFLLL VSLNLVHGVF YAERYQTPTG HCGPLPNTKT QFFIPYTIKS KGIAVRGEQG 60 
TPGPPGPAGP RGHPGPSGPP GKPGYGSPGL QGEPGLPGPP GPSAVGKPGV PGLPGKPGER 120 
GPYGPKGDVG PAGLPGPRGP PGPPGIPGPA GISVPGKPGQ QGPTGAPGPR GFPGEKGAPG 180 
VPGMNGQKGE MGYGAPGRPG ERGLPGPQGP TGPSGPPGVG KRGENGVPGQ PGIKGDRGFP 240 
GEMGPIGPPG PQGPPGERGP EGIGKPGAAG APGQPGIPGT KGLPGAPGIA GPPGPPGFGK 300 
PGLPGLKGER GPAGLPGGPG AKGEQGPAGL PGKPGLTGPP GNMGPQGPKG IPGSHGLPGP 360 
KGETGPAGPA GYPGAKGERG SPGSDGKPGY PGKPGLDGPK GNPGLPGPKG DPGVGGPPGL 420 
PGPVGPAGAK GMPGHNGEAG PRGAPGIPGT RGPIGPPGIP GFPGSKGDPG SPGPPGPAGI 480 
ATKGLNGPTG PPGPPGPRGP SGEPGLPGPP GPPGPPGQAV MPEGFIKAGQ RPSLSGTPLV 540 
SANQGVTGMP VSAFTVILSK AYPAIGTPIP FDKILYNRQQ HYDPRTGIFT CQIPGIYYFS 600 
YHVHVKGTHV WVGLYKNGTP VMYTYDEYTK GYLDQASGSA IIDLTENDQV WLQLPNAESN 660 
GLYSSEYVHS SFSGFLVAPM 

Seq ID NO: 81 DNA sequence 

Nucleic Acid Accession #: NM_000786 

Coding sequence: 332-1861 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 
I I I I I I 

CGCGATTCTC AGGGATTGAT CCGCCTCTTC AGGTAAGTTA TCTTCCGGCC CCGTACCACT 60 
GTGCCACAGG CGCAGCCCGC TTCCTCAGGT GCCCTATCCC GCGCAGAAGA CCACGGCTTC 120 
ACAGAGTGTT ATTTAAOGGC GTGGCCAGCG GAACATCCCG CCCCATTCTG TGACGCACGO 180 
GGTGGCGCGC GTGGGACCCG AGGGGTGGGG CTGGGTTTAG TAGGAGACCT GGGGCAAGGC 240 
CCCCTGTGGA CGACCATCTG CCAGCTTCTC TCGTTCCGTC GATTGGGAGG AGCGGTGGCG 300 
ACCTCGGCCT TCAGTGTTTC CGACGGAGTG AATGGCGGCG GCGGCTGGGA TGCTGCTGCT 360 
GGGCTTGCTG CAGGCGGGTG GGTCGGTGCT GGGCCAGGCG ATGGAGAAGG TGACAGGCGG 420 
CAACCTCTTG TCCATGCTGC TGATCGCCTG CGCCTTCACC CTCAGCCTGG TCTACCTGAT 480 
CCGTCTGGCC GCCGGCCACC TGGTCCAGCT GCCCGCAGGG GTGAAAAGTC CTCCATACAT 540 
TTTCTCCCCA ATTCCATTCC TTGGGCATGC CATAGCATTT GGGAAAAGTC CAATTGAATT 600 
TCTAGAAAAT GCATATGAGA AGTATGGACC TGTATTTAGT TTTACCATGG TAGGCAAGAC 660 
ATTTACTTAC CTTCTGGGGA GTGATGCTGG TGCACTGCTT TTTAATAGTA AAAATGAAGA 720 
CCTGAATGCA GAAGATGTCT ACAGTCGCCT GACAACACCT GTGTTTGGGA AGGGAGTTGC 780 
ATACGATGTG CCTAATCCAG TTTTCTTGGA GCAGAAGAAA ATGTTAAAAA GTGGCCTTAA 840 
CATAGCCCAC TTTAAACAGC ATGTTTCTAT AATTGAAAAA GAAACAAAGG AATACTTTGA 900 
GAGTTGGGGA GAAAGTGGAG AAAAAAATGT GTTTGAAGCT CTTTCTGAGC TCATAATTTT 960 
AACAGCTAGC CATTGTTTGC ATGGAAAGGA AATCAGAAGT CAACTCAATG AAAAGGTAGC 1020 
ACAGCTGTAT GCAGATTTGG ATGGAGGTTT CAGCCATGCA GCCTGGCTCT TACCAGGTTG 1080 
GCTGCCTTTG CCTAGTTTCA GACGCAGGGA CAGAGCTCAT CGGGAAATCA AGGATATTTT 1140 
CTATAAGGCA ATCCAGAAAC GCAGACAGTC TCAAGAAAAA ATTGATGACA TTCTCCAAAC 1200 
TTTACTAGAT GCTACATACA AGGATGGGCG TCCTTTGACT GATGATGAAG TAGCAGGGAT 1260 
GCTTATTGGA TTACTCTTGG CAGGGCAGCA TACATCCTCA ACTACTAGTG CTTGGATGGG 1320 
CTTCTTTTTG GCCAGAGACA AAACACTTCA AAAAAAATGT TATTTAGAAC AGAAAACAGT 1380 
CTGTGGAGAG AATCTGCCTC CTTTAACTTA TGACCAGCTC AAGGATCTAA ATTTACTTGA 1440 
TCGCTGTATA AAAGAAACAT TAAGACTTAG ACCTCCTATA ATGATCATGA TGAGAATGGC 1500 
CAGAACTCCT CAGACTGTGG CAGGGTATAC CATTCCTCCA GGACATCAGG TGTGTGTTTC 1560 
TCCCACTGTC AATCAAAGAC TTAAAGACTC ATGGGTAGAA CGCCTGGACT TTAATCCTGA 1620 
TCGCTACTTA CAGGATAACC CAGCATCAGG GGAAAAGTTT GCCTATGTGC CATTTGGAGC 1680 
TGGGCGTCAT CGTTGTATTG GGGAAAATTT TGCCTATGTT CAAATTAAGA CAATTTGGTC 1740 
CACTATGCTT CGTTTATATG AATTTGATCT CATTGATGGA TACTTTCCCA CTGTGAATTA 1800 
TACAACTATG ATTCACACCC CTGAGAACCC AGTTATCCGT TACAAACGAA GATCAAAATG 1860 
AAAAAGGTTG CAAGGAACGA ATATATGTGA TTATCACTGT AAGCCACAAA GGCATTCGAA 1920 
GAGAATGAAG TGTACAAAAC AACTCTTGTA GTTTACTGTT TTTTTAAGTG TGTAATTCTA 1980 
AAAGCCAGTT TATGATTTAG GATTTTGTTA ACTGAATGGT TCTATCAAAT ATAATAGCAT 2040 
TTGACACATT TTCTAATAGT TATGATACTT ATACATGTGC TTTCAGGAAG TTCCTTGGTG 2100 
AAACAATTGT TGAGGGGGGA TCTAGGTAAT TGGCAGATTC TAAATAATAT AATTTCCAGA 2160 
TAGTAATTTT AAGAGTACTC ATCGCTCTTG CCAAATAAGT TCAGGGTATT CAAATCTTGG 2220 
ACTAGTCCTG CAAGGTATAA AGAATAAAAA TCCCAGTGAG ATACTTGGAA ACCACAGTTT 2280 
ATTATTATTT ATCTGGGCAA TTATTGTGTG TGTGAGGATG GAAGGGTAGG GAATAATCGA 2340 
ACATCTAAAG CCTTGAATAA GAGAATACTA ATTGTTTTGG TATGATGATA CTCAGAAATG 2400 
GAGATATTAT AGGAAAAAGA AATCCTTTGG AATTTTAACT AAAATCACTG CATATGGGAA 2460 
ATTAAGAGAT CCAGGACCAT ATTTGATAAG AGTTCCTAAA AATAATGTAA TTATTAATGC 2520 
TAAAGACTGC TCATGTATCT TGATCTAATT ACTAAATAAA TTACATATTT ATTTACCTGA 2580 
TAAATATGTA TCTAGTTCTA CAAGGTCACA TTTATGTGGA AGTCCAAAGT CAAGTCCTTA 2640 
GGGGATAATT TTGTTTTGGG CTCAGTTGTT CCCTGCTTCC TTTTTTTTTT TTTTTTTTTT 2700 
TTGAGATGGA GTCTCGCTCT GTTGCCCAGG CTGGAGTGCA GTGGTGCGAT CTCAGCTCAC 2760 
TGCATCCTCT GCCTCCCGGG TTCAAGCAAT TCTCTGCCTC AGCCTCCCAA GTAGTTGGGA 2820 
TTACAGGCAC CTGCCACCAT GCCTGGCTAA TTTTTTGTAT TTTTAGTAGA GACGGGGGTT 2880 
TCACTATGTT GGCTAGGCTG GTCTTGAACT CCTGAGCCTC GTGAGTCCAC CCGCCTTGGC 2940 
CTCCCAAAGT GCTGGGATTA CAGGCATGAG CCACCGCACC TGGCCTTCCC TGCTTCCTCT 3000 
CTAGAATCCA ATTAGGGATG TTTGTTACTA CTCATATTGA TTAAAACAGT TAACAAACTT 3060 
TTTTCTTTTT AAAATGTGAG ATCAGTGAAC TCTGGTTTTA AGATAATCTG AAACAAGGTC 3120 
CTTGGGAGTA ATAAAATTGG TCACATTCTG TAAAGCACAT TCTGTTTAGG AATCAACTTA 3180 
TCTCAAATTG TAACTCGGGG CCTAACTATA TGAGATGGCT GAAAAAATAC CACATCGTGT 3240 
GTTTTCACTA GGTGATGCCA AAATATTTTG CTTTATGTAT ATTACAGTTC TTTTTAAAAC 3300 
ACTGGAAGAC TCATGTTAAA CTCTAATTGT GAAGGCAGAA TCTCTGCTAA TTTTTCAGAT 3360 
TAAAATTCTC TTTGAAAAAA T 

Seq ID NO: 82 Protein sequence: 
Protein Accession #: NP_000777 

1 11 21 31 41 51 
I I I I I I 

MAAAAGMLLL GLLQAGGSVL GQAMEKVTGG NLLSMLLIAC AFTLSLVYLI RLAAGHLVQL 60 
PAGVKSPPYI FSPIPFLGHA IAFGKSPIEF LENAYEKYGP VFSFTMVGKT FTYLLGSDAA 120 
ALLFNSKNED LNAEDVYSRL TTPVFGKGVA YDVPNPVFLE QKKMLKSGLN IAHFKQHVSI 180 
IEKETKEYFE SWGESGEKNV FEALSELIIL TASHCLHGKE IRSQLNEKVA QLYADLDGGF 240 
SHAAWLLPGW LPLPSFRRRD RAHREIKDIF YKAIQKRRQS QEKIDDILQT LLDATYKDGR 300 
PLTDDEVAGM LIGLLLAGQH TSSTTSAWMG FFLARDKTLQ KKCYLEQKTV CGENLPPLTY 360 
DQLKDLNLLD RCIKETLRLR PPIMIMMRMA RTPQTVAGYT IPPGHQVCVS PTVNQRLKDS 420 
WVERLDFNPD RYLQDNPASG EKFAYVPFGA GRHRCIGENF AYVQIKTIWS TMLRLYEFDL 480 
IDGYFPTVNY TTMIHTPENP VIRYKRRSK 

Seq ID NO: 83 DNA sequence 

Nucleic Acid Accession #: NM_006551.2 

Coding sequence: 64-336 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 
I I I I I I 

AATTCTAGAA GTCCAAATCA CTCATTGTTT GTGAAAGCTG AGCTCACAGC AAAACAAGCC 60 
ACCATGAAGC TGTCGGTGTG TCTCCTGCTG GTCACGCTGG CCCTCTGCTG CTACCAGGCC 120 
AATGCCGAGT TCTGCCCAGC TCTTGTTTCT GAGCTGTTAG ACTTCTTCTT CATTAGTGAA 180 
CCTCTGTTCA AGTTAAGTCT TGCCAAATTT GATGCCCCTC CGGAAGCTGT TGCAGCCAAG 240 
TTAGGAGTGA AGAGATGCAC GGATCAGATG TCCCTTCAGA AACGAAGCCT CATTGCGGAA 300 
GTCCTGGTGA AAATATTGAA GAAATGTAGT GTGTGACATG TAAAAACTTT CATCCTGGTT 360 
TCCACTGTCT TTCAATGACA CCCTGATCTT CACTGCAGAA TGTAAAGGTT TCAACGTCTT 420 
GCTTTAATAA ATCACTTGCT CTAC 

Seq ID NO: 84 Protein sequence: 
Protein Accession #: NP_006542.1 

1 11 21 31 41 51 
I I I I I I 

MKLSVCLLLV TLALCCYQAN AEFCPALVSE LLDFFFISEP LFKLSLAKFD APPEAVAAKL 60 
GVKRCTDQMS LQKRSLIAEV LVKILKKCSV 

Seq ID NO: 85 DNA sequence 

Nucleic Acid Accession #: NM_002317.1 

Coding sequence: 231-1484 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GGGCCAGGAC TGAGAAAGGG GAAAGGGAAG GGTGCCACGT CCGAGCAGCC GCCTTGACTG 60 
GGGAAGGGTC TGAATCCCAC CCTTGGCATT GCTTGGTGGA GACTGAGATA CCCX3TGCTCC 120 
GCTCGCCTCC TTGGTTGAAG ATTTGTCCTT CCCTCACGTG ATTTGAGCCC CGTTTTTATT 180 
TTCTGTGAGC CACGTCCTCC TCGAGCGGGG TCAATCTGGC AAAAGGAGTG ATGCGCTTCG 240 
CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC TGCGCCCCTC 300 
CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC GCCTGGCGCC 360 
AGCAGATCCA ATGGGAGAAC AACGGGCAGG TGTTCAGCTT GCTGAGCCTG GGCTCACAGT 420 
ACCAGCCTCA GCGCCGCCGG GACCCGGGCG CCGCCGTCCC TGGTGCAGCC AACGCCTCCG 480 
CCCAGCAGCC CCGCACTCCG ATCCTGCTGA TCCGCGACAA CCGCACCGCC GCGGGGCGAA 540 
CGCGGACGGC CGGCTCATCT GGAGTCACCG CTGGCCGCCC CAGGCCCACC GCCCGTCACT 600 
GGTTCCAAGC TGGCTACTCG ACATCTAGAG CCCGCGAAGC TGGGCCCTCG CGCGCGGAGA 660 
ACCAGACAGC GCCGGGAGAA GTTCCTGCTC TCAGTAACCT GCGGCCGCCC AGCCGCGTGG 720 
ACGGCATGGT GGGCGACGAC CCTTACAACC CCTACAAGTA CTCTGACGAC AACCCTTATT 780 
ACAACTACTA CGATACTTAT GAAAGGCCCA GACCTGGGGG CAGGTACCGG CCCGGATACG 840 
GCACTGGCTA CTTCCAGTAC GGTCTCCCAG ACCTGGTGGC CGACCCCTAC TACATCCAGG 900 
CGTCCACGTA CGTGCAGAAG ATGTCCATGT ACAACCTGAG ATGCGCGGCG GAGGAAAACT 960 
GTCTGGCCAG TACAGCATAC AGGGCAGATG TCAGAGATTA TGATCACAGG GTGCTGCTCA 1020 
GATTTCCCCA AAGAGTGAAA AACCAAGGGA CATCAGATTT CTTACCCAGC CGACCAAGAT 1080 
ATTCCTGGGA ATGGCACAGT TGTCATCAAC ATTACCACAG TATGGATGAG TTTAGCCACT 1140 
TGTACCTGCT TGATGCCAAC ACCCAGAGGA GATGGGCTGA AGGCCACAAA GCAAGTTTCT 1200 
GTCTTGAAGA CACATCCTGT GACTATGGCT ACCACAGGCG ATTTGCATGT ACTGCACACA 1260 
CACAGGGATT GAGTCCTGGC TGTTATGATA CCTATGGTGC AGACATAGAC TGCCAGTGGA 1320 
TTGATATTAC AGATGTAAAA CCTGGAAACT ATATCCTAAA GGTCAGTGTA AACCCCAGCT 1380 
ACCTGGTTCC TGAATCTGAC TATACCAACA ATGTTGTGCG CTGTGACATT CGCTACACAG 1440 
GACATCATGC GTATGCCTCA GGCTGCACAA TTTCACCGTA TTAGAAGGCA AAGCAAAACT 1500 
CCCAATGGAT AAATCAGTGC CTGGTGTTCT GAAGTGGGAA AAAATAGACT AACTTCAGTA 1560 
GGATTTATGT ATTTTGAAAA AGAGAACAGA AAACAACAAA AGAATTTTTG TTTGGACTGT 1620 
TTTCAATAAC AAAGCACATA ACTGGATTTT GAACGCTTAA GTCATCATTA CTTGGGAAAT 1680 
TTTTAATGTT TATTATTTAC ATCACTTTGT GAATTAACAC AGTGTTTCAA TTCTGTAATT 1740 
ACATATTTGA CTCTTTCAAA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 86 Protein sequence: 
Protein Accession #: NP_002308.1 

1 11 21 31 41 51 

I I I I I I 

MRFAWTVLLL GPLQLCALVH CAPPAAGQQQ PPREPPAAPG AWRQQIQWEN NGQVFSLLSL 60 
GSQYQPQRRR DPGAAVPGAA NASAQQPRTP ILLIRDNRTA AGRTRTAGSS GVTAGRPRPT 120 
ARHWFQAGYS TSRAREAGPS RAENQTAPGE VPALSNLRPP SRVDGMVGDD PYNPYKYSDD 180 
NPYYNYYDTY ERPRPGGRYR PGYGTGYFQY GLPDLVADPY YIQASTYVQK MSMYNLRCAA 240 
EENCLASTAY RADVRDYDHR VLLRFPQRVK NQGTSDFLPS RPRYSWEWHS CHQHYHSMDE 300 
FSHLYLLDAN TQRRWAEGHK ASFCLEDTSC DYGYHRRFAC TAHTQGLSPG CYDTYGADID 360 
CQWIDITDVK PGNYILKVSV NPSYLVPESD YTNNVVRCDI RYTGHHAYAS GCTISPY 

Seq ID NO: 87 DNA sequence 

Nucleic Acid Accession #: NM_006419.1 

Coding sequence: 91-420 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

TTCGGCACTT GGGAGAAGAT GTTTGAAAAA ACTGACTCTG CTAATGAGCC TGGACTCAGA 60 
GCTCAAGTCT GAACTCTACC TCCAGACAGA ATGAAGTTCA TCTCGACATC TCTGCTTCTC 120 
ATGCTGCTGG TCAGCAGCCT CTCTCCAGTC CAAGGTGTTC TGGAGGTCTA TTACACAAGC 180 
TTGAGGTGTA GATGTGTOCA AGAGAGCTCA GTCTTTATCC CTAGACGCTT CATTGATCGA 240 
ATTCAAATCT TGCCCCGTGG GAATGGTTGT CCAAGAAAAG AAATCATAGT CTGGAAGAAG 300 
AACAAGTCAA TTGTGTGTGT GGACCCTCAA GCTGAATGGA TACAAAGAAT GATGGAAGTA 360 
TTGAGAAAAA GAAGTTCTTC AACTCTACCA GTTCCAGTGT TTAAGAGAAA GATTCCCTGA 420 
TGCTGATATT TCCACTAAGA ACACCTGCAT TCTTCCCTTA TCCCTOCTCT GGATTTTAGT 480 
TTTGTGCTTA GTTAAATCTT TTCCAGGGAG AAAGAACTTC CCCATACAAA TAAGGCATGA 540 
GGACTATGTG AAAAATAACC TTGCAGGAGC TGATGGGGCA AACTCAAGCT TCTTCACTCA 600 
CAGCACCCTA TATACACTTG GAGTTTGCAT TCTTATTCAT CAGGGAGGAA AGTTTCTTTG 660 
AAAATAOTTA TTCAGTTATA AGTAATACAG GATTATTTTG ATTATATACT TGTTGTTTAA 720 
TGTTTAAAAT TTCTTAGAAA ACAATGGAAT GAGAATTTAA GCCTCAAATT TGAACATGTG 780 
GCTTGAATTA AGAAGAAAAT TATGGCATAT ATTAAAAGCA GGCTTCTATG AAAGACTCAA 840 
AAAGCTGCCT GGGAGGCAGA TGGAACTTGA GCCTGTCAAG AGGCAAAGGA ATCCATGTAG 900 
TAGATATCCT CTGCTTAAAA ACTCACTACG GAGGAGAATT AAGTCCTACT TTTAAAGAAT 960 
TTCTTTATAA AATTTACTGT CTAAGATTAA TAGCATTCGA AGATCCCCAG ACTTCATAGA 1020 
ATACTCAGGG AAAGCATTTA AAGGGTGATG TACACATGTA TCCTTTCACA CATTTGCCTT 1080 
GACAAACTTC TTTCACTCAC ATCTTTTTCA CTGACTTTTT TTGTGGGGGC GGGGCCGGGG 1140 
GGACTCTGGT ATCTAATTCT TTAATGATTC CTATAAATCT AATGACATTC AATAAAGTTG 1200 
AGCAAACATT TTACTT 

Seq ID NO: 88 Protein sequence: 
Protein Accession #: NP_006410.1 

1 11 21 31 41 51 
I I I I I I 

MKFISTSLLL MLLVSSLSPV QGVLEVYYTS LRCRCVQESS VFIPRRFIDR IQILPRGNGC 60 
PRKEIIVWKK NKSIVCVDPQ AEWIQRMMEV LRKRSSSTLP VPVFKRKIP 

Seq ID NO: 89 DNA sequence 

Nucleic Acid Accession #: NM_002652 

Coding sequence: 37-477 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CTTCTCTGGG ACACATTGCC TTCTGTTTTC TCCAGCATGC GCTTGCTCCA GCTCCTGTTC 60 
AGGGCCAGCC CTGCCACCCT GCTCCTGGTT CTCTGCCTGC AGTTGGGGGC CAACAAAGCT 120 
CAGGACAACA CTCGGAAGAT CATAATAAAG AATTTTGACA TTCCCAAGTC AGTACGTCCA 180 
AATGACGAAG TCACTGCAGT GCTTGCAGTT CAAACAGAAT TGAAAGAATG CATGGTGGTT 240 
AAAACTTACC TCATTAGCAG CATCCCTCTA CAAGGTGCAT TTAACTATAA GTATACTGCC 300 
TGCCTATGTG ACGACAATCC AAAAACCTTC TACTGGGACT TTTACACCAA CAGAACTGTG 360 
CAAATTGCAG CCGTCGTTGA TGTTATTCGG GAATTAGGCA TCTGCCCTGA TGATGCTGCT 420 
GTAATCCCCA TCAAAAACAA CCGGTTTTAT ACTATTGAAA TCCTAAAGGT AGAATAATGG 480 
AAGCCCTGTC TGTTTGCCAC ACCCAGGTGA TTTCCTCTAA AGAAACTTGG CTGGAATTTC 540 
TGCTGTGGTC TATAAAATAA ACTTCTTAAC ATGCTT 

Seq ID NO: 90 Protein sequence: 
Protein Accession #: NP_002643.1 

1 11 21 31 41 51 

I I I I I I 

MRLLQLLFRA SPATLLLVLC LQLGANKAQD NTRKIIIKNF DIPKSVRPND EVTAVLAVQT 60 
ELKECMVVKT YLISSIPLQG AFNYKYTACL CDDNPKTFYW DFYTNRTVQI AAVVDVIREL 120 
GICPDDAAVI PIKNNRFYTI EILKV 

Seq ID NO: 91 DNA sequence 

Nucleic Acid Accession #: AK000341 

Coding sequence: 85-975 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GATAGCGCCG GGCAGAGGGA CCCGGCTACC CTGGACAGCG CATCGCCGCC CGCCCGQGTC 60 
GCCGCGCCAC AGCCGCTGCG GATCATGGAA CATCTAAAGG CCTTTGATGA TGAAATCAAT 120 
GCTTTTCTGG ACAATATGTT TGGACCGCGA GATTCTCGAG TCAGAGGGTG GTTCACGTTG 180 
GACTCTTACC TTCCTACCTT TTTTCTTACT GTCATGTATC TGCTCTCAAT ATGGCTGGGT 240 
AACAAGTATA TGAAGAACAG ACCTGCTCTT TCTCTCAGGG GTATCCTCAC CTTGTATAAT 300 
CTTGGAATCA CACTTCTCTC CGCGTACATG CTGGCAGAGC TCATTCTCTC CACTTGGGAA 360 
GGAGGCTACA ACTTACAGTG TCAAGATCTT ACCAGCGCAG GGGAAGCTGA CATCCGGGTA 420 
GCCAAGGTGC TTTGGTGGTA CTATTTCTCC AAATCAGTAG AGTTCCTGGA CACAATTTTC 480 
TTCGTTTTGC GGAAAAAAAC GAGTCAGATT ACTTTTCTTC ATGTATATCA TCATGCTTCT 540 
ATGTTTAACA TCTGGTGGTG TGTCTTGAAC TGGATACCTT GTGGACAAAG TTTCTTTGGA 600 
CCAACACTGA ACAGTTTTGT CCACATTCTT ATGTACTCCT ACTATGGACT TTCTGTGTTT 660 
CCATCTATGC ACAAGTATCT TTGGTGGAAG AAATATCTCA CACAGGCTCA GCTGGTGCAG 720 
TTCGTGCTCA CCATCACGCA CACCATGAGC GCCGTCGTGA AACCGTGTCG CTTCCCCTTC 780 
GGTTGTCTCA TCTTCCAGTC ATCTTATATG CTAACGTTAG TCATCCTCTT CTTAAATTTT 840 
TATGTTCAGA CATACCGAAA AAAGCCAATG AAGAAAGATA TGCAAGAGCC ACCTGCAGGG 900 
AAAGAAGTGA AGAATGGTTT TTCCAAAGCC TACTTCACTG CAGCAAATGG AGTGATGAAC 960 
AAGAAAGCAC AATAAAAATG AGTAACAGAA AAAGCACATA TACTAGCCTA ACAGATTGGC 1020 
TTGTTTTAAA GCAAAGACTG AATTGAAGGT TACATGTTTT AGGATAAACT AATTTCTTTT 1080 
GAGTTCATAA ATCATTTGTA CCCAGAATGT ATTAATATAT TGCTATTAGG TTAATCTOTT 1140 
AACTGAATGC TTTGATCAGC ATTGAGGTGA TGCTCACCTC CGAGGACCTC AGAACTGGTG 1200 
CAGCTTCTCT CTCCCTCCCT CCCACAGACT GAACCTTTCG CCAGAAGCTG TCCTTATAAC 1260 
GCCTTATACG CATACACAGC CAGGAAACGT GGAGCATTGT TTCTCACAGA GAGTCTCCAA 1320 
ATAAAAAGGG TTTTGTTCAG ATTAAAATGT TTACAACAAA ATGTTAATTA TATTCTAAAT 1380 
ACAGGGTATG TTCTAATCTA TATTAAGCAA TAATGCCAGT GCATAATCAT TCCATTTGTT 1440 
CCTTTAGCAA TCAACCCCAG AAAATATTAA AATGGGATCA TACACAGAAG ATAGAAAAAT 1500 
CTAGCAAAAC TTCTCTTTCT GTAAGCCAGA GTCTTGTCTA TCAGATTCCC ACAACCACTC 1560 
CTGATTCTAA ATTTAGTGAT ATGGTAATGA AATTGGTATT TATTTTAAAT ATTAGTTATT 1620 
CTAAGGAGAA AAAAATGCTT CTGCAAGATT TTCATAATTC AGGGGCTGTG GATAGGATTG 1680 
TTCCTCTGTT TCCCTAATCA TTCATCTGTT CATGTCTCCC TCTTGTGCCA GTCAGCCTAG 1740 
GTTATACAGA TGCCATGCTC CACACCACGA GCAGTGTACA AATCTGGCTG CCCGTTTACT 1800 
TTCTGAGCAA GCACTGGAGT CCACTCCGAC CTTTTTCTTT GAACATGCAT GCTGCTGGAA 1860 
TATGTATAAA TCAGAACTAG CAGAAGTAGC AGAGTGATGG GAGCAAAATA GGCACTGAAT 1920 
TCGTCAACTC TTTTTTGTGA GCCTACTTGT GAATATTACC TCAGATACCT GTTGTCACTC 1980 
TTCACAGGTT ATTTAAGTTC TTGAAGCTGG GAGGAAAAAG ATGGAGTAGC TTGGAAAGAT 2040 
TCCAGCACTG AGCCGTGAGC CGGTCATGAG CCACGATAAA AAATGCCAGT TTGGCAAACT 2100 
CAGCACTCCT GTTCCCTGCT CAGGTATATG CGATCTCTAC TGAGAAGCAA GCACAAAAGT 2160 
AGACCAAAGT ATTAATGAGT ATTTCCTTTC TCCATAAGTG CAGGACTGTT ACTCACTACT 2220 
AAACTCTACC AAGAATGGAA ACCAAGAATA TTTTCTGAAG ATTTTTTTGA AGATTAATTT 2280 
ATACCCTATA AAATAAAACT TGTTAGCTTC GATGAAGTCA AAAAAAAAAA AAAAAAAAAA 

Seq ID NO: 92 Protein sequence: 
Protein Accession #: BAA91096.1 

1 11 21 31 41 51 

I I I I I I 

MEHLKAFDDE INAFLDNMFG PRDSRVRGWF TLDSYLPTFF LTVMYLLSIW LGNKYMKNRP 60 
ALSLRGILTL YNLGITLLSA YMLAELILST WEGGYNLQCQ DLTSAGEADI RVAKVLWWYY 120 
FSKSVEFLDT IFFVLRKKTS QITFLHVYHH ASMFNIWWCV LNWIPCGQSF FGPTLNSFVH 180 
ILMYSYYGLS VFPSMHKYLW WKKYLTQAQL VQFVLTITHT MSAVVKPCGF PFGCLIFQSS 240 
YMLTLVILFL NFYVQTYRKK PMKKDMQEPP AGKEVKNGFS KAYFTAANGV MNKKAQ 

Seq ID NO: 93 DNA sequence 

Nucleic Acid Accession #: NM_000044 

Coding sequence: 1115-3874 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CGAGATCCCG GGGAGCCAGC TTGCTGGGAG AGCGGGACGG TCCGGAGCAA GCCCACAGGC 60 
AGAGGAGGCG ACAGAGGGAA AAAGGGCCGA GCTAGCCGCT CCAGTGCTGT ACAGGAGCCG 120 
AAGGGACGCA CCACGCCAGC CCCAGCCCGG CTCCAGCGAC AGCCAACGCC TCTTGCAGCG 180 
CGGCGGCTTC GAAGCCGCCG CCCGGAGCTG CCCTTTCCTC TTCGGTGAAG TTTTTAAAAG 240 
CTGCTAAAGA CTCGGAGGAA GCAAGGAAAG TGCCTGGTAG GACTGACGGC TGCCTTTGTC 300 
CTCCTCCTCT CCACCCCGCC TCCCCCCACC CTGCCTTCCC CCCCTCCCCC GTCTTCTCTC 360 
CCGCAGCTGC CTCAGTCGGC TACTCTCAGC CAACCCCCCT CACCACCCTT CTCCCCACCC 420 
GCCCCCCCGC CCCCGTCGGC CCAGCGCTGC CAGCCCGAGT TTGCAGAGAG GTAACTCCCT 480 
TTGGCTGCGA GCGGGCGAGC TAGCTGCACA TTGCAAAGAA GGCTCTTAGG AGCCAGGCGA 540 
CTGGGGAGCG GCTTCAGCAC TGCAGCCACG ACCCGCCTGG TTAGAATTCC GGCGGAGAGA 600 
ACCCTCTGTT TTCCCCCACT CTCTCTCCAC CTCCTCCTGC CTTCCCCACC CCGAGTGCGG 660 
AGCAGAGATC AAAAGATGAA AAGGCAGTCA GGTCTTCAGT AGCCAAAAAA CAAAACAAAC 720 
AAAAACAAAA AAGCCGAAAT AAAAGAAAAA GATAATAACT CAGTTCTTAT TTGCACCTAC 780 
TTCAGTGGAC ACTGAATTTG GAAGGTGGAG GATTTTGTTT TTTTCTTTTA AGATCTGGGC 840 
ATCTTTTGAA TCTACCCTTC AAGTATTAAG AGACAGACTG TGAGCCTAGC AGGGCAGATC 900 
TTGTCCACCG TGTGTCTTCT TCTGCACGAG ACTTTGAGGC TGTCAGAGCG CTTTTTGCGT 960 
GGTTGCTCCC GCAAGTTTCC TTCTCTGGAG CTTCCCGCAG GTGGGCAGCT AGCTGCAGCG 1020 
ACTACCGCAT CATCACAGCC TGTTGAACTC TTCTGAGCAA GAGAAGGGGA GGCGGGGTAA 1080 
GGGAAGTAGG TGGAAGATTC AGCCAAGCTC AAGGATGGAA GTGCAGTTAG GGCTGGGAAG 1140 
GGTCTACCCT CGGCCGCCGT CCAAGACCTA CCGAGGAGCT TTCCAGAATC TGTTCCAGAG 1200 
CGTGCGCGAA GTGATCCAGA ACCCGGGCCC CAGGCACCCA GAGGCCGCGA GCGCAGCACC 1260 
TCCCGGCGCC AGTTTGCTGC TGCTGCAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA 1320 
GCAGCAGCAG CAGCAGCAGC AGCAGCAAGA GACTAGCCCC AGGCAGCAGC AGCAGCAGCA 1380 
GGGTGAGGAT GGTTCTCCCC AAGCCCATCG TAGAGGCCCC ACAGGCTACC TGGTCCTGGA 1440 
TGAGGAACAG CAACCTTCAC AGCCGCAGTC GGCCCTGGAG TGCCACCCCG AGAGAGGTTG 1500 
CGTCCCAGAG CCTGGAGCCG CCGTGGCCGC CAGCAAGGGG CTGCCGCAGC AGCTGCCAGC 1560 
ACCTCCGGAC GAGGATGACT CAGCTGCCCC ATCCACGTTG TCCCTGCTGG GCCCCACTTT 1620 
CCCCGGCTTA AGCAGCTGCT CCGCTGACCT TAAAGACATC CTGAGCGAGG CCAGCACCAT 1680 
GCAACTCCTT CAGCAACAGC AGCAGGAAGC AGTATCCGAA GGCAGCAGCA CCGGGAGAGC 1740 
GAGGGAGGCC TCGGGGGCTC CCACTTCCTC CAAGGACAAT TACTTAGGGG GCACTTCGAC 1800 
CATTTCTGAC AACGCCAAGG AGTTGTGTAA GGCAGTGTCG GTGTCCATGG GCCTGGGTGT 1860 
GGAGGCGTTG GAGCATCTGA GTCCAGGGGA ACAGCTTCGG GGGGATTGCA TGTACGCCCC 1920 
ACTTTTGGGA GTTCCACCCG CTGTGCGTCC CACTCCTTGT GCCCCATTGG CCGAATGCAA 1980 
AGGTTCTCTG CTAGACGACA GCGCAGGCAA GAGCACTGAA GATACTGCTG AGTATTCCCC 2040 
TTTCAAGGGA GGTTACACCA AAGGGCTAGA AGGCGAGAGC CTAGGCTGCT CTGGCAGCGC 2100 
TGCAGCAGGG AGCTCCGGGA CACTTGAACT GCCGTCTACC CTGTCTCTCT ACAAGTCCGG 2160 
AGCACTGGAC GAGGCAGCTG CGTACCAGAG TCGOGACTAC TACAACTTTC CACTGGCTCT 2220 
GGCCGGACCG CCGCCCCCTC CGCCGCCTCC CCATCCCCAC GCTCGCATCA AGCTGGAGAA 2280 
CCCGCTGGAC TACGGCAGCG CCTGGGCGGC TGCGGCGGCG CAGTGCCGCT ATGGGGACCT 2340 
GGCOAGCCTG CATGGCGCGG GTGCAGCGGG ACCCGGTTCT GGGTCACCCT CAGCCGCCGC 2400 
TTCCTCATCC TGGCACACTC TCTTCACAGC CGAAGAAGGC CAGTTGTATG GACCGTGTGG 2460 
TGGTGGTGGG GGTGGTGGCG GCGGCGGCGG CGGCGGCGGC GGCGGCGGCG GCGGCGGCGG 2520 
CGGCGGCGGC GAGGCGGGAG CTGTAGCCCC CTACGGCTAC ACTCGGCCCC CTCAGGGGCT 2580 
GGCGGGCCAG GAAAGCGACT TCACCGCACC TGATGTGTGG TACCCTGGCG GCATGGTGAG 2640 
CAGAGTGCCC TATCCCAGTC CCACTTGTGT CAAAAGCGAA ATGGGCCCCT GGATGGATAG 2700 
CTACTCCGGA CCTTACGGGG ACATGCGTTT GGAGACTGCC AGGGACCATG TTTTGCCCAT 2760 
TGACTATTAC TTTCCACCCC AGAAGACCTG CCTGATCTGT GGAGATGAAG CTTCTGGGTG 2820 
TCACTATGGA GCTCTCACAT GTGGAAGCTG CAAGGTCTTC TTCAAAAGAG CCGCTGAAGG 2880 
GAAACAGAAG TACCTGTGCG CCAGCAGAAA TGATTGCACT ATTGATAAAT TCCGAAGGAA 2940 
AAATTGTCCA TCTTGTCGTC TTCGGAAATG TTATGAAGCA GGGATGACTC TGGGAGCCCG 3000 
GAAGCTGAAG AAACTTGGTA ATCTGAAACT ACAGGAGGAA GGAGAGGCTT CCAGCACCAC 3060 
CAGCCCCACT GAGGAGACAA CCCAGAAGCT GACAGTGTCA CACATTGAAG GCTATGAATG 3120 
TCAGCCCATC TTTCTGAATG TCCTGGAAGC CATTGAGCCA GGTGTAGTGT GTGCTGGACA 3180 
CGACAACAAC CAGCCCGACT CCTTTGCAGC CTTGCTCTCT AGCCTCAATG AACTGGGAGA 3240 
GAGACAGCTT GTACACGTGG TCAAGTGGGC CAAGGCCTTG CCTGGCTTCC GCAACTTACA 3300 
CGTGGACGAC CAGATGGCTG TCATTCAGTA CTCCTGGATG GGGCTCATGG TGTTTGCCAT 3360 
GGGCTGGCGA TCCTTCACCA ATGTCAACTC CAGGATGCTC TACTTCGCCC CTGATCTGGT 3420 
TTTCAATGAG TACCGCATGC ACAAGTCCCG GATGTACAGC CAGTGTGTCC GAATGAGGCA 3480 
CCTCTCTCAA GAGTTTGGAT GGCTCCAAAT CACCCCCCAG GAATTCCTGT GCATGAAAGC 3540 
ACTGCTACTC TTCAGCATTA TTCCAGTGGA TGGGCTGAAA AATCAAAAAT TCTTTGATGA 3600 
ACTTCGAATG AACTACATCA AGGAACTCGA TCGTATCATT GCATGCAAAA GAAAAAATCC 3660 
CACATCCTGC TCAAGACGCT TCTACCAGCT CACCAAGCTC CTGGACTCCG TGCAGCCTAT 3720 
TGCGAGAGAG CTGCATCAGT TCACTTTTGA CCTGCTAATC AAGTCACACA TGGTGAGCGT 3780 
GGACTTTCCG GAAATGATGG CAGAGATCAT CTCTGTGCAA GTGCCCAAGA TCCTTTCTGG 3840 
GAAAGTCAAG CCCATCTATT TCCACACCCA GTGAAGCATT GGAAACCCTA TTTCCCCACC 3900 
CCAGCTCATG CCCCCTTTCA GATGTCTTCT GCCTGTTATA ACTCTGCACT ACTCCTCTGC 3960 
AGTGCCTTGG GGAATTTCCT CTATTGATGT ACAGTCTGTC ATGAACATGT TCCTGAATTC 4020 
TATTTGCTGG GCTTTTTTTT TCTCTTTCTC TCCTTTCTTT TTCTTCTTCC CTCCCTATCT 4080 
AACCCTCCCA TGGCACCTTC AGACTTTGCT TCCCATTGTG GCTCCTATCT GTGTTTTGAA 4140 
TGGTGTTGTA TGCCTTTAAA TCTGTGATGA TCCTCATATG GCCCAGTGTC AAGTTGTGCT 4200 
TGTTTACAGC ACTACTCTGT GCCAGCCACA CAAACGTTTA CTTATCTTAT GCCACGGGAA 4260 
GTTTAGAGAG CTAAGATTAT CTGGGGAAAT CAAAACAAAA AACAAGCAAA CAAAAAAAAA 4320 
A 

Seq ID NO: 94 Protein sequence: 
Protein Accession #: NP_000035.1 

1 11 21 31 41 51 

I I I I I I 

MEVQLGLGRV YPRPPSKTYR GAFQNLFQSV REVIQNPGPR HPEAASAAPP GASLLLLQQQ 60 
QQQQQQQQQQ QQQQQQQQET SPRQQQQQQG EDGSPQAHRR GPTGYLVLDE EQQPSQPQSA 120 
LECHPERGCV PEPGAAVAAS KGLPQQLPAP PDEDDSAAPS TLSLLGPTFP GLSSCSADLK 180 
DILSEASTMQ LLQQQQQEAV SEGSSSGRAR EASGAPTSSK DNYLGGTSTI SDNAKELCKA 240 
VSVSMGLGVE ALEHLSPGEQ LRGDCMYAPL LGVPPAVRPT PCAPLAECKG SLLDDSAGKS 300 
TEDTAEYSPF KGGYTKGLEG ESLGCSGSAA AGSSGTLELP STLSLYKSGA LDEAAAYQSR 360 
DYYNFPLALA GPPPPPPPPH PHARIKLENP LDYGSAWAAA AAQCRYGDLA SLHGAGAAGP 420 
GSGSPSAAAS SSWHTLFTAE EGQLYGPCGG GGGGGGGGGG GGGGGGGGGG GGEAGAVAPY 480 
GYTRPPQGLA GQESDFTAPD VWYPGGMVSR VPYPSPTCVK SEMGPWMDSY SGPYGDMRLE 540 
TARDHVLPID YYFPPQKTCL ICGDEASGCH YGALTCGSCK VFFKRAAEGK QKYLCASRND 600 
CTIDKFRRKN CPSCRLRKCY EAGMTLGARK LKKLGNLKLQ EEGEASSTTS PTEETTQKLT 660 
VSHIEGYECQ PIFLNVLEAI EPGVVCAGHD NNQPDSFAAL LSSLNELGER QLVHVVKWAK 720 
ALPGFRNLHV DDQMAVIQYS WMGLMVFAMG WRSFTNVNSR MLYFAPDLVF NEYRMHKSRM 780 
YSQCVRMRHL SQEFGWLQIT PQEFLCMKAL LLFSIIPVDG LKNQKFFDEL RMNYIKELDR 840 
IIACKRKNPT SCSRRFYQLT KLLDSVQPIA RELHQFTFDL LIKSHMVSVD FPEMMAEIIS 900 
VQVPKILSGK VKPIYFHTQ 

Seq ID NO: 95 DNA sequence 

Nucleic Acid Accession #: NM_002497 

Coding sequence: 135-1472 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

GGCACGAGTA GGGGTGGCGG GTCAGTGCTG CTCGGGGGCT TCTCCATCCA GGTCCCTGGA 60 
GTTCCTGGTC CCTGGAGCTC CGCACTTGGC GCGCAACCTG CGTGAGGCAG CGCGACTCTG 120 
GCGACTGGCC GGCCATGCCT TCCCGGGCTG AGGACTATGA AGTGTTGTAC ACCATTGGCA 180 
CAGGCTCCTA CGGCCGCTGC CAGAAGATCC GGAGGAAGAG TGATGGCAAG ATATTAGTTT 240 
GGAAAGAACT TGACTATGGC TCCATGACAG AAGCTGAGAA ACAGATGCTT GTTTCTGAAG 300 
TGAATTTGCT TCGTGAACTG AAACATCCAA ACATCGTTCG TTACTATGAT CGGATTATTG 360 
ACCGGACCAA TACAACACTG TACATTGTAA TGGAATATTG TGAAGGAGGG GATCTGGCTA 420 
GTGTAATTAC AAAGGGAACC AAGGAAAGGC AATACTTAGA TGAAGAGTTT GTTCTTCGAG 480 
TGATGACTCA GTTGACTCTG GCCCTGAAGG AATGCCACAG ACGAAGTGAT GGTGGTCATA 540 
CCGTATTGCA TCGGGATCTT AAACCAGCCA ATGTTTTCCT GGATGGCAAG CAAAACGTCA 600 
AGCTTGGAGA CTTTGGGCTA GCTAGAATAT TAAACCATGA CACGAGTTTT GCAAAAACAT 660 
TTGTTGGCAC ACCTTATTAC ATGTCTCCTG AACAAATGAA TCGCATGTCC TACAATGAGA 720 
AATCAGATAT CTGGTCATTG GGCTGCTTGC TGTATGAGTT ATGTGCATTA ATGCCTCCAT 780 
TTACAGCTTT TAGCCAGAAA GAACTCGCTG GGAAAATCAG AGAAGGCAAA TTCAGGCGAA 840 
TTCCATACCG TTACTCTGAT GAATTGAATG AAATTATTAC GAGGATGTTA AACTTAAAGG 900 
ATTACCATCG ACCTTCTGTT GAAGAAATTC TTGAGAACCC TTTAATAGCA GATTTGGTTG 960 
CAGACGAGCA AAGAAGAAAT CTTGAGAGAA GAGGGCGACA ATTAGGAGAG CCAGAAAAAT 1020 
CGCAGGATTC CAGCCCTGTA TTGAGTGAGC TGAAACTGAA GGAAATTCAG TTACAGGAGC 1080 
GAGAGCGAGC TCTCAAAGCA AGAGAAGAAA GATTGGAGCA GAAAGAACAG GAGCTTTGTG 1140 
TTCGTGAGAG ACTAGCAGAG GACAAACTGG CTAGAGCAGA AAATCTGTTG AAGAACTACA 1200 
GCTTGCTAAA GGAACGGAAG TTCCTGTCTC TGGCAAGTAA TCCAGAACTT CTTAATCTTC 1260 
CATCCTCAGT AATTAAGAAG AAAGTTCATT TCAGTGGGGA AAGTAAAGAG AACATCATGA 1320 
GGAGTGAGAA TTCTGAGAGT CAGCTCACAT CTAAGTCCAA GTGCAAGGAC CTGAAGAAAA 1380 
GGCTTCACGC TGCCCAGCTG CGGGCTCAAG CCCTGTCAGA TATTGAGAAA AATTACCAAC 1440 
TGAAAAGCAG ACAGATCCTG GGCATGCGCT AGCCAGOTAG AGAGACACAG AGCTGTGTAC 1500 
AGGATGTAAT ATTACCAACC TTTAAAGACT GATATTCAAA TGCTGTAGTG TTGAATACTT 1560 
GGCCCCATGA GCCATGCCTT TCTGTATAGT ACACATGATA TTTOGGAATT GGTTTTACTG 1620 
TTCTTCAGCA ACTATTGTAC AAAATGTTCA CATTTAATTT TTCTTTCTTC TTTTAAGAAC 1680 
ATATTATAAA AAGAATACTT TCTTGGTTGG GCTTTTAATC CTGTGTGTGA TTACTAGTAG 1740 
GAACATGAGA TGTGACATTC TAAATCTTGG GAGAAAAAAT AATATTAGGA AAAAAATATT 1800 
TATGCAGGAA GAGTAGCACT CACTGAATAG TTTTAAATGA CTGAGTOGTA TGCTTACAAT 1860 
TGTCATGTCT AGATTTAAAT TTTAAGTCTG AGATTTTAAA TGTTTTTGAG CTTAGAAAAC 1920 
CCAGTTAGAT GCAATTTGGT CATTAATACC ATGACATCTT GCTTATAAAT ATTCCATTGC 1980 
TCTGTAGTTC AAATCTGTTA GCTTTGTGAA AATTCATCAC TGTGATGTTT GTATTCTTTT 2040 
TTTTTTTCTG TTTAACAGAA TATGAGCTGT CTGTCATTTA CCTACTTCTT TCCCACTAAA 2100 
TAAAAGAATT CTTCAGTTA 

Seq ID NO: 96 Protein sequence: 
Protein Accession #: NP_002488 

1 11 21 31 41 51 
I I I I I I 

GIREFNPNIM ANEVERINMI TOSISGENEA RELATEDKIN ASEHOMOSAP IENSMPSRAE 60 
DYEVLYTIGT GSYGRCQKIR RKSDGKILVW KELDYGSMTE AEKQMLVSEV NLLRELKHPN 120 
IVRYYDRIID RTNTTLYIVM EYCEGGDLAS VITKGTKERQ YLDEEFVLRV MTQLTLALKE 180 
CHRRSDGGHT VLHRDLKPAN VFLDGKQNVK LGDFGLARIL NHDTSFAKTF VGTPYYMSPE 240 
QMNRMSYNEK SDIWSLGCLL YELCALMPPF TAFSQKELAG KIREGKFRRI PYRYSDELNE 300 
IITRMLNLKD YHRPSVEEIL ENPLIADLVA DEQRRNLERR GRQLGEPEKS QDSSPVLSEL 360 
KLKEIQLQER ERALKAREER LEQKEQELCV RERLAEDKLA RAENLLKNYS LLKERKFLSL 420 
ASNPELLNLP SSVIKKKVHF SGESKENIMR SENSESQLTS KSKCKDLKKR LHAAQLRAQA 480 
LSDIEKNYQL KSRQILGMR 

Seq ID NO: 97 DNA sequence 

Nucleic Acid Accession #: NM_007050.2 

Coding sequence: 185-4576 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CCTCCCGCCT CAGTTCGCGC CGCGCCTCGG CTTGGAACGC AGGAGCGCOG GCTCCGGGAG 60 
CCCG AGCGG A GCCAGCCGCG CGCACAGCCA GCGGCCGCGC CGGCGATGCG GGGCCACCCC 120 
GCGCCCGCCC CAGTCCCGGC CCCGGCCCCC GCGGGAAGGG GCTGAGCTGC CCGCCGCCGC 180 
CCGGATGGCG AGCCTCGCCG CGCTCGCCCT CAGCCTGCTC CTGAGGCTGC AGCTGCCGCC 240 
ACTGCCCGGC GCCCGGGCTC AGAGCGCCCC AGGTGGCTGT TCCTTTGATG AGCACTACAG 300 
CAACTGTGGT TATAGTGTGG CTCTAGGGAC CAATGGGTTC ACCTGGGAGC AGATTAACAC 360 
AACGGAGAAA CCAATGCTGG ACCAGGCAGT GCCCACAGGA TCTTTCATGA TGGTGAACAG 420 
CTCTGGGAGA GCCTCTGGCC AGAAGGCCCA CCTTCTCCTG CCAACCCTGA AGGAGAATGA 480 
CACCCACTGC ATCGACTTCC ATTACTACTT CTCCAGCCGT GACAGGTCCA GCCCAGGGGC 540 
CTTGAACGTC TACGTGAAGG TGAATGGTGG CCCCCAAGGG AACCCTGTGT GGAATGTGTC 600 
CGGGGTCGTC ACTGAGGGCT GGGTGAAGGC AGAGCTCGCC ATCAGCACTT TCTGGCCACA 660 
TTTCTATCAG GTGATATTTG AATCCGTCTC ATTGAAGGGT CATCCTGGCT ACATCGCCGT 720 
GGACGAGGTC CGGGTCCTTG CTCATCCATG CAGAAAAGCA CCTCATTTTC TGCGACTCCA 780 
AAACGTGGAG GTGAATGTGG GGCAGAATGC CACATTTCAG TGCATTGCTG GTGGGAAGTG 840 
GTCTCAGCAT GACAAGCTTT GGCTCCAGCA ATGGAATGGC AGGGACACGG CCCTGATGGT 900 
CACCCGTGTG GTCAACCACA GGCGCTTCTC AGCCACAGTC AGTGTGGCAG ACACTGCCCA 960 
GCGGAGCGTC AGCAAGTACC GCTGTOTGAT CCGCTCTGAT GGTGGGTCTG GTGTGTCCAA 1020 
CTACGCGGAG CTGATCGTGA AAGAGCCTCC CACGOCCATT GCTCCCCCAG AGCTGCTGGC 1080 
TGTGGGGGCC ACATACCTGT GGATCAAGCC AAATGCCAAC TCCATCATCG GGGATGGCCC 1140 
CATCATCCTG AAGGAAGTGG AATATCGCAC CACCACAGGC ACGTGGGCAG AGACCCACAT 1200 
AGTCGACTCT CCCAACTATA AGCTGTGGCA TCTGGACCCC GATGTTGAGT ATGAGATCCG 1260 
AGTGCTCCTC ACACGACCAG GTGAGGGGGG TACGGGACCG CCAGGGGCTC CCCTCACCAC 1320 
CAGGACCAAG TGTGCAGATC CGGTACATGG CCCACAGAAC GTGGAAATCG TAGACATCAG 1380 
AGCCCGGCAG CTGACCCTGC AGTGGGAGCC CTTCGGCTAC GCGGTGACCC GCTGCCATAG 1440 
CTACAACCTC ACCGTGCAGT ACCAGTATGT GTTCAACCAG CAGCAGTAOG AGGCCGAGGA 1500 
GGTCATCCAG ACCTCCTCCC ACTACACCCT GCGAGGCCTG CGCCCCTTCA TGACCATCCG 1560 
GCTGCGACTC TTGCTGTCTA ACCCCGAGGG CCGAATGGAG AGCGAGGAGC TGGTGGTGCA 1620 
GACTGAGGAA GACGTTCCAG GAGCTGTTCC TCTAGAATCC ATCCAAGGGG GGCCCTTTGA 1680 
GGAGAAGATC TACATCCAGT GGAAACCTCC CAATGAGACC AATGGGGTCA TCACGCTCTA 1740 
CGAGATCAAC TACAAGGCTG TCGGCTCGCT GGACCCAAGT GCTGACCTCT CGAGCCAGAG 1800 
GGGGAAAGTG TTCAAGCTCC GGAATGAAAC CCACCACCTC TTTGTGGGTC TGTACCCAGG 1860 
GACCACCTAT TCCTTCACCA TCAAGGCCAG CACAGCAAAG GGCTTTGGGC OCCCTGTCAC 1920 
CACTCGGATT GCCACCAAAA TTTCAGCTCC ATCCATGCCT GAGTACGACA CAGACACCCC 1980 
ATTGAATGAG ACAGACACGA CCATCACAGT GATGCTGAAA CCCGCTCAGT CCCGGGGAGC 2040 
TCCTGTCAGT GTTTATCAGC TGGTTGTCAA GGAGGAGCGA CTTCAGAAGT CACGGAGGGC 2100 
AGCTGACATT ATTGAGTGCT TTTCGGTGCC CGTGAGCTAT CGGAATGCCT CCAGCCTCGA 2160 
TTCTCTACAC TACTTTGCTG CTGAGTTGAA GCCTGCCAAC CTGCCTGTCA CCCAGCCATT 2220 
TACAGTGGGT GACAATAAGA CATACAATGG CTACTGGAAC CCTCCTCTCT CTCCCCTGAA 2280 
AAGCTACAGC ATCTACTTCC AGGCACTCAG CAAAGCCAAT GGAGAGACCA AAATCAACTG 2340 
TGTTCGTCTG GCTACAAAAG CACCAATGGG CAGCGCCCAG GTGACCCCGG GGACTCCACT 2400 
CTGCCTCCTC ACCACAGGTG CCTCCACCCA GAATTCTAAC ACTGTGGAGC CAGAGAAGCA 2460 
GGTGGACAAC ACCGTGAAGA TGGCTGGCGT GATCGCTGGC CTCCTCATGT TCATCATCAT 2520 
TCTCCTGGGC GTGATGCTCA CCATCAAAAG GAGAAGAAAT GCTTATTCCT ACTCGTATTA 2580 
CTTGTCCCAA AGGAAGCTGG CCAAGAAGCA GAAGGAGACC CAGAGTGGAG CCCAGAGGGA 2640 
GATGGGGCCT GTGGCCTCTG CCGACAAACC CACCACCAAG CTCAGCGCCA GCCGCAATGA 2700 
TGAAGGCTTC TCTTCTAGTT CTCAGGACGT CAACGGATTC ACAGATGGCA GCCGCGGGGA 2760 
GCTTTCCCAG CCCACCCTCA CGATCCAGAC TCATCCCTAC CGCACCTGTG ACCCTGTGGA 2820 
GATGAGCTAC CCCCGGGACC AGTTCCAACT COCCATCCGG GTGGCTGACT TGCTGCAGCA 2880 
CATCACGCAG ATGAAGAGAG GCCAGGGCTA CGGGTTCAAG GAGGAATACG AGGCCTTACC 2940 
AGAGGGGCAG ACAGCTTCGT GGGACACAGC CAAGGAGGAT GAAAACCGCA ATAAGAATCG 3000 
ATATGGGAAC ATCATATCCT ACGACCATTC CCGGGTGAGG CTGCTGGTGC TGGATGGAGA 3060 
CCCGCACTCT GACTACATCA ATGCCAACTA CATTGACGGA TACCATCGAC CTCGGCACTA 3120 
CATTGCGACT CAAGGTCCGA TGCAGGAGAC TGTAAAGGAC TTTTGGAGAA TGATCTGGCA 3180 
GGAGAACTCC GCCAGCATCG TCATGGTCAC AAACCTGGTG GAAGTGGGCA GGGTGAAATG 3240 
TGTGCGATAC TGGCCAGATG ACACGGAGGT CTACGGAGAC ATTAAAGTCA CCCTGATTGA 3300 
AACAGAGCCC CTGGCAGAAT ACGTCATACG CACCTTCACA GTCCAGAAGA AAGGCTACCA 3360 
TGAGATCCGG GAGCTCCGCC TCTTCCACTT CACCAGCTGG CCTGACCACG GCGTTCCCTG 3420 
CTATGCCACT GGCCTTCTGG GCTTCGTCCG CCAGGTCAAG TTCCTCAACC CCCCGGAAGC 3480 
TGGGCCCATA GTGGTCCACT GCAGTGCTGG GGCTGGGCGG ACTGGCTGCT TCATTGCCAT 3540 
TGACACCATG CTTGACATGG CCGAGAATG A AGGGGTGGTG GACATCTTCA ACTGCGTGCG 3600 
TGAGCTCCGG GCCCAAAGGG TCAACCTGGT ACAGACAGAG GAGCAATATG TGTTTGTGCA 3660 
CGATGCCATC CTGGAAGCGT GCCTCTGTGG CAACACTGCC ATCCCTGTGT GTGAGTTCCG 3720 
TTCTCTCTAC TACAATATCA GCAGGCTGGA COCCCAGACA AACTCCAGCC AAATCAAAGA 3780 
TGAATTTCAG ACCCTCAACA TTGTGACACC CCGTGTGCGG CCCGAGGACT GCAGCATTGG 3840 
GCTCCTGCCC CGGAACCATG ATAAGAATCG AAGTATGGAC GTGCTGCCTC TGGACCGCTG 3900 
CCTGCCCTTC CTTATCTCAG TGGACGGAGA ATCCAGC AAT TACATCAACG CAGCACTGAT 3960 
GGATAGCCAC AAGCAGCCTG CCGCCTTCGT GGTCACCCAG CACCCTCTAC CCAACACCGT 4020 
GGCAG ACTTC TGG AGGCTGG TGTTCGATTA CAACTGCTCC TCTGTGGTGA TGCTGAATGA 4080 
GATGGACACT GCCCAGTTCT GTATGCAGTA CTGGCCTGAG AAGACCTCCG GGTGCTATGG 4140 
GCCCATCCAG GTGG AGTTCG TCTCCGCAGA CATCG ACGAG GACATCATCC ACAGA ATATT 4200 
CCGCATCTGT AACATGGCCC GGCCACAGGA TGGTTATCGT ATAGTCCAGC ACCTCCAGTA 4260 
CATTGGCTGG CCTGCCTACC GGGACACGCC CCCCTCCAAG CGCTCTCTGC TCAAAGTGGT 4320 
CCGACGACTG GAGAAGTGGC AGGAGCAGTA TGACGGGAGG GAGGGACGTA CTGTGGTCCA 4380 
CTGCCTAAAT GGGGGAGGCC GTAGTGGAAC CTTCTGTGCC ATCTGCAGTG TGTGTGAGAT 4440 
GATCCAGCAG CAAAACATCA TTGACGTGTT CCACATCGTG AAAACACTGC GTAACAACAA 4500 
ATCCAACATG GTGGAGACCC TGGAACAGTA TAAATTTGTA TACGAGGTGG CACTGGAATA 4560 
TTTAAGCTCC TTTTAGCTCA ATGGGATGGG GAACTGCCGG AGTCCAGAGG CTGCTGTGAC 4620 
CAAGCCCCCT TTTGTGTGAA TGGCAGTAAC TGGGCTCAGG AGCTCTGAGG TGGCACCCTG 4680 
CCTGACTCCA AGGAG AAGAC TGGTGGCCCT GTGTTCCACG GGGGGCTCTG CACCTTCTGA 4740 
GGGGTCTCCT GTTGCCGTGG G AG ATGCTG C TCCAAAAGGC CCAGGCTTCC TTTTCAACCT 4800 
AACCAGCCAC AGCCAAGGGC CCAAGCAG AA GTACACCCAC AAGCAAGGCC TTGGATTTCT 4860 
GGCTCCCAGA CCACCTGCTT TTGTTCTGAG TTTGTGGATC TCTTGGCAAG CCAACTGTGC 4920 
AGGTGCTGGG GAGTGGGAGG CTCCCCTGCC CTXXTTCTCC TTAGGAGTGG AGGAG ATGTG 4980 
TGTTCTGCTC CTCTACGTCA TGGAAAAGAT TGAGGCTCTT GGGGGTCACT GCTCTGCTGC 5040 
CCCCTGCAAC CTCCTTCAGG GGCCTCTGGC ACCAGACATT TGCAGTCTGG ACCAGTGTGA 5100 
CCTTACGATG TTCCCTAGGC CACAAGAGAG GCCCCCCATC CTCACACCTA ACCTGCATGG 5160 
GGCTTCGCCC ACAACCATTC TGTACCCCTT CCCCAGCCTG GGCCTTGACC G TCCAGCA TT 5220 
CACTGGCCGG CCAGCTGTGT CCACAGCAGT TTTTGATAAA GGTGTTCTTT GCTTTTTTGT 5280 
GTGGTCAGTG GGAGGGGGTG GAACTGCAGG GAACTTCTCT GCTCCTCCTT GTCTTTGTAA 5340 
AAAGGGACCA CCTCCCTGGG GCAGGGCTTG GGCTGACCTG TAGGATGTAA CCCC TGTGTT 5400 
TCTTTGGTGG TAGCTTTCTT TGGAAGAGAC AAACAAG ATA AGATTTG ATT ATTTTCCAAA 5460 
GTGTATGTGA AAAGAAACTT TCTTTTGGAG GGTGTAAAAT CTTAGTCTCT TAT GTCAA AA 5520 
AGAAGGGGGC GGGGGAGTTT GAGTATGTAC CTCTAAGACA AATCTCTCGG GCCTTTTATT 5580 
TTTTCCTGGC AATGTCCTTA AAAGCTCCCA CCCTGGGACA GCATGCCACT GAGCAAGGAG 5640 
AGATGGGTGA GCCTGAAGAT GGTCCCTTTG GTTTCTGGGG CAAATAG AGC ACCAGCTTTG 5700 
TGCATAATTT GGATGTCCAA ATTTGAACTC CTTCCTAAAG AAACCCAGCA GCCACCTTGA 5760 
AAAAGGCCAT TGTGGAGCCC ATTATACTTT GATTTAAAAT AGGCCAAGAG AATCAGGCCT 5820 
GGAGATCTAG GGTCTTGTCC AAAGTGTGAG TGAGTCAATG AGAGGGAACC AACATTTGCT 5880 
AAGTCTCTAC TGTATGCCAG GGATCATGCT TGGCACTTTC CATAGGACAT TTCACACAGT 5940 
CCTTAG AACC CCC AGG AG AG AGCTACTG AC TTGTTATCAT CTCCATTTG A TCATCTCCTC 6000 
CAATGAGG AA ACCCACGCAC CTTCCTTAGT AATGAAATCC TGGGTTCCAA AGGGGCAGGT 6060 
AATGGCAATG AGACTTCTCC GTGCTGTTTT CTTCATCTTC TCTAAGCCAA GCAATTATTT 6120 
TATGGAGGGA AAATAAGGCC AGAAACTTCT GAGCAGATAA CTCCACAAA T GGA AATTTAG 6180 
TACTTTCTTC CTGATGCCAG TTCTTCTGGG AAGCGCAGAA TTTCAOATAT ATTTTAGTAA 6240 
C ACATTCCCA GCTCCCCAGG AAAGCCAGTC TCATCTA ATT TCTTAGTCAG TAAAAACAAT 6300 
TCCCTGTTCC TTCAGGCTAT GAATGGACCA GCCAGGG AAA CTCTCGACCT TGATCTCTAG 6360 
CCAGTGCTTA GGCCCAATAT CTGACAGCCT CAGGTGGGCT GGGACCTAGG AAGCTCCATC 6420 
TTGAAGGCTG GTCTAGCCCC AGACAGGGCA TGAGGGGCAG AGAATTCAAG AAGGTACAGC 6480 
TTTGGCCCTC AAGAGCCCAC TGTATGCTGG GGAAATGGAA CCATGGTGCA GTAGTGTGGA 6540 
GTGiBATG AGT GTTCCATGAG CCTAGGAGCA AGAAAGTCTC TTCGGCCTCG GGCTTCCTGG 6600 
AGAAGGGOAC GTCCATTCCT GCTGGGTCTT AACAAGCATA AAAAGG AAAA AAAGGAAACT 6660 
CAGGCAAAGG GATCCATATG TGCAATGGCA AAGAAATGTG AAAAGGCATT GGG AGAAGCA 6720 
GTCTGGGGGA GGCCAGCCCA GTGCGGGCAC AGCACAACAC GGGGAGCAGC AAGAGATGAG 6780 
CCAGGGTCCA GGAG ACAGAT GCCCATCGCG AGTACAGACT TTGTCCTATT GGCAACAAGG 6840 
AGTCCATGG A GCTTTAGAGA G ATGCACTCA GCTTCGTGTT GGCCA AG ACT CCTTCTGGGC 6900 
CAATGGGGCT GCCTCTTTTC CTTTCATCAG ACACTGTGAA AACATTCCCT TAAGCGTGCA 6960 
CTTTTTA ATA TCACATCTAT TTGTCTGTCT GCTCATTGTT TTGTTGCTGG AACTAAATAT 7020 
GCAATGG ATC ATG AGACTCA GATTCTATGA GAAACCCAGG GTCTCTGCTT TACCACGG AG 7080 
CAGGGTCACC AACCCAGATC TCCAGGCCCA TGAGGATGGA ACATGAAAGG AGCCGACAAA 7140 
AGTTGCTTCC ATTGGCATGG GCTCTGGAGC TGTCCAGAAG TCCAGGGACA CCAGACTTGA 7200 
TCAAGGAAGG GCTGTCACTT TAGAGGTTCA AAAGGAAGTG CCTCAAAGCA AAGGCAAGCA 7260 
AAGGAACCCC ACGATGAACT TGCT CTTTTC CTTTGATGAG CCTCTCCCCA GGTGTATTTC 7320 
AGCAGACCCC GGGGACCCAC CCCCACTGGG CCTGCTGGCC TCCCTCGGCT CCAGCCCAAT 7380 
GCCCCAGCTG GCCTTCCCCA GCCTGCAAGG AGCCTGTAGC ATGGCAAATC TGCCTGCTGT 7440 
ATGCTATTTT CTTAGATCTT GGTACATCCA GACAGGATGA GGGTGGAGGG AGAGCTATTT 7500 
AACACAAATC CTAAGATTTT TTTCTGCTCA GGAAGGGGTG AAATAGCTGG CAGATACAAA 7560 
AGACAGTGGC TTTTATCATT TTAAATGGTA GGAATTTAAG GTGTGACTTC AGGGAGAAAC 7620 
AAACTTGCAA AAAAAAAAAA TCTCAGGCCA TGTTGGGGTA ACCCAGCAAG GGCCAGTGAT 7680 
GATTTCCCCC AGCTCATCCC CTTATTTTCC CACAACCCAA CCATTCTCTA AAGCAGGACA 7740 
GTGAATAGGT CTTAGGCCAG TGCACACAGG AAGAAATTGA GGCTTATGGA TG GGG ATG AC 7800 
TTCCCTAAGA TCCCATGQGA CAAGGATGTG GCAAGGCTTG GATGAGATGG GGCACCAGTG 7860 
CCCAGGAATT TGAACATTTT CCTTTACCCA GGAAATCTCC GGAGCCAACA CCACCACCCC 7920 
CAGGGGGTCT CCCCACCCCA CCCCATTTAC AGGGTGAGCT CAGCCTGTCA TGAGCAGAGG 7980 
AAAATATTAT TAATGCTCTC TGAGTCTTTA CAACAGGAGC TCTTACCTCA TAGATGTGGG 8040 
CTCTGTTTGG GGAAGATGCA AGGAAGTAAT GAGAAGCCCA GGAAATTTCT CCACCTGTQT 8100 
TTATGGCCTA AATAGCTTCA GGATGTATCT TAGCTGCACT CCAACATTGC ATCCTTTCTG 8160 
GGGTGAAGAA TCTGGGCCAA CCAGGGGTCC TTGGGCCTCT AGAAGGCCAC AGTAGGCCTC 8220 
TCTTTGTGGG AATGGAAGGG GACAGTTTGC TTTTAOTGCT GGCCCTCTCT GTGGGTGTGG 8280 
CCTGCAAAGG AACCAACAGA CCCTATGCTG GGGACTCTAA CATGTGAGCT CATTAAATTC 8340 
TTCCAGCATT CTAAAGGAGG GTTTGTGATT GTCACCATTT ACTG ATG AGG AAACTAAGGC 8400 
TCCTAGGGGA GAAATCACTT GCCCACAGTT CCACAGCTAG TGAGTGAATG AAC CAGGA TT 8460 
TAAACCGGTT TTTTCTCACT ACAGAGACAA TATTTTTCCA CCATTGTATC TCACATTTTT 8520 
CCCAGGAGGT TACCCATAAC AGAAGAGACT AGAGTGGAAC AGATACGTCA GTGGATAAAO 8580 
CTCAAAGCAA ACAACAGTAA GCTTAAAATT CCTTCATAGT CT CATGT TTT ACGTTCACAA 8640 
TTCATGCAAA ATTTGCATTC CACTTTCTGA TTTAGCCTTG TTGGTTTTAA TATGACTCTA 8700 
TGAATATTTC AAAAAAAAAT GTGCTCTGTT CCTCATGTTG TTCTGTTCTG TTCACCCCGC 8760 
TATGACGG AC CCTAGGTCAG CTGGTCTTCA GCTTG ACCCT AGAATTGACT CTAGGAGCAG 8820 
TG ACCCTGCT GCCTCCCAG A GCCAGTTATA GGCTCAAGAT CAAGACCAAC TGACCTTCTC 8880 
CTAGGCAGCT CCTTTGGTGT GTGGGTGCTC TGACCTCACT GTTCATGAGG GGACCTCAAC 8940 
TAAGGCATCT TCCAGTTGGG TGCTGGAAGG AACCCATTAA CTCACACTAG AATGATGAGG 9000 
ATTTGCTCAT CTGGCGTGGA GAAGGATGAG CCCACAAAAC CCTAAAGGGA AAAGAGAAGC 9060 
TGGACACAGC TGTACTCAGC AGATTCCTGA ATGCTAGGCT GGAAAGTGGT GCCTGTTOTC 9120 
CAAGTGGAGT CACATGGTTG CTAATGTGGG CAAGTCTGAG GACACACTTC ATGAGCAGCT 9180 
GGGGTCTGGA AGGCTCCTCA CTTTACCCTA GCCACACATA ATTACTGGGT GCCTACAGCA 9240 
CCTAGCACCT TGG AGGGGGC ACTATTAGG A AATCGAGATT ACTATGGCAC AATTAATTCC 9300 
TGGGTAAGGC ATGGGGTTGT GGTGGACAGA GCTCAGTCTT TAGTTTGAAC GAAAACATAC 9360 
ATACATGAAA AACATACATG AAAAAAGGAC CCTCATCAAC ATTAGAAGGG GTAGATTTGG 9420 
AGCACTTTAG GCAGGAAAAC AGGAACGCAA GGCCAGG AAA CTGGAACCCA GTGAATACTC 9480 
AGAACCGAGG ATGCAGATGA CTTATTTAGC AAAATGGTCA CTTCTGTG AC ATAGCTGGAG 9540 
AAAGGATGGG TAACAGCTTG CCAGAGCCAC TTGGAACAAG GGCAAATCTC AGTGTCTGGG 9600 
GCAA AAGATG ATGCATTTCC CTCTG ACCCA TCATGTTTAT TCATCCTCCA CTCCCCATTG 9660 
CCACACTAGC TCTTGCTGTA AGTCCTCACC AGGATCTACA TTTCCTCGTC GCTGGTGGGA 9720 
ACCCCTTAGA GTACATAGAG GTATCAGTCC AGTAAGACTG CTCTACACAA CAGAAGTGAG 9780 
GCCCAGGGAG TAGCAGCCAG GCCCTTATCC TGTTACCTCT OCAGG AGTG A CTGCCCAACC 9840 
CAGATCCAGA GACATTG AAG GAAATGATAA TTCCTTGGTA CCTCACTGCC TTGGGACAAA 9900 
ATGAAGAAAG CC ACCCTTCC TT AGGCTGCA GCTTGCCACT CCTGGGCTGG GTAAACAGGT 9960 
CATCAGCACC AGGCTCAACC AGGAGTAACA TTCTGGAAGA CATGGGTGAG CCCAAGAGGA 10020 
AGCATGAACA GGACGCTGTT CCTAAGTCAT GTCAACAGGT TGTGCTGGGC CAGGATCCCC 10080 
AGGGAAAAAA ATGGTCAACC CAACTGGAGG GTAGGTTAGA AG AAA AA AAA CATAAACGTG 10140 
GATAGTCATG TCATCTCAAA TCCCTGACTT GGCTTCCCCA TTACTTGACA GTCTGAGCTC 10200 
CTT C TTAGCC TGTGACCAGC TTCAAATCAC AGCCAAGTAA AACAAGGAAA TAGGAAAAGT 10260 
AAATCCAACT AGAAGAGACA AGCTGAGATT CAGATTTGTT TACTCCTCCC ATGCAAAGTT 10320 
TCCCTGTTGG AGGTTTTCCA TGTATACATG TCTAGAAGTG ATAGAATGCA AGGCCTTGGC 10380 
TTTGTCTTGC AGGGATCTGC CTTTGAGGTC ATAGACTGAA CAGCAGGGAG AGAGGTTAGT 10440 
GGTGGAGTGT GGGGGGAGCT GTTCTAGCTC CAGTTTCTTC TGACACATTT TTCAGGATCA 10500 
TGGATCTGAT CCTCCGAAGC ACAGCAGAGA TATCTAAGCC ATATTTGTGC ACATGAGCAG 10560 
ACTCTTCTAG TTTTTTAGTA ACCAGGGATG GGCTTTTGCA TGGCACTGAC TATAGAGATG 10620 
TCTTGTAGAG ATCAAGCCAG TCTTTTGCAT CCCACCTGCC CACCTCCAGA AGAGATGGGA 10680 
AAAGGTCATC AAAGGGCATT CACCAACTGA AATCCACTCA TGAATGTTAG GTCTCTAAAA 10740 
GGAGGCATCA ACACTCACAA TGGTAGCCTC CAAACCTAGC ATCCCACCTA TCTAAGAGCT 10800 
CAGGGGTGGT CCACTGGGGC AGATACAAGG GAAGTGCAAG GGCTCAGGAT GAAAGAAAAT 10860 
CTATTGGGAA GAGTTTTAGG G GCTTG ATC A TTATGGGGCT TCCTTCTATA TCTGAGAACT 10920 
GCTCTGGGTG GTGAGATGTG GACTCTGATC CTTAATTGGA ATGTTCGGAG AATGAGTGTC 10980 
TGGTGGCCTT GAAGTGTTGG ACAGAAAAGT ATCAGTATAA AAGCCTGGAG CTCAGGGTAA 11040 
TTAATGTAGT TCATGGTTCC TTAGTGAGCA GGACTCTTGG ATGTGGAGGA GAAAGGGTCA 11100 
TAGGAAGTAA ACCACCAAAA TTACAAAATT GAGTCTCTGT ACAATTACTT CAGTGCCTTT 11160 
GGGCTTATGA ATACAAATCA GTGGGCCTTC TCTATGATGG TCCAACAAAC TCTCAGTGTC 11220 
CACCCTGTCC CTGTATCTCC CATGGAAGAT GAATAATGTC AGGTGTTCTT TGGGTCAAAG 11280 
GCCCCAGGGC AGTCTGGAGG CTTAGAGGGC AGAGTGGTGT CATTCCATGT AAAGTTAGGC 11340 
TTCTGAGGGG TCAGGCAGAA TATGGTGTCC ATATCTTCCA TAGCTCTGCA GATTCTTGGA 11400 
TGAAGTCAAG CACAGTTTGC TAGACCCAGG TCACTCCTCT GAGTATAACT AGGACCCATG 11460 
AGTGAAACTT AATAGCTGTA AGGAAGAACC TGCTGTCTGC CAGAGAGGAT AAGCTGCOCA 11520 
TCTCAGCAGC TGTCTAAAAG AAGGCAGGTG TCTCTTTAAA GGGAAGAGAA GCATTGGTGA 11580 
AATGGATTTC AGGTCACTTC CATTCCAGAT GGGTGAGATC TTGTGGAGCT GGGATCATGT 1 1640 
TTGAACTCAT TCATACCTGT AGAGCACG AA TCCAAGTAOA TTGTGTTTGG TCTGTACAGG 11700 
CTGAAGCCCC CTGCTCTCCC ACCCAAGTGC CCCCACTGAG CAGGCCAACA TGCTGTTGTG 1 1760 
GCCACATATA CTGGGCTGAT CCAGGCTGGT TATCACCAAA CAGCAAACCA TAGGGAACAG 1 1820 
CTGCTTTGCC ATAGACCCAA TACCCA7GTA GATCTCTCAT GAGAGCAGCC ATAACTCAGA 11880 
CCCACTGACC AACAGGGCCA TGAGTGACAG CCAGAACCAG TGAAGGTCCA AGTAGGACAC 1 1940 
AG AGCAGGGC TTTTCTTACC ATACACATTA TCTCCAGAGG TTATTTCTAC CCCACTCCCT 12000 
ATTCAAGGCC TGTTGGAGCA CACTGCAAAA GCAAAAGCAC AGTAACTCAA TTTACACATG 12060 
ATTATAATCA TTTCCAGTGC ACACATTTCA TCACCAGGTG GA TCCTO AGC TAGCCCATGT 12120 
AAATCCGGGT T A ACCCAT AT TGGTAATC AT ACTCAAAAGC ACTTTTCACC CTACATTCTA 12180 
CTAGCCAATC AAAGACAAAG AGTTGTGGCC TCTACCATTG CCTTGGCTTC TGGACACCCT 12240 
CACAAGCTAT CCCAAGGTTC CCGCTCAACT CCAGGGAGGC TGACATCTTC ACATCCACTG 12300 
GGCATATAAT ATTGCATGAG ACCAAAGTCT CCACACTCTT TGCAGCCTCC TCCATGAATC 12360 
CCAATGGCCT GCACTTGTAC AGTTTGGGTG TTTGATAGAT AAAGCACGTA TGAGAAGAGA 12420 
AAACAAAATA AATCAACTTT TTAAAAAAGC CACCACTGTG CTGTCAATGT TTTTTTTTTC 12480 
TTTTCAATTC TAGCTCAGAA AAGCAGAAGG TAAATAATGT CAGGTCAATG AATATCAGAT 12540 
ATATTTTTTG ACTGTACATT ACAGTGAAGT GTAATCTTTT TACACCTGCA AGTCCATCTT 12600 
ATTTATTCTT GTAAATGTTC CCTGACAATG TTTGTAATAT GGCTGTGTTA AAAAATCTAT 12660 
ACAATAAAGC TGTGACCCTG 

Seq ID NO: 98 Protein sequence: 
Protein Accession #: NP_008981.1 

1 11 21 31 41 51 

| | | | | | 

GIREFNPPRO TEINTYROS1 NEPHOSPHAT ASERECEPTO RTYPETHOMO SAPIENSMAS 60 
LAALALSLLL RLQLPPLPGA RAQSAPGGCS FDEHYSNCGY SVALGTNGFT WEQINTTEKP 120 
MLDQAVPTGS FMMVNSSGRA SGQKAHLLLP TLKENDTHCI DFHYYFSSRD RSSPGALNVY 180 
VKVNGGPQGN PVWNVSG WT EGWVKAELAI STFWPHFYQV IFESVSLKGH PGYIAVDEVR 240 
VLAHPCRKAP HFLRLQNVEV NVGQNATFQC IAGGKWSQHD KLWLQQWNGR DTALMVTRW 300 
NHRRFSATVS VADTAQRSVS KYRCVIRSDG GSGVSNYAEL IVKEPPTPIA PPELLAVGAT 360 
YLWDCPNANS IIGDGPIILK EVEYRTTTGT WAETHIVDSP NYKLWHLDPD VEYEIRVLLT 420 
RPGEGGTGPP GAPLTTRTKC ADPVHGPQNV EIVDIRARQL TLQWEPFGYA VTRCHSYNLT 480 
VQYQYVFNQQ QYEAEEVIQT SSHYTLRGLR PFMTIRLRLL LSNPEGRMES EELWQTEED 540 
VPGAVPLESI QGGPFEEKIY IQWKPPNETN GVITLYEINY KAVGSLDPSA DLSSQRGKVF 600 
KLRNETHHLP VGLYPGTTYS PTIKASTAKG FGPPVTTRIA TKISAPSMPE YDTDTPLNET 660 
DTTITVMLKP AQSRGAPVSV YQLVVKEERL QKSRRAADII ECFSVPVSYR NASSLDSLHY 720 
FAAELKPANL PVTQPFTVGD NKTYNGYWNP PLSPLKSYSI YFQALSKANG ETKINCVRLA 780 
TKAPMGSAQV TPGTPLCLLT TGASTQNSNT VEPEKQVDNT VKMAGVIAGL LMFIIILLGV 840 
MLTIKRRRNA YSYSYYLSQR KLAKKQKETQ SGAQREMGPV ASADKPTTKL SASRNDEGFS 900 
SSSQDVNGFT DGSRGELSQP TLTIQTHPYR TCDPVEMSYP RDQFQLAIRV ADLLQHITQM 960 
KRGQGYGFKE EYEALPEGQT ASWDTAKEDE NRNKNRYGNI ISYDHSRVRL LVLDGDPHSD 1020 
YINANYIDGY HRPRHYIATQ GPMQETVKDF WRMIWQENSA SIVMVTNLVE VGRVKCVRYW 1080 
PDDTEVYGDI KVTLIETEPL AEYVIRTFTV QKKGYHEIRE LRLFHFTSWP DHGVPCYATG 1140 
LLGFVRQVKF LNPPEAGPIV VHCSAGAGRT GCFIAIDTML DMAENEGVVD IFNCVRELRA 1200 
QRVNLVQTEE QYVFVHDAIL EACLCGNTAI PVCEFRSLYY NISRLDPQTN SSQIKDEFQT 1260 
LNIVTPRVRP EDCSIGLLPR NHDKNRSMDV LPLDRCLPFL ISVDGESSNY INAALMDSHK 1320 
QPAAFVVTQH PLPNTVADFW RLVFDYNCSS VVMLNEMDTA QFCMQYWPEK TSGCYGPIQV 1380 
EFVSADIDED IIHRIFRICN MARPQDGYRI VQHLQYIGWP AYRDTPPSKR SLLKVVRRLE 1440 
KWQEQYDGRE GRTVVHCLNG GGRSGTFCAI CSVCEMIQQQ NIIDVFHIVK TLRNNKSNMV 1500 
ETLEQYKFVY EVALEYLSSF 

Seq ID NO: 99 DNA sequence 

Nucleic Acid Accession #: NM_002988.1 

Coding sequence: 71-340 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

CCGGCACGAG AGGAGTTGTG AGTTTCCAAG CCCCAGCTCA CTCTGACCAC TTCTCTGCCT 60 
GCCCAGCATC ATGAAGGGCC TTGCAGCTGC CCTCCTTGTC CTCGTCTGCA CCATGGCCCT 120 
CTGCTCCTGT GCACAAGTTG GTACCAACAA AGAGCTCTGC TGCCTCGTCT ATACCTCCTG 180 
GCAGATTCCA CAAAAGTTCA TAGTTGACTA TTCTGAAACC AGCCCCCAGT GCCCCAAGCC 240 
AGGTGTCATC CTCCTAACCA AGAGAGGCCG GCAGATCTGT GCTGACCCCA ATAAGAAGTG 300 
GGTCCAGAAA TACATCAGCG ACCTGAAGCT GAATGCCTGA GGGGCCTGGA AGCTGCGAGG 360 
GCCCAGTGAA CTTGGTGGGC CCAGGAGGGA ACAGGAGCCT GAGCCAGGGC AATGGCCCTG 420 
CCACCCTGGA GGCCACCTCT TCTAAG AGTC CCATCTGCTA TGCCCAGCCA CATTAACTAA 480 
CTTTA ATCTT AGTTTATGCA TCATATTTCA TTTTQAAATT GATTTCTATT G TTGAG CTGC 540 
ATTATG AAAT TAGTATTTTC TCTG ACATCT C ATG AC ATTG TCTTTATCAT CCTTTCCCCT 600 
TTCCCTTCAA CTCTTCGTAC ATTCAATGCA TGGATCAATC AGTGTGATTA GCTTTCTCAG 660 
CAGACATTGT GCCATATGTA TCAAATGACA AATCTTTATT GAATGGTTTT GCTCAGCACC 720 
ACCTTTTAAT ATATTGGCAG TACTTATTAT ATAAAAGGTA AACCAGCATT CTCACTGTGA 780 
AAAAAAAAAA AAAAAAAAAA AAA 

Seq ID NO: 100 Protein sequence: 
Protein Accession #: NP_002979.1 

1 11 21 31 41 51 
| | | | | | 

MKGLAAALLV LVCTMALCSC AQVGTNKELC CLVYTSWQIP QKFIVDYSET SPQCPKPGVI 60 
LLTKRGRQIC ADPNKKWVQK YISDLKLNA 
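
The "Coding sequence" coordinates given with each DNA record can be cross-checked against the paired protein record. The following is a minimal sketch of that check, not part of the original listing. It assumes the coordinates are 1-based and inclusive and that the interval ends on the stop codon; the helper names (`join_listing`, `extract_cds`, `translate`) and the short example line are hypothetical and chosen only for illustration.

```python
import re

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_listing(lines):
    """Join listing lines into one sequence string.

    Keeps letters only, so the spaces between 10-base groups and the
    trailing position counter on each line are dropped.
    """
    return "".join("".join(re.findall(r"[A-Za-z]+", line)).upper() for line in lines)


def extract_cds(seq, start, end):
    """Return the coding region for 1-based, inclusive coordinates (assumed)."""
    return seq[start - 1:end]


def translate(cds):
    """Translate codon by codon, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    protein = []
    for i in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Hypothetical two-group listing line with a coding region at positions 3-17.
example_lines = ["CCATGAAGGG CCTTTGACCC 20"]
sequence = join_listing(example_lines)
cds = extract_cds(sequence, 3, 17)
print(cds)             # ATGAAGGGCCTTTGA
print(translate(cds))  # MKGL
```

Applied to Seq ID NO: 99, the interval 71-340 spans 270 nucleotides, that is 89 codons followed by a stop codon, which agrees with the 89 residues listed for Seq ID NO: 100.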

Seq ID NO: 101 DNA sequence 

Nucleic Acid Accession #: NM_015507.2 

Coding sequence: 241-1902 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

CCGCAGAGGA GCCTCGGCCA GGCTAGCCAG GGCGCCCCCA GCCCCTCCCC AGGCCGCGAG 60 
CGCCCCTGCC GCGGTGCCTG GCCTCCCCTC CCAGACTGCA GGGACAGCAC CCGGTAACTG 120 
CGAGTGG AGC GGAGGACCCG AGCGGCTG AG GAGAGAGGAG GCGGCGGCTT AGCTGCTAOG 1 80 
GGGTCCGGCC GGCGCCCTCC CGAGGGGGGC TCAGGAGGAG GAAGGAGGAC CCGTGCGAGA 240 
ATGCCTCTGC CCTGGAGCCT TGCGCTCCCG CTGCTGCTCT CCTGGGTGGC AGGTGGTTTC 300 
GGGAACGCGG CCAGTGCAAG GCATCACGGG TTGTTAGCAT CGGCACGTCA GCCTGGGGTC 360 
TGTCACTATG G AACTAAACT GGCCTGCTGC TACGGCTGGA GAAGAAACAG CAAGGGAGTC 420 
TGTG AAGCTA CATGCGAACC TGGATGTAAG TTTGGTGAGT GCGTGGGACC AAACAAATGC 480 
AGATGCTTTC CAGGATACAC CGGGAAAACC TGCAGTCAAG ATGTGAATGA GTGTGGAATG 540 
AAACCCCGGC CATGCCAACA CAGATGTGTG AATACACACG GAAGCTACAA GTGCTTTTGC 600 
CTCAGTGGCC ACATGCTCAT GCCAGATGCT ACGTGTGTGA ACTCTAGGAC ATGTGCCATG 660 
ATAAACTGTC AGTACAGCTG TGAAGACACA GAAG AAGGGC CACAGTGCCT GTGTCCATCC 720 
TCAGGACTCC GCCTGGCCCC AAATGGAAGA GACTGTCTAG ATATTGATGA ATGTGCCTCT 780 
GGTAAAGTCA TCTGTCCCTA CAATCGAAGA TGTGTGAACA CATTTGGAAG CTACTACTGC 840 
AAATGTCACA TrGGTTTCGA ACTGCAATAT ATCAGTGGAC GATATGACTG TATAGATATA 900 
AATG AATGTA CTATGGATAG CCATACGTGC AGCCACCATG CCAATTGCTT CAATACCCAA 960 
GGGTCCTTCA AGTGTAAATG CAAGCAGGGA TATAAAGGCA ATGGACTTCG GTGTTCTGCT 1020 
ATCCCTGAAA ATTCTGTGAA GGAAGTCCTC AGAGCACCTG GTACCATCAA AGACAGAATC 1080 
AAGAAGTTGC TTGCTCACAA AAACAGCATG AAAAAGAAGG CAAAAATTAA AAATGTTACC 1 140 
CCAGAACCCA CCAGGACTCC TACCCCTAAG GTGAACTTGC AGCCCTTCAA CTATGAAGAG 1200 
ATAGTTTCCA GAGGCGGGAA CTCTCATGGA GGTAAAAAAG GGAATGAAGA GAAAATGAAA 1260 
GAGGGGCTTG AGGATGAGAA AAGAGAAGAG AAAGCCCTGA AGAATGACAT AGAGGAGCGA 1320 
AGCCTGCGAG GAGATGTGTT TTTCCCTAAG GTGAATGAAG CAGGTGAATT CGGCCTGATT 1380 
CTGGTCCAAA GGAAAGCGCT AACTTCCAAA CTGGAACATA AAGATTTAAA TATCTCGGTT 1440 
G ACTGCAGCT TCAATCATGG GATCTGTGAC TGGAAACAGG ATAGAGAAGA TGATTTTGAC 1500 
TGGAATCCTG CTGATCGAGA TAATOCTATT GGCTTCTATA TGGCAGTTCC GGCCTTGGCA 1 560 
GGTCACAAGA AAGACATTGG CCGATTGAAA CTTCTCCTAC CTGACCTGCA ACCCCAAAGC 1620 
AACTTCTGTT TGCTCTTTGA TTACCGGCTG GCCGGAGACA AAGTCGGGAA ACTTCGAGTG 1680 
TTTGTGAAAA ACAGTAACAA TGCCCTGGCA TGGGAGAAGA CCACGAGTGA GGATGAAAAG 1740 
TGGAAGACAG GGAAAATTCA GTTGTATCAA GGAACTGATG CTACCAAAAG CATCATTTTT 1800 
GAAGCAGAAC GTGGCAAGGG CAAAACCGGC GAAATCGCAG TGGATGGCGT CTTGCTTGTT 1860 
TCAGGCTTAT GTCCAGATAG CCTTTTATCT GTGGATGACT GAATGTTACT ATCTTTATAT 1920 
TTGACTTTGT ATGTCAGTTC CCTGGTTTTT TTGATATTGC ATCATAGGAC CTCTGGCATT 1980 
TTAGAATTAC TAGCTGAAAA ATTGTAATGT ACCAACAGAA ATATTATTGT AAGATGCCTT 2040 
TCTTGTATAA GATATGCCAA TATTTGCTTT AAATATCATA TCACTGTATC TTCTCAGTCA 2100 
TTTCTGAATC TTTCCACATT ATATTATAAA ATATGGAAAT GTCAGTTTAT CTCCCCTCCT 2160 
CAGTATATCT GATTTGTATA AGTAAGTTGA TGAGCTTCTC TCTACAACAT TTCTAGAAAA 2220 
TAG AAAAAAA AGCACAGAG A AATGTTTAAC TGTTTG ACTC TTATGATACT TCTTGGAAAC 2280 
TATGACATCA AAGATAGACT TTTGCCTAAG TGGCTTAGCT GGGTCTTTCA TAGCCAAACT 2340 
TGTATATTTA AATTCTTTGT AATAATAATA TCCAAATCAT CAAAAAAAAA AAAAAAAA 

Seq ID NO: 102 Protein sequence: 
Protein Accession #: NP_056322.2 

1 11 21 31 41 51 
| | | | | | 

MPLPWSLALP LLLSWVAGGF GNAASARHHG LLASARQPGV CHYGTKLACC YGWRRNSKGV 60 
CEATCEPGCK FGECVGPNKC RCFPGYTGKT CSQDVNECGM KPRPCQHRCV NTHGSYKCFC 120 
LSGHMLMPDA TCVNSRTCAM INCQYSCEDT EEGPQCLCPS SGLRLAPNGR DCLDIDECAS 180 
GKVICPYNRR CVNTFGSYYC KCHIGFELQY ISGRYDCIDI NECTMDSHTC SHHANCFNTQ 240 
GSFKCKCKQG YKGNGLRCSA IPENSVKEVL RAPGTIKDRI KKLLAHKNSM KKKAKIKNVT 300 
PEPTRTPTPK VNLQPFNYEE IVSRGGNSHG GKKGNEEKMK EGLEDEKREE KALKNDIEER 360 
SLRGDVFFPK VNEAGEFGLI LVQRKALTSK LEHKDLNISV DCSFNHGICD WKQDREDDFD 420 
WNPADRDNAI GFYMAVPALA GHKKDIGRLK LLLPDLQPQS NFCLLFDYRL AGDKVGKLRV 480 
FVKNSNNALA WEKTTSEDEK WKTGKIQLYQ GTDATKSIIF EAERGKGKTG EIAVDGVLLV 540 
SGLCPDSLLS VDD 



Seq ID NO: 103 DNA sequence 
Nucleic Acid Accession #: NM_001565.1 

Coding sequence: 67-363 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

GAOACATTCC TCAATTGCTT AO ACATATTC TGAGCCTACA GCAGAGGAAC CTCCAOTCTC 60 
AGCACCATGA ATCAAACTGC OATTCTGATT TGCTGCCTTA TCTTTCTGAC TCTAAGTGGC 120 
ATTCAAGGAG TACCTCTCTC TAGAACCGTA CGCTGTACCT GCATCAGCAT TAGTAATCAA 180 
CCTGTTAATC CAAGGTCTTT AG AAAAACTT GAAATTATT C CTGCAAGOCA ATTTTGTOCA 240 
CGTGTTG AG A TCATTGCTAC AATGAAAAAG AAGGGTGAGA AGAGATGTCT G AATCCAGAA 300 
TCGAAGGCCA TCAAGAATTT ACTGAAAGCA GTTAGCAAGG AAATGTCTAA AAGATCTCCT 360 
TAAAACCAGA GGGGAGCAAA ATCGATGCAG TGCTTCCAAG GATGGACCAC ACAGAGGCTG 420 
CCTCTCCCAT CACTTCCCTA CATGGAGTAT ATGTCAAGCC ATAATTGTTC TTAGTTTGCA 480 
GTTACACTAA AAGGTGACCA ATG ATGGTCA CCAAATCAGC TGCTACTACT CCTGTAGG AA 540 
GGTTAATGTT CATCATCCTA AGCTATTCAG TAATAACTCT ACCCTGGCAC TATAATGTAA 600 
GCTCTACTGA GGT.GCTATGT TCTTAGTGGA TGTTCTG ACC CTGCTTCAAA TATTTCCCTC 660 
ACCTTTCCCA TCTTCCAAGG GTACTAAGGA ATCTTTCTGC TTTGGGGTTT ATCAGAATTC 720 
TCAGAATCTC AAATAACTAA AAGGTATGCA ATCAAATCTG CTTTTTAAAG AATGCTCTTT 780 
ACTTCATGGA CTTCCACTGC CATCCTCCCA AGGGGCCCAA ATTCTTTCAG TGGCTACCTA 840 
CATACAATTC CAAACACATA CAGGAAGGTA GAAATATCTG AAAATGTATG TGTAAGTATT 900 
CTTATTTAAT GAAAGACTGT ACAAAGTATA AGTCTTAGAT GTATATATTT CCTATATTGT 960 
TTTCAGTGTA CATGG AATAA .C ATGTA ATTA AGTACTATGT ATCAATGAGT AACAGOAAAA 1020 
TTTTAAAAAT ACAGATAGAT ATATGCTCTG CATGTTACAT AAGATAAATG TGCTGAATGG 1080 
TTTTCAAATA AAAATGAGGT ACTCTCCTGG AAATATTAAG 

Seq ID NO: 104 Protein sequence: 
Protein Accession #: NP_001556.1 

1 11 21 31 41 51 

MNQTAILICC LIFLTLSGIQ GVPLSRTVRC TCISISNQPV NPRSLEKLEI IPASQFCPRV 60 
EIIATMKKKG EKRCLNPESK AIKNLLKAVS KEMSKRSP 

Seq ID NO: 105 DNA sequence 

Nucleic Acid Accession #: NM_015068.1 

Coding sequence: 1170-2243 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

GTAACAACCG TCACCCTGGG TCCCGACTGC CCACCTCCTC CTCCTCCCCC TCCCCCCAAC 60 
AACAACAACA ACAACAACTC CAAGCACACC GGCCATAAGA GTGCGTGTGT CCCCAACATG 120 
ACCGAACGAA GAAGGGACGA GCTCTCTGAA GAGATCAACA ACTTAAGAGA GAAGGTCATG 180 
AAGCAGTCGG AGG AGAACAA CAACCTGCAG AGCCAGGTGC AGAAGCTCAC AG AGGAGAAC 240 
ACCACCCTTC GAG AGCAAGT GGAACCCACC CCTGAGGATG AGG ATG ATG A CATCG AGCTC 300 
CGCGGTGCTG CAGCAGCTGC TGCCCCACCC CCTCCAATAG AGGAAGAGTG CCCAGAAGAC 360 
CTCCCAGAG A AGTTCGATGG CAACCCAGAC ATGCTGGCTC CTTTCATGGC CCAGTGCCAG 420 
ATCTTCATGG AAAAGAGCAC CAGGGATTTC TCAGTTGATC GTGTCCGTGT CTGCTTCGTG 480 
ACAAGCATGA TGACCGGCCG TGCTGCCCGT TGGGCCTCAG CAAAGCTGGA GCGCTCCCAC 540 
TACCTGATGC ACAACTAOCC AGCTTTCATG ATGGAAATGA AGCATGTCTT TG AAGACCCT 600 
CAGAGGCG AG AGGTTGCCAA ACGCAAGATC AGACGCCTGC GCCAAGGCAT GGGGTCTGTC 660 
ATCGACTACT CCA ATGCTTT CCAGATGATT GCCCAGG ACC TGGATTGGAA CGAGCCTGCG 720 
CTGATTGACC AGTACCACGA GGGCCTCAGC GACCACATTC AGGAGGAGCT CTCCCACCTC 780 
GAGGTCGCCA AGTCGCTGTC TGCTCTG ATT GGGCAGTGCA TTCACATTG A GAGAAGGCTG 840 
GCCAGGGCTG CTGCAGCTCG CAAGCCACGC TCGCCACCCC GGGCGCTGGT GTTGCCTCAC 900 
ATTGCAAGCC ACCACCAGGT AGATCCAACC GAGCCGGTGG GAGGTGCCCG CATGCGCCTG 960 
ACGCAGGAAG AAAAAGAAAG ACGCAGAAAG CTGAACCTGT GCCTCTACTG TGGAACAGGA 1020 
GGTCACTACG CTGACAATTG TCCTGCCAAG GCCTCAAAGT CTTCGCCGOC GGGAAACTCC 1080 
CCGGCCCCGC TGTAGAGGG A CCTTCAGCGA CCGGGCCAG A AATAATAAGG TCCCCACAAG 1 140 
ATGATGCCTC ATCTCCACAC TTGCAAGTO A TGC TCCAGAT TCATCTTCCG GGCAGACACA 1200 
CCCTGTTCGT CCGAGCCATG ATCGATTCTG GTGCTTCTGG CAACTTCATT GATCACGAAT 1260 
ATGTTGCTCA AAATGGAATT CCTCTAAGAA TCAAGGACTG GCCAATACTT GTGGAAGCAA 1320 
TTGATGGGCG CCCCATAGCA TCGGGCCCAG TTGTCCACGA AACTCACGAC CTGATAGTTG 1380 
ACCTGGGAGA TCACCGAGAG GTGCTGTCAT TTGATGTGAC TCAGTCTCCA TTCTTCCCTG 1440 
TCGTCCTAGG GGTTCGCTGG CTGAGCACAC ATGATCCCAA TATCACATGG AGCACTCGAT 1500 
CTATCGTCTT TGATTCTGAA TACTGCCGCT ACCACTGCCG GATGTATTCT CCAATACCAC 1S60 
CATCGCTCCC ACCACCAGCA CCACAACCGC CACTCTATTA TCCAGTAGAT GGATACAGAG 1620 
TTTACCAACC AGTG AGGTAT TACTATGTCC AGAATGTGTA CACTCCAGTA GATGAGCACG 1680 
TCTACCCAGA TCACCGCCTG GTTGACCCTC ACATAGAAAT GATACCTGGA OCAC ACAGT A 1740 
TTCCCAGTGG ACATGTGTAT TCACTGTCCG AACCTGAAAT GGCAGCTCTT CGAGATTTTG 1800 
TGGCAAGA AA TGTAAAAGAT GGGCTAATTA CTCCAACGAT TGCACCTAAT GGAGCCCAAG 1 860 
TTCTCCAGGT GAAGAGGGGG TGGAAACTGC AAGTTTCTTA TGATTGCCGA GCTCCAAACA 1920 
ATTTTACTAT CCAGAATCAG TATCCTCGCC TATCTATTCC AAATTTAGAA GACCAAGCAC 1980 
ACCTGGCAAC GTACACTG AA TTCGTACCTC AAATACCTGG ATACCAAACA TACCCCACAT 2040 
ATGCCGCGTA CCCGACCTAC CCAGTAGGAT TCGCCTGGTA CCCAGTGGGA CGAGACGGAC 2100 
AAGGAAG ATC ACTATATGTA CCTGTGATGA TCACTTGGAA TCCACACTGG TACCGCCAGC 2160 
CTCCGGTACC ACAGTACCCG CCGCCACAGC CGCCGCCTCC ACCACCACCA CCGCCGCCGC 2220 
CTCCATCTTA CAGTACCCTG TAAATACCTG TCATGTCCTT CAGGATCTCT GCCCTCAAAA 2280 
TTTATTCCTG TTCAGCTTCT CAATCAGTGA CTGTGTGCTA AATTTTAGGC TACTGTATCT 2340 
TCAGGCCACC TGAGGCACAT CCTCTCTG AA ACGGCTATGG AAGGTTAGGG CCACTCTGGA 2400 
CTGGCACACA TCCTAAAGCA CCAAAAGACC TTCAACATTT TCTGAGAGCA ACAGAGTATT 2460 
TGCCAATAAA TGATCTCTCA TTTTTCCACC TTGACTGCCA ATCTA ACTAA AATAATTAAT 2520 
AAGTTTACTT TCCAGCCAGT CCTGGAAGTC TGGGTTTTAC CTGCCAAAAC CTCCATCACC 2580 
ATCTA AATTA TAGGCTGCCA AATTTGCTGT TTAACATTTA CAGAGAAGCT GATACAAAOG 2640 
CAGGA AATGC TO ATTTCTTT ATGGAGGGGG AGACGAGGAG G AGGAGG ACA TGACTTTTCT 2700 
TGCGGTTTCG GTACCCTCTT TTTAAATCAC TGGAGGACTG AGGCCTTATT AAGGAAGCCA 2760 
AAATTATCGG TGCAGTGTGG AAAGGCTTCC GTGATCCTCT CGCTGCACCC TTAGAAACTT 2820 
CACCGTCTTC AAACTCCATT TCCATGGTTC TGTTAATTCT CAAGG AGCAG CAACTCGACT 2880 
GGTTCTCCCA GGAGCAGGAA AAACCCTTGT GACATGAAAC ATCTCAGGCC TGAAAAGAAA 2940 
GTGCTCTCTC AGATGGACTC TTGCATGTTA AGACTATGTC TTCACATCAT GGTGCAAATC 3000 
ACATGTACCC AATGACTCCG GCTTTGACAC AACACCTTAC CATCATCATG CCATGATGGC 3060 
TTCCACAAAG CATTAAACCT GGTAACCAGA GATTACTGGT GGCTCCAGCG TTGTTAGATG 3120 
TTCATGAAAT GTGACCACCT CTCAATCACC TTTGAGCGCT AAAGAGTAGC ACATCAAAAG 31 80 
GACTCCAAAA TCCCATACCC AACTCTTAAG AGATTTGTCC TGGTACTTCA GAAAGAATTT 3240 
TCATGAGTGT TCTTAATTGG CTGGAAAAGC ACCAGCTGAC GTTTTGGAAG AATCTATCCA 3300 
TGTGTCTGCC TCCATATGCA TCTGGGCATT TCATCTTCAG TCCCCTCATT AGACTGTAGC 3360 
ATTAGGATGT GTGGAGAGAG GAGAAATGAT TTAGCACCCA GATTCACACT CCTATGCCTG 3420 
GAAGGGGGAC ATCTTTGAAG AAGAGGAATT AGGGCTGTGG ACACTGTCTT GAGGATGTGG 3480 
ACTTCCTTAG TGAGCTCCAC ATTACTTGAT GGTAACCACT TCAAAAGGAT CAGAATCCAC 3540 
GTAATGAAAA AGGTCOCTCT AG AGGATGG A GCTG ATGTGA AGCTGCCAAT GGATGAAAAG 3600 
CCTCAGAAAG CAACTCAAAG GACTCAAAGC AACGGACAAC ACAAGAGTTG TCTTCAGCCC 3660 
AGTGACACCT CTGATGTCCC CTGGAAGCTT TGTGCTAACC TGGGACTGCC TGACTTCCTT 3720 
TAGCCTGGTC CCTTGCTACT ACCTTG AACT GTTTTATCTA ACCTCTCTTT TTCTGTTTAA 3780 
TTCTTTGCTA CTGCCATTG A COCTGCTGCA GGATTTGTGT CATTTTCCTG OCTGGTTGCT 3840 
GAGACTCCAT TTTGCTGCCA CACACAGAGA TGTAAGAGGC AGGCTTTAAT TGCCAAAGCA 3900 
CAGTTTGAGC AGTAGAAAAC AACATGGTGT ATATCTCAAA TTGCCTGACA TGAAGAGGAG 3960 
TCTAACOGTG AAGTTTCACT TTTCATCAGC ATCATCTTTC ACATGTTCAT TA TCATC CGC 4020 
TCTTATTCTT GC ATGTTTAA ACACTTAAAA TTTTTAGTAT AATTTTTAGT GTGTTTTGAA 4080 
GTGGTGACTA GGCTTTCAAA AACTTCCATT G AATTACAAA GCACTATCCA GTTCTTATTG 4140 
TTAAACTAAG TAAAAATGAT AAGTA ACATA GTGTAAAATA TTCCTTTACT GTGAACTTCT 4200 
TACAATGCTG TGAATGAGAG GCTCCTCAGA ACTGGAGCAT TTGTATAAT A ATT CATCCTG 4260 
TTCATCTTCA ATTTT AAC AT CAT AT ATAAT TTCAATTCTA TCAATTGGGC CTTTAAAAAT 4320 
CATATAAAAG GATATAAAAT TTGAAAAGAG AAACCTAATT GGCTATTT AA TCC AAAACAA 4380 
Cl ' ITlTl l Tl TCCTTCAATG GAATCAGAAA GCTTGTCAAT CACTCATGTG TTTTA GAGT A 4440 
ATTAC H 11A AAATGGTGCA TTTGTGCTTC TGAACTATTT TGA AGAGTC A CTTCTGTTTA 4500 
CCTCAAGTAT CAATTCATCC TCCATACATT TGAATTCAAO TTGTTTTTTG TCAA ATTTAC 4560 
AGTTGTCAAT TG ATCTTCAA GCTGCAGGGT GCCTAGAAAT GGGCCGTTGT CTGTAGCCCT 4620 
GGCATGTGCA CACGG ACATT TGCCACCACT GCAAGCAAAA GTCTGGAGA A GTTCACCAAC 4680 
GACAAGAACG ATTAGGGAAA ATATGCTGCT GTGGGTTAAC AACTCAGAAA GTCCCTGATC 4740 
CACATTTGGC TGTTTACTAA AGCTTGTOAT TAACTTTTTG GCAGTGTGTA CT ATGCTCTA 4800 
TTGCTATATA TGCTATCTAT AAATGTAGAT GTTAAGGATA AGTAATTCTA AATTTATTAT 4860 
TCTATAGTTT TGAAGTTTGG TTAAGTTTCC TTTCACTCAA TTGATTTATT TTGTTGTTAA 4920 
TCAAATTTAT GTTAATTGGA TCCTTTAAAT TTTTTTTGGC ATTTTCCAAC AAAAATGGCT 4980 
TTATTCATAA GAAAGGAAAA AAATCAATGG AATTTGATAT CTAAAG AAGT TAG AAAGGGA 5040 
GCAAAATAAA AAACATAAAG GAOATAGATG AATTAGTAAG CAAATCAGTA GTCGAGTTTT 5100 
TCAAACTGGC AAAATTAATT AATTGACTTT TAGCCCAAAT TTACATTGTT AATTAAATCA 5160 
AGAAGOAAGA AGATCTAAGA GCTCCCATTG ATAGGCAAGC CTAGAGAGAA CTAGCTAAAT 5220 
TTATCATGCT AGGATATTG A AACACAGAAA GTTTACATAC ATTTATGAAG GGTCAATTTA 5280 
GTTTGGACAG TGAGGTATTT GTCTTAGTGG AAAAAAGGAG AATTAGTCTG ATCAAATCGT 5340 
GAAGTAATAC AGTGAACTTG CAGGTGCACA AAATAAGAGG GCCACATCTA TATGGTGCAG 5400 
TCTGGAATTC TGTTTAAGTT TGTAGGTACC TCTTGGACTT CTGAATTGAT CCAGTTGTCA 5460 
TCCACCACAG ACATCTCACA TCAGATACAG ACAGTTCCAA GATTGACAAC AG AGAAC AAC 5520 
CTGCTGGAAA GACCTGGGCA GAAATGGAGA GCCCTGCGGG AACCATGCTA CATTTTCATC 5580 
TAAAGAGAGA ATGCACATCT GATGAGACTG AAAGTTCTTT GTTGTTTTAG ATTGTAGAAT 5640 
GGTATTG AAT TGGTCTGTGG AAAATTGCAT TGCTTTTATT TCTTTGTGTA ATCAAGT TTA 5 700 
AGTAATAGGG GATATATAAT CATAAGCATT TTAGGGTGGG AGGGACTATT AAGTAATTTT 5760 
AAGTGGGTGG GGTTATTTAG AATGTTAGAA TAATATTATG TATTAGATAT CGCTATAAGT 5820 
GGACATGCGT ACTTACTTGT AACCCTTTAC CCTATAATTG CTATCCTTAA AGATTTCAAA 5880 
TAAACTCGGA GGGAACTGCA GGGAGACCAA CTTATTTAGA GCGAATTGGA CATGGATAAA 5940 
AACCCCAGTG GG AGAAAGTT CAAAGGTG AT TAG ATTAATA ATTTAATAGA GGATGAGTGA 6000 
CCTCTG ATAA ATTACTGCTA G AATGAACTT GTCAATG ATG GATGGTAAAT TTTCATGGAA 6060 
GTTATAA AAG TGATAAATAA AAACCCTTGC TTTTACCCCT GTCAGTAGCC CTCCTCCTAC 6120 
CACTG AACCC CATTGCCCCT ACCCCTCCTT CTAACTTTAT TGCTGTATTC TCTTCACTCT 6180 
ATATTTCTCT CTATTTGCTA ATATTGCATT GCTGTTACAA TA A AAATTCA ATAAAG ATTT 6240 
AGTGGTTAAG TGC 

Seq ID NO: 106 Protein sequence: 
Protein Accession #: NP_055883.1 

1 11 21 31 41 51 

MTERRRDELS EEINNLREKV MKQSEENNNL QSQVQKLTEE NTTLREQVEP TPEDEDDDIE 60 
LRGAAAAAAP PPPIEEECPE DLPEKFDGNP DMLAPFMAQC QIFMEKSTRD FSVDRVRVCF 120 
VTSMMTGRAA RWASAKLERS HYLMHNYPAF MMEMKHVFED PQRREVAKRK IRRLRQGMGS 180 
VIDYSNAFQM IAQDLDWNEP ALIDQYHEGL SDHIQEELSH LEVAKSLSAL IGQCIHIERR 240 
LARAAAARKP RSPPRALVLP HIASHHQVDP TEPVGGARMR LTQEEKERRR KLNLCLYCGT 300 
GGHYADNCPA KASKSSPAGN SPAPL 

Seq ID NO: 107 DNA sequence 

Nucleic Acid Accession #: NM_003679.1 

Coding sequence: 47-1507 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
| | | | | | 

GGCACGAGCA GAAGCAACAA TAATTGTGAA AAATACTTCA GCAGTTATGG ACTCATCTGT 60 
CATTCA AAGG AAAAAAGTAG CTGTCATTGG TGGTGGCTTG GTTGGCTCAT TACA AGCATG 120 
CTTTCTTGCA AAGAGGAATT TCCAGATTGA TGTATATGAA GCTAGGGAAG ATACTCGAGT 180 
GGCTACCTTC ACACGTGGAA GAAGCATTAA CTTAGCCCTT TCTCATAGAG GACGACAAGC 240 
CTTGAAAGCT GTTGGCCTGG A AGATCAGAT TGTATCCCAA GGTATTCCCA TGAGAGCAAG 300 
AATGATCCAC TCTCTTTCAG G AAAAAAGTC TGCAATTOCC TATGGG ACAA AGTCTCAGTA 360 
TATTCTTTCT GTAAGCAGAG AAAATCTAAA CAAGOATCTA TTGACTGCTG CTGAGAAATA 420 
CCCCAATGTG AAAATGCACT TTAACCACAG GCTGTTG AAA TGT AATCCAG AGG AAGGAAT 480 
G ATCACAGTG CTTGGATCTG ACAAAGTTCC CAAAGATGTC ACTTGTGACC TCATTGTAGG 540 
ATGTG ATGGA GCCTATTCAA CTGTCAGATC TCACCTGATG AAGAAACCTC GCTTTGATTA 600 
CAGTCAGCAG TACATTCCTC ATGGGTACAT GGAGTTGACT ATTCCACCTA AG AACGGAGA 660 
TTATGCCATG GAACCTAATT ATCTGCATAT TTGGCCTAGA AATACCTTTA TGATGATTGC 720 
ACTTCCTAAC ATGAACAAAT CATTCACATG TACTTTGTTC ATGCCCTTTG AAGAGTTTGA 780 
AAAACTTCTA ACCAGTAATG ATGTGGTAGA TTTCTTCCAG AAATACTTTC CGGATGCCAT 840 
CCCTCTAATT GGAGAGAAAC TCCTAGTGCA AGATTTCTTC CTGTTGCCTG CCCAGCCCAT 900 
GATATCTGTA AAGTGCTCTT CATTTCACTT TAAATCTCAC TGTGTACTGC TGGG AGATGC 960 
AGCTCATGCT ATAGTGCCGT TTTTTGGGCA AGGAATGAAT GCGGGCTTTG AAGACTGCTT 1020 
GGTATTTGAT GAGTTAATGG ATAAATTCAG TAACGACCTT AGTTTGTGTC TTCCTGTGTT 1080 
CTCAAGATTG AGAATCCCAG ATGATCACGC GATTTCAGAC CTATCCATGT ACAATTACAT 1 140 
AGAGATGCGA GCACATGTCA ACTCAAGCTG GTTCATTTTT CAGAAG AACA TGGAGAGATT 1200 
TCTTCATGCG ATTATGCCAT CGACCTTTAT CCCTCTCTAT ACAATGGTCA CTTTTTCCAG 1260 
AATAAGATAC CATGAGGCTG TGCAGCGTTG GCATTGGCAA AAAAAGGTGA TAAACAAAGG 1320 
ACTCTTTTTC TTGGGATCAC TGATAGCCAT CAGCAGTACC TACCTACTTA TACACTACAT 1380 
GTCACCACGA TCTTTCCTCT GCTTGAGAAG ACCATGGAAC TGGATAGCTC ACTTCCGGAA 1440 
TACAACATGT TTCCCCGCAA AGGCCGTGGA CTCOCTAGAA CAAATTTCCA ATCTCATTAG 1 500 
CAGGTGATAG AAAGGTTTTG TGGTAGCAAA TGCATGATTT CTCTGTGACC AAAATTAAGC 1560 
ATG AAA AAAA TGTTTCCATT GCCATATTTG ATTCACTAGT GG AAGATAGT GTTCTGCTTA 1 620 
TAATTAAACT GAATGTAGAG TATCTCTGTA TGTTAATTGC AATTACTGGT TGGGGGGTGC 1680 
ATTTTAAAAG ATGAAACATG CAGCTTCCCT ACATTACACA CACTCAGGTT GAGTCATTCT 1740 
AACTATAAAA GTGCAATG AC TAAGATCCTT CACTTCTCTG AAAGTAAGGC CCTAG ATGCC 1 800 
TCAGGG AAGA CAGTAATCAT GCCTTTTCTT TAAAAGACAC AATAGGACTC GCAACAGCAT 1 860 
TGACTCAACA CCTAGGACTA AAAATCACAA CTTAACTAGC ATGTTAACTG CACTTTrCAT 1920 
TACGTGAATG GAACTTACCT AACCACAGGG CTCAGACTTA CTAGATAAAA CCAGAAATGG 1980 
AAATAAGGAA TTCAGGGGAG TTCCAGAGAC TTACAAAATG AACTCATTTT ATTTTCCCAC 2040 
CTTCAAATAT AAGTATTATC ATCT ATCTGT TTATCGTCTA TCTATCTATC ATCTATCTAT 2100 
CTATCTATCA TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTATCTATC 2160 
TCTATTTATT TATGTATTTA G AGATCAGGT CTCACTCTGT TG ACCAGGCT GGAGTGCAGT 2220 
GGTGAG ATCT GGGTTCACTG CAACCTCTGC CTCCTGGGCT CAAGCAATCC TCCCACTTCA 2280 
GCCTCCCAAA TAGCTGGGGC TACCATGGTA TTTTTCAGTA GAGACCGGGT CTTGCCATGC 2340 
TGCCCAGGCC AGTCTCAAAC TCCTGGCCTC ATGTGATCTG CCCACCTCAG CCTCCCAAAG 2400 
TACAGGGATT AGAGTTGTGA GCCACCGCTG CCAGCCCAGA GTTACCCTCT AAAGATAAGA 2460 
AAAAGGCTAT TAATATCATA CTAAGTG AAG GACAGGAAAG GGTTTTATTC ATAAATTAAA 2520 
TGTCTACATG TGCCAGAATG GAAAGGAAAC AAGGGGAGAC AACTTTTATA GAAATACAAA 2580 
GCCATTACTT TATTCAATTT CAGACCCTCA GAAGCAATTT ACTAATTTAT TCTTCGACTA 2640 
CATACTGCAG CAGAACCAGC AATACACTTG ATTTTTAAAA GCACATTTAG TGAAATGTTT 2700 
TCTTTGGTTC ATCCTTCTTT AACAGGCTGC TGAGTCACTC AGAAATCCTT CAAACATGAT 2760 
TAATTATG AA GATG AAACAC TAG AGTCATA TAAGAAATAA AAATTGGGCA ATAAA ATAAA 2820 
ATGATTCAGT GTTTCTTTTC TATATTGTCA ATGAAAACCT TGAGTTCTAA TAATCCATGT 2880 
TCAGTTTGTA GGGAAAGAAA AAATAATTTT TCCTTCTACC CACTTTAGGT TCCTTGGCTG 2940 
GGGCCCCTAT AACAAAAGAC AGATTGACAA GAGAAAAACA AACATAAATT TATTAGCGGG 3000 
TATATGTAAT ATATATGTGG GAAATACAGG GGAATGAGCA AATCTCAAAG AGCTGGCGTC 3060 
TTAGAACTCC CTGGCTTATA TAGCATCGAC AAAGAACAGT AAATTTTTAG AGAAACAACA 3120 
AAACAAAGAA AAAGAGCTTT GAGTCTGTAG GGGCAGCAAT TTGGGGGAAG CAAATATATG 3180 
GG AGTTTGCC TTGTAG ATTC CTCTGGTGGT GGTCTCCAGG CTGACAAGGA TTCAAAGTTG 3240 
TCTCTG AAAC TCCTCTTTGT CATACTGCAC ATATAAAACG TCTTTTGTTT CCAACAAGAG 3300 
GATTTCTTTT TCATTCTAGA ATTATCTCCT TGATAACTTG ATCAGATATA GGACATGACA 3360 
CTG AATAGAG TCCAACAGTA CAAAAAAAAT TCAGTATGTT CTAGCTACTT CACACATGTG 3420 
TACGCGACAG TTATTTTTAC AGTAAGGTAT TTTCGAGAAA AATGCATTAC GTGTTTTGGA 3480 
AAATAGAGTA ATTTAAAA AA TATATTTGAA ATGAAAATCT CCAACACATT AG AAGATG AT 3540 
G ATGTTAG AT GCCC ATCG TG TGCCACAAGT GGTTTTTTCA TTATGTAAAG CACCCGTTGA 3600 
ATTAAAAGAA TTTGTTTTTG TTCAACCTCT TCCTGAGGCC C AAG AG CAT A TGGGCAATTC 3660 
GGATTTCCTG CTGGACCACA AGGTTCTGTT GATATTACAT AGAAACGGGT ATTCCAGACA 3720 
CTTCTTATGA TG AAAGTCCA A AAGTGGCAT CCAATTTAAG GCCCCATCTT TCGTTGCCAT 3780 
TCTTCATTCC TACAAAGGAC GAACTTGG AT TACATCAACT TTGGACCCAT TGGTTTTGTC 3840 
GCTGTCGTCA ACTGA CAOTG ATTCACCACT GGTGATGATA AAAATGATGG AAG AAGAGTT 3900 
GAAAGTCACT TTTTTCTTTG GCCTGTCCCC ATCTTTCTGT GACATCACAA TGGGTCTGAT 3960 
CTGCATTTCA CTTCCAGCTG CTGGTAGGTC TTTAGCAGGC CTCTGGCACC TCAGCAGTCG 4020 
GAGGCACAGA AGCTGCAAAA GGGATCTTCG AAACTGGGCA GAGAAAAAAT AAAGTGG AAT 4080 
ATTAAGTAAA AGTTGGGCAC TAATCTGGAT TAACATTCGA GGAAATCAGT TGAGCTGATT 4140
TAAGTTGTTT TTTGTTTGTT AGCAGGTGTG GATGTGGGGT TATGTGGTCA TGCTCAGATC 4200 
TACClTAAATC ACCCCAGAGC TTTATGTCTT TTATTCATTC TAAATCTTAT TAACCGGAAT 4260 
ATGTAGG ACC ATTTCAATAC CTTGTAATCC TCCAAGCTTC AATCTGCACA CACTTTCTAT 4320 
GAGGGCAGGT ACAACTATTA AGAGATTTTG AACATTAAGT TAGTCCACA A ATATTCAGTG 4380 
GGCATCTACT AGGTG ACAGC CACTGTGCTA TAATTAGAGA CTTTTTACTA TAAGCATCAA 4440 
AAACAGATAA GGCTCTTCCT GGCAGAGTTT ACAGCCTGGT GTACTTGCTA ATGTCTCTTT 4500 
AATTAGGTGA AGAATTTTTT TTTTCTATCG A AATTACTAA TCAGTTGGGG AAAAAAATAC 4560 
TATAGCAGAC AGCACTAATG TCATCAACAA ACATTGTTCT TCTCCGTGTC CTGGGTACAA 4620 
CATCG AATAA TATTTCTTGG CCTCCTTTCC GCTTCTCCTC TCTGCTGTTC CTCTCTACAA 4680 
GAACCTGGGA GGCCAACGCC TAAAGATCAT AATATCACAA TGGAAGG AAC CTAGATTCCT 4740 
AAATG ACTGC ATAGGAGAGA TCCCATCTCC TCCACCCAAT ACATTATTAG ACTGAACTGT 4800 
GACCTGAAAT GAGCAATAAA CTCTGTATTA ATTCACTGAA ATGTTGGGGT TGCTTGTTAT 4860 
AGTAGTCGGT CCATCATGAC CAGTAAA ACA TAAATCAAAA GTTAATGTAA TTGTTATCCC 4920 
ATTATTTAGA GCGAA ATAAA TGTTG AATAT ATGGACTTTC TCAGATTAGG AAATACCAAT 4980 
TAAAAATATA ATAAATAGCT 

Seq ID NO: 108 Protein sequence:
Protein Accession #: NP_003670.1

1 11 21 31 41 51 
I I I I I I 

MDSSVIQRKK VAVIGGGLVG SLQACFLAKR NFQIDVYEAR EDTRVATFTR GRSTNLALSH 60 
RGRQALKAVG LEDQIVSQGI PMRARMIHSL SGKKSAIPYG TKSQYILSVS RENLNKDLLT 120 
AAEKYPNVKM HPNHRLLKCN PEEGMITVLG SDKVPKDVTC DLtVGCDGAY STVRSHLMKK 180 
PRFDYSQQYI PHGYMELTIP PKNGDYAMEP NYLHIWPRNT FMM1ALPNMN KSFTCTLFMP 240 
FEEFEKLLTS NDWDFFQKY FPDAIPLIGE KLLVQDFFLL PAQPMISVKC SSFHFKSHCV 300 
LLGDAAHAIV PFFGQGMNAG FEDCLVFDEL MDKFSNDLSL CLPVFSRUU PDDHAISDLS 360 
MYNYIEMRAH VNSSWFIFQK NMERFLHAIM PSTFIPLYTM VTFSRIRYHE AVQRWHWQKK 420 
VINKGLFFLG SLIAISSTYL LIHYMSPRSF LCLRRPWNWI AHFRNTTCFP AKAVDSLEQI 480
SNLISR

Seq ID NO: 109 DNA sequence 

Nucleic Acid Accession #: NM_006115.1

Coding sequence: 236-1765 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GCTTCAGGGT ACAGCTCCCC CGCAGCCAGA AGCCGGGCCT GCAGCCCCTC AGCACCGCTC 60 
CGGGACACCC CACCCGCTTC CCAGGCGTGA CCTGTCAACA GCAACTTCGC GGTGTGGTGA 120 
ACTCTCTGAG GAAAAACCAT TTTGATTATT ACTCTCAGAC GTGCGTGGCA ACAAGTGACT 180
GAGACCTAGA AATCCAAGCG TTGGAGGTCC TGAGGCCAGC CTAAGTCGCT TCAAAATGGA 240
ACGAAGGCGT TTGTGGGGTT CCATTCAG AG CCGATACATC AGCATGAGTG TGTGGACAAG 300 
CCCACGGAGA CTTGTGGAGC TGGCAGGGCA GAGCCTGCTG AAGGATGAGG CC CTGGC CAT 360 
TGCCGCCCTG GAGTTGCTGC CCAGGGAGCT CTTCCCGCCA CTCTTCATGG CAGCCTTTG A 420 
CGGGAGACAC AGCCAGACCC TGAAGOCAAT GGTGCAGGCC TGGCCCITCA CCTGCCTCCC 480 
TCTGGGAGTG CTGATGAAGG GACAACATCT TCACCTGGAG ACCTTCAAAG CTGTGCTTGA 540 
TGGACTTGAT GTGCTCCTTG CCCAGG AGGT TCGCCCCAGG AGGTGGAAAC TTCAAGTGCT 600 
GGATTTACGG AAGAACTCTC ATCAGG ACTT CTGGACTGTA TGGTCTGGAA ACAGGGCCAG 660 
TCTGTACTCA TTTCCAGAGC CAGAAGCAGC TCAGCCCATG ACAAAGAAGC GAAAAGTAQA 720 
TGGTTTGAGC ACAGAGGCAG AGCAGCCCTT CATTCCAGTA GAGGTGCTCG TAGACCTGTT 780 
CCTCAAGGAA GGTGCCTGTG ATGAATTGTT CTCCTACCTC ATTGAGAAAG TGAAGCGAAA 840 
GAAAAATGTA CTACGCCTGT GCTGTAAGAA GCTGAAGATT TTTGCAATGC CCATGCAGGA 900 
TATCAAGATG ATCCTGAAAA TGGTGCAGCT GGACTCTATT GAAGATTTGG AAGTG ACTTG 960 
TACCTGG AAG CTACCCACCT TGGCGAAATT TTCTCCTTAC CTGGGCCAGA TG ATTAATCT 1020 
GCGTAGACTC CTCCTCTCCC ACATCCATGC ATCTTCCTAC ATTTCCCCGG AGAAGGAAGA 1080 
GCAGTATATC GCCCAGTTCA CCTCTCAGTT CCTCAGTCTG CAGTGCCTGC AGGCTCTCTA 1 140 
TGTGGACTCT TTATTTTTCC TTAGAGGCCG CCTGGATCAG TTGCTCAGGC ACGTGATGAA 1200 
CCCCTTGGAA ACCCTCTCAA TAACTAACTG CCGGCTTTCG GAAGGGGATG TG ATGCATCT 1260 
GTCCCAGAGT CCCAGCGTCA GTCAGCTAAG TGTCCTGAGT CTAAGTGGGG TCATGCTGAC 1320 
CGATGTAAGT CCCGAGCCCC TCCAAGCTCT GCTGGAGAGA GCCTCTGCCA CCCTCCAGGA 1 380 
CCTGGTCTTT GATGAGTGTG GGATCACGGA TGATCAGCTC CTTGCCCTCC TGCCTTCCCT 1440 
GAGCCACTGC TCCCAGCTTA CAACCTTAAG CTTCTACGGG AATTCCATCT CCATATCTGC 1500 
CTTGCAGAGT CTCCTGCAGC ACCTCATCGG GCTGAGCAAT CTGACCCACG TGCTGTATCC 1560 
TGTCCCCCTG GAGAGTTATG AGGACATCCA TGGTACCCTC CACCTGGAGA GGCTTGCCTA 1620 
TCTGCATGCC AGGCTCAGGG AGTTGCTGTG TGAGTTGGGG CGGCCCAGCA TGGTCTGGCT 1680 
TAGTGCCAAC CCCTGTCCTC ACTGTGGGGA CAGAACCTTC TATGACCCGG AGCCCATCCT 1740 
GTGCCCCTGT TTCATGCCTA ACTAGCTGGG TGCACATATC AAATGCTTCA TTCTGCATAC 1 800 
TTGGACACTA AAGCCAGGAT GTGCATGCAT CTTGAAGCAA CAAAGCAGCC ACAGTTTCAG 1860 
ACAAATGTTC AGTGTGAGTG AGGAAAACAT GTTCAGTGAG GAAAAAACAT TCAGACAAAT 1920 
GTTCAGTGAG GAAAAAAAGG GGAAGTTGGG GATAGGCAGA TGTTG ACTTG AGGAGTTAAT 1980 
GTG ATCTTTG GGGAG ATACA TCTTATAG AG TTAGAAATAG AATCTG AATT TCTAAAGGGA 2040 
GATTCTGGCT TGGGAAGTAC ATGTAGGAGT TAATCCCTGT GTAGACTGTT GTAAAGAAAC 2100 
TGTTG AAAAT AAAGAGAAGC AATOTGAAGC AAAAAAAAAA AAAAAAAA 

Seq ID NO: 110 Protein sequence:
Protein Accession #: NP_006106.1 
1 11 21 31 41 51
I I I I I I 

MERRRLWGSI QSRYISMSVW TSPRRLVELA GQSLLKDEAL AIAALELLPR ELFPPLFMAA 60 
FDGRHSQTLK AMVQAWPFTC LPLGVLMKGQ HLHLETFKAV LDGLDVLLAQ EVRPRRWKLQ 120
VLDLRKNSHQ DFWTVWSGNR ASLYSFPEPE AAQPMTKKRK VDGLSTEAEQ PFIPVEVLVD 180
LFLKEGACDE LFSYLIEKVK RKKNVLRLCC KKLKIFAMPM QDIKMILKMV QLDSIEDLEV 240
TCTWKLPTLA KFSPYLGQMI NLRRLLLSHI HASSYISPEK EEQYIAQFTS QFLSLQCLQA 300
LYVDSLFFLR GRLDQLLRHV MNPLETLSIT NCRLSEGDVM HLSQSPSVSQ LSVLSLSGVM 360 
LTDVSPEPLQ ALLERASATL QDLVFDECGI TDDQLLALLP SLSHCSQLTT LSFYGNSISI 420 
SALQSLLQHL IGLSNLTHVL YPVPLESYED IHGTLHLERL AYLHARLREL LCELGRPSMV 480 
WLSANPCPHC GDRTFYDPEP ILCPCFMPN 

Seq ID NO: 111 DNA sequence

Nucleic Acid Accession #: NM_003815

Coding sequence: 8-2452 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CGCTGCCATG CGGCTGGCGC TGCTCTGGGC CCTGGGGCTC CTGGGCGCGG GCAGCCCTCT 60 
GCCTTCCTGG CCGCTCCCAA ATATAGGTGG CACTGAGGAG CAGCAGGCAG AGTCAGAGAA 120 
GGCCCCG AGG G AGCCCTTGG AGCCCCAGGT CCTTCAGGAC G ATCTCCCAA TTAGCCTCAA 1 80 
AAAGGTGCTT CAGACCAGTC TGCCTGAGCC CCTGAGGATC AAGTTGGAGC TGGACGGTGA 240 
CAGTCATATC CTGGAGCTGC TACAGAATAG GGAGTTGGTC CCAGGCCGCC CAACCCTGGT 300 
GTGGTACCAG CCCG ATGGCA CTCGGGTGGT CAGTG AGGGA CACACTTTGG AGAACTGCTG 360 
CTACCAGGGA AGAGTGCGGG GATATGCAGG CTCCTGGGTG TCCATCTGCA CCTGCTCTGG 420 
GCTCAOAGGC TTGGTGGTCC TGACCCCAOA GAGAAGCTAT ACCCTGGAGC AGGGGCCTGG 480 
GGACCTTCAG GGTCCTCCCA TTATTTCGCG AATCCAAGAT CTCCACCTGC CAGGCCACAC 540 
CTGTGCCCTG AGCTGGCGGG AATCTGTACA CACTCAGACX5 CCACCAGAGC ACCCCCTGGG 600 
ACAGCGCCAC ATTCGCCGG A GGCGGGATGT GGTA ACAGAG ACCAAGACTG TGGAGTTGGT 660 
GATTGTGGCT GATCACTCGG AGGCCCAGAA ATACCGGGAC TTCCAGCACC TGCTAAACCG 720 
CACACTGGAA GTGGCCCTCT TGCTGG ACAC ATTCTTCCGG CCCCTGAATG TACGAGTGGC 780 
ACTAGTGGGC CTGGAGGCCT GGACCCAGCG TGACCTGGTG GAGATCAGCC CAAACCCAGC 840 
TGTCACCCTC GAAAACTTCC TCCACTGGCG CAGGGCACAT TTGCTGCCTC GATTGCCCCA 900 
TGACAGTGCC CAGCTGGTGA CTGGTACTTC ATTCTCTGGG CCTACGGTGG GCATGGCCAT 960 
TCAGAACTCC ATCTGTTCTC CTGACTTCTC AGGAGGTGTG AACATGGACC ACTCCACCAG 1020 
CATCCTGGGA GTCGCCTCCT CCATAGCCCA TGAGTTGGGC CACAGCCTGG GCCTGGACCA 1080 
TGATTTGCCT GGGAATAGCT OCCCCTGTCC AGGTCCAGCC CCAGCCAAGA CCTGCATCAT 1 140 
GGAGGCCTCC ACAGACTTCC TACCAGGCCT GAACTTCAGC AACTGCAGCC GACGGGOCCT 1 200 
GGAGAAAGCC CTCCTGGATG GAATGGGCAG CTGCCTCTTC GAACGGCTGC CTAGCCTACC 1260 
CCCTATGGCT GCTTTCTGCG GAAATATGTT TGTGGAGCCG GGCG AGCAGT GTG ACTGTGO 1320 
CTTCCTGG AT GACTGCGTCG ATCCCTGCTG TG ATTCTTTG ACCTGCCAGC TGAGGCCAGG 1380 
TGCACAGTGT GCATCTGACG GACCCTGTTG TCAAAATTGC CAGCTGCGCC CGTCTGGCTG 1440 
GCAGTGTCGT CCTACCAG AG GGGATTGTGA CTTGCCTGAA TTCTGCCCAG GAGACAGCTC 1 500 
CCAGTGTCCC CCTGATGTCA GCCTAGGGGA TGGCGAGCCC TGCGCTGGCG GGCAAGCTGT 1560 
GTGCATGCAC GGGCGTTGTG CCTCCTATGC CCAGCAGTGC CAGTCACTTT GGGGACCTGG 1620 
AGCCCAGCCC GCTGCGCCAC TTTGCCTCCA GACAGCTAAT ACTCGGGGAA ATGCTTTTGG 1680 
GAGCTGTGGG CGCA ACCCCA GTGGCAGTTA TGTGTCCTGC ACCCCTAGAG ATGCCATTTQ 1 740 
TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG CTGGGCTCCA TCCGGGATCT 1800 
ACTCTGGGAG ACAATAGATG TOAATGGGAC TGAGCTGAAC TGCAGCTGGG TGCACCTGGA 1860 
CCTGGGCAGT GATGTGGCCC AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG 1920 
CCTGGTGTGT ATAGACCATC GATGCCAGCG TGTGGATCTC CTGGGGGCAC AGGAATGTCG 1980 
AAGCAAATGC CATGGACATG GGGTCTGTGA CAGCAACAGG CACTGCTACT GTGAGGAGGG 2040 
CTGGGCACCC CCTGACTGCA CCACTCAGCT CAAAGCAACC AGCTCCCTGA CCACAGGGCT 2100 
GCTCCTCAGC CTCCTGGTCT TATTGGTCCT GGTG ATGCTT GGTGCCGGCT ACTGGTACCG 2160 
TGCCCGCCTG CACCAGCGAC TCTGCCAGCT CAAGGGACCC ACCTGCCAGT ACAGGGCAGC 2220 
CCAATCTGGT CCCTCTGAAC GGCCAGGACC TCCGCAGAGG GCCCTGCTGG CACGAGGCAC 2280 
TAAGTCTCAG GGGCCAGCCA AGCCCCCACC CCCAAGG AAG CCACTGCCTG CCGACCCCCA 2340 
GGGCCGGTGC CCATCGGGTG ACCTGCCCQQ CCCAGGQGCT GQAATCCCQC CCCTAGTGGT 2400 
ACCCTCCAGA CCAGCGCCAC CGCCTCCGAC AGTGTCCTCG CTCTACCT CT G AC CTCTCCQ 2460 
GAGGTTCCGC TGCCTCCAAG CCGGACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGGAG 2520 
TCCCCTACCA TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 2580 
ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG GGCTTGGGGA 2640 
GGGGCTGGGG GTTGGACGOG ATTGAGGAAG GTCCGCACAG CCTGTCTCTG CTCAGTTGCA 2700 
ATAAACGTGA CATCTTGGGA GCGTTAAAAA AAAAAAAAAA 

Seq ID NO: 112 Protein sequence:
Protein Accession #: NP_003806.2

1 11 21 31 41 51
I I I I I I

MRLALLWALG LLGAGSPLPS WPLPNIGGTE EQQAESEKAP REPLEPQVLQ DDLPISLKKV 60 
LQTSLPEPLR IKLELDGDSH ILELLQNREL VPGRPTLVWY QPDGTRVVSE GHTLENCCYQ 120
GRVRGYAGSW VSICTCSGLR GLVVLTPERS YTLEQGPGDL QGPPIISRIQ DLHLPGHTCA 180
LSWRESVHTQ TPPEHPLGQR HIRRRRDVVT ETKTVELVIV ADHSEAQKYR DFQHLLNRTL 240
EVALLLDTFF RPLNVRVALV GLEAWTQRDL VEISPNPAVT LENFLHWRRA HLLPRLPHDS 300
AQLVTGTSFS GPTVGMAIQN SICSPDFSGG VNMDHSTSIL GVASSIAHEL GHSLGLDHDL 360
PGNSCPCPGP APAKTCIMEA STDFLPGLNF SNCSRRALEK ALLDGMGSCL FERLPSLPPM 420
AAFCGNMFVE PGEQCDCGFL DDCVDPCCDS LTCQLRPGAQ CASDGPCCQN CQLRPSGWQC 480
RPTRGDCDLP EFCPGDSSQC PPDVSLGDGE PCAGGQAVCM HGRCASYAQQ CQSLWGPGAQ 540 
PAAPLCLQTA NTRGNAFGSC GRNPSGSYVS CTPRDAICGQ LQCQTGRTQP LLGSIRDLLW 600 
ETIDVNGTEL NCSWVHLDLG SDVAQPLLTL PGTACGPGLV CIDHRCQRVD LLGAQECRSK 660 
CHGHGVCDSN RHCYCEEGWA PPDCTTQLKA TSSLTTGLLL SLLVLLVLVM LGAGYWYRAR 720 
LHQRLCQLKG PTCQYRAAQS GPSERPGPPQ RALLARGTKS QGPAKPPPPR KPLPADPQGR 780 
CPSGDLPGPG AGIPPLVVPS RPAPPPPTVS SLYL 

Seq ID NO: 113 DNA sequence 

Nucleic Acid Accession #: NM_002416

Coding sequence: 40-417 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

I I I I I I

ATCCAATACA GGAGTGACTT GGAACTCCAT TCTATCACTA TGAAGAAAAG TGGTGTTCTT 60

TTCCTCTTGG GCATCATCTT GCTGGTTCTG ATTGGAGTGC AAGGAACCCC AGTAGTGAGA 120 

AAGGGTCGCT GTTCCTGCAT CAGCACCAAC CAAGGG ACTA TCCACCTACA ATCCTTGAAA 1 80 

GACCTTAAAC AATTTGCCCC AAGCCCTTCC TGCG AGAAAA TTG AAATCAT TGCTACACTG 240 

AAGAATGGAG TTCAAACATG TCTAAACCCA GATTCAGCAG ATGTGAAGGA ACTGATTAAA 300 

AAGTGGG AGA AACAGGTCAG CCAAAAGAAA AAGCAAAAGA ATGGGAAAAA ACATCAAAAA 360 

AAGAAAGTTC TGAAAGTTCG AAAATCTCAA CGTTCTCGTC AAAAGAAGAC TACATAAGAG 420 

ACCACTTCAC CAATAAGTAT TCTGTGTTAA AAATGTTCTA TTTTAATTAT ACCGCTATCA 480 

TTCCAAAGG A GGATGGCATA TAATACAAAG GCTTATTAAT TTG ACT AG AA AATTTAAAAC 540 

ATTACTCTGA AATTGTAACT AAAGTTAGAA AGTTGATTTT AAGAATCCAA ACGTTAAGAA 600 

TTGTTAAAGG CTATGATTGT CTTTGTTCTT CTACCACCCA CCAGTTGAAT TTCATCATGC 660 

TTAAGGCCAT GATTTTAGCA ATACCCATGT CTACACAGAT GTTCACCCAA CCACATCCCA 720 

CTCACAACAG CTGCCTGGAA GAGCAGCCCT AGGCTTCCAC GTACTGCAGC CTCCAGAG AG 780 

TATCTGAGGC ACATGTCAGC AAGTCCTAAG CCTGTTAGCA TGCTGGTGAG CCAAGCAGTT 840 

TGAAATTGAG CTGGACCTCA CCAAGCTGCT GTGGCCATCA ACCTCTGTAT TTGAATCAGC 900 

CTACAGGCCT CACACACAAT GTGTCTG AGA GATTCATGCT GATTGTTATT GGGTATCACC 960 

ACTGGAGATC ACCAGTGTGT GGCTTTCAGA GCCTCCTTTC TGGCTTTGGA AGCCATGTGA 1020 

TTCCATCTTG CCCGCTCAGG CTGACCACTT TATTTCTTTT TGTTCCCCTT TGCTTCATTC 1080 

AAGTCAGCTC TTCTCCATCC TAOCAC AATG CAGTGCCTTT CTTCTCTCCA GTGCACCTGT 1 140 

CATATGCTCT GATTTATCTG AGTCAACTCC TTTCTCATCT TGTCCCCAAC ACCCCACAGA 1200 

AGTGCTTTCT TCTCCCAATT CATCCTCACT CAGTCCAGCT TAGTTCAAGT CCTGCCTCTT 1260 

AAATAAACCT TTTTGGACAC ACAAATTATC TTAAAACTCC TGTTTCACTT GGTTCAGTAC 1320 

CACATGGGTG AACACTCAAT GGTTAACTAA TTCTTGGGTG TTTATCCTAT CTCTCCAACC 1380 

AGATTGTCAG CTCCTTGAGG GCAAGAGCCA CAGTATATTT CCCTGTTTCT TCCACAGTGC 1440 

CTAATAATAC TGTGGAACTA GGTTTTAATA ATTTTTTAAT TGATGTTGTT ATGGGCAGGA 1500 

TGGCAACCAG ACCATTGTCT CAGAGCAGGT GCTGGCTCTT TCCTGGCTAC TCCATGTTGG 1560 

CTAGCCTCTG GTAACCTCTT ACTTATTATC TTCAGG ACAC TCACTACAGG GACCAGGGAT 1 620 

GATGCAACAT CCTTGTCTTT TTATGACAGG ATGTTTGCTC AGCTTCTCCA ACAATAAGAA 1 680 

GCACGTGGTA AAACACTTGC GGATATTCTG GACTGTTTTT AAAAAATATA CAGTTTACCG 1740 

AAAATCATAT AATCTTACA A TG AAAAGG AC TTTATAGATC AGCCAGTGAC CA ACCTTTTC 1 800 

CCAACCATAC AAAAATTCCT TTTCCCGAAG GAAAAGGGCT TTCTCAATAA GCCTCAGCTT I860 

TCTAAGATCT AACAAGATAG CCACCGAGAT CCTTATCGAA ACTCATTTTA GGCAAATATG 1920 

AGTTTTATTG TCCGTTTACT TGTTTCAG AG TTTGTATTGT G ATTATCAAT TACCACACCA 1 980 

TCTCCCATGA AG AAAGGGAA CGGTGAAGTA CTAAGCGCTA GAGGAAGCAG CCAAGTCGGT 2040 

TAGTGGAAGC ATGATTGGTG CCCAGTTAGC CTCTGCAGGA TGTGGAAACC TCCTTCCAGG 2100 

GGAGGTTCAG TGAATTGTGT AGGAGAGGTT GTCTGTGGCC AGAATTTAAA CCTATACTCA 2160 

CTTTCCCAAA TTG AATCACT GCTCACACTG CTG ATGATTT AGAGTGCTGT CCGGTGGAGA 2220 

TCCCACCCGA ACGTCTTATC TAATCATGAA ACTCCCTAGT TCCTTCATGT AACTTCCCTG 2280 

AAAAATCTAA GTGTTTCATA AATTTGAGAG TCTOTGACCC ACTTACCTTG CATCTCACAG 2340 

GTAG ACAGTA TATAACTAAC AACCAA AGAC TACATATTGT CACTGACACA CACGTTATAA 2400 

TCATTTATCA TATATATACA TACATGCATA CACTCTCAAA GCAAATAATT TTTCACTTCA 2460 

AAACAGTATT GACTTGTATA CCTTGTAATT TGAAATATTT TCTTTGTTAA AATAGAATGG 2520 

TATCAATAAA TAGACCATTA ATCAG 

Seq ID NO: 114 Protein sequence:
Protein Accession #: NP_002407

1 11 21 31 41 51 

MKKSGVLFLL GIILLVLIGV QGTPVVRKGR CSCISTNQGT IHLQSLKDLK QFAPSPSCEK 60
IEIIATLKNG VQTCLNPDSA DVKELIKKWE KQVSQKKKQK NGKKHQKKKV LKVRKSQRSR 120
QKKTT 

Seq ID NO: 115 DNA sequence 

Nucleic Acid Accession #: NM_003238.1 

Coding sequence: 182-1426 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51 

CAAGCAGGAT ACGTTTTTCT GTTGGGCATT GACTAGATTG TTTGCAAAAG TTTCGCATCA 60 
AAAACAAACA ACAACAACAA AAAACCAAAC AACTCTCCTT G ATCTATACT TTG AG AATTG 120 
TTGATTTCTT TTTTTTTATT CTOACTTTTA AAAACAACTT TTTTTTCCAC TTTTTTAAAA 180
AATGCACTAC TGTGTGCTGA GCGCTTTTCT GATCCTGCAT CTGGTCACGG TCGCGCTCAG 240
CCTGTCTACC TGCAGCACAC TCGATATGGA CCAGTTCATG CGCAAGAGGA TCGAGGCGAT 300 
CCGCGGGCAG ATCCTGAGCA AGCTGAAGCT CACCAGTCCC CCAGAAGACT ATCCTGAGCC 360 
CG AGGAAGTC CCCCCGGAGG TGATTTCCAT CTACAACAGC ACCAGGGACT TGCTCCAGGA 420 
GAAGGCGAGC CGGAGGGCGG CCGCCTGCGA GCGCGAGAGG AGCGACGAAG AGTACTACGC 480 
CAAGGAGGTT TACAAAATAG ACATGCCGCC CTTCTTCCOC TCCGAAAATG CCATCCCGCC 540 
CACTTTCTAC AGACCCTACT TCAGAATTGT TCG ATTTG AC GTCTCAGCAA TGGAG AAGAA 600 
TGCTTCCAAT TTGGTGAAAG CAGAGTTCAG AGTCTTTCGT TTGCAGAACC CAAAAGCCAG 660 
AGTGCCTGAA CAACGGATTG AGCTATATCA GATTCTCAAG TCCAAAGATT TAACATCTCC 720 
AACCCAGCGC TACATCGACA GCAAAGTTGT GAAAACAAGA GCAGAAGGCG AATGGCTCTC 780 
CTTCGATGTA ACTGATGCTG TTCATGAATG GCTTCACCAT AAAGACAGGA ACCTGGGATT 840 
TAAAATAAGC TTACACTGTC CCTGCTGCAC TTTTGTACCA TCTAATAATT ACATCATCCC 900 
AAATAAAAGT GAAGAACTAG AAGCAAGATT TGCAGGTATT GATGGCACCT CCACATATAC 960 
CAGTGGTGAT CAGAAAACTA TAAAGTCCAC TAGGAAAAAA AACAGTGGGA AGACCCCACA 1020 
TCTCCTGCTA ATGTTATTGC CCTCCTACAG ACTTGAGTCA CAACAGACCA ACCGGCGGAA 1080 
GAAGCGTGCT TTGGATGCGG CCTATTGCTT TAGAAATGTG CAGGATAATT GCTGCCTACG 1 140 
TCCACTTTAC ATTGATTTCA AGAGGGATCT AGGGTGGAAA TGGATACACG AACCCAAAGG 1200 
GTACAATGCC AACTTCTGTG CTGGAGCATG CCXXiTATTTA TGGAGTTCAG ACACTCAGCA 1260 
CAGCAGGGTC CTGAGCTTAT ATAATACCAT AAATCCAGAA GCATCTGCTT CTCCTTGCTG 1320 
GGTGTCCCAA GATTTAGAAC CTCTAACCAT TCTCTACTAC ATTGGCAAAA CACCCAAGAT 1380 
TGAACAGCTT TCTAATATGA TTGTAAAGTC TTGCAAATGC AGCTAAAATT CTTGGAAAAG 1440
TGGCAAGACC AAAATGACAA TGATGATGAT AATGATGATG ACGACGACAA CGATGATGCT 1500 
TGTAACAAG A AAACATAAG A GAGCCTTGGT TCATCAGTGT TAAAAAATTT TTGAAAAGGC 1 560 
GGTACTAGTT CAGACACTTT GGAAGTTTGT GTTCTGTTTG TTAAAACTGG CATCTQACAC 1620 
AAAAAAAGTT G AAGGCCTTA TTCTACATTT CACCTACTTT GTAAGTGAGA GAGACAAGAA 1680 
GCAAATTTTT TTAAA 

Seq ID NO: 116 Protein sequence:
Protein Accession #: NP_003229.1

1 11 21 31 41 51 
I I I I I I 

MHYCVLSAFL ILHLVTVALS LSTCSTLDMD QFMRKRIEAI RGQILSKLKL TSPPEDYPEP 60 
EEVPPEVISI YNSTRDLLQE KASRRAAACE RERSDEEYYA KEVYKIDMPP FFPSENAIPP 120 
TFYRPYFRIV RFDVSAMEKN ASNLVKAEFR VFRLQNPKAR VPEQRIELYQ ILKSKDLTSP 180
TQRYIDSKVV KTRAEGEWLS FDVTDAVHEW LHHKDRNLGF KISLHCPCCT FVPSNNYIIP 240
NKSEELEARF AGIDGTSTYT SGDQKTIKST RKKNSGKTPH LLLMLLPSYR LESQQTNRRK 300 
KRALDAAYCF RNVQDNCCLR PLYIDFKRDL GWKWIHEPKG YNANFCAGAC PYLWSSDTQH 360 
SRVLSLYNTI NPEASASPCC VSQDLEPLTI LYYIGKTPKI EQLSNMIVKS CKCS 

Seq ID NO: 117 DNA sequence

Nucleic Acid Accession #: NM_000095.1

Coding sequence: 26-2299 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CAGCACCCAG CTCCCCGCCA CCGCCATGGT CCCCGACACC GCCTGCGTTC TTCTGCTCAC 60 
CCTGGCTGCC CTCGGCGCGT CCGG ACAGGG CCAGAGCCCG TTGGGCTCAG ACCTGGGCCC 120 
GCAGATGCTT CGGGAACTGC AGGAAACCAA CGCGGCGCTG CAGGACGTGC GGGACTGGCT 180 
GCGGCAGCAG GTCAGGGAGA TCACGTTCCT GAAAAACACG GTGATGGAGT GTGACGCGTG 240 
CGGGATGCAG CAGTCAGTAC GCACCGGCCT ACCCAGCGTG CGGCCCCTGC TCCACTGCGC 300 
GCCCGGCTTC TGCTTCCCCG GCGTGGCCTG CATCCAGACG GAGAGCGGCG GCCGCTGCGG 360 
CCCCTGCCCC GCGGGCTTCA CGGGCAACGG CTCGCACTGC ACCGACGTCA ACGAGTGCAA 420 
CGCCCACCCC TGCTTCCCCC GAGTCCGCTG TATCAACACC AGCCCGGGGT TCCGCTGCGA 480 
GGCTTGCCCG CCGGGGTACA GCGGCCCCAC CCACCAGGGC GTGGGGCTGG CTTTCGCCAA 540 
GOCCAACAAG CAGGTTTGCA CGGACATCAA CGAGTGTGAG ACCGGGCAAC ATAACTGCGT 600 
CCCCAACTCC GTGTGCATCA ACACCCGGGG CTCCTTCCAG TGCGGCCCGT GCCAGCCCGG 660 
CTTCGTGGGC G ACCAGGCGT CCGGCTGCCA GCGCGGCGCA CAGOGCTTCT GCCCCGACGG 720 
CTCGCCCAGC GAGTGCCACG AGCATGCAGA CTGCGTCCTA GAGCGCG ATG GCTCGOGGTC 780 
GTGCGTGTGT CGCGTTGGCT GGGCCGGCAA CGGGATCCTC TGTGGTCGCG ACACTGACCT 840 
AGACGGCTTC CCGGACGAGA AGCTGCGCTG CCCGGAGCCG CAGTGCCGTA AGGACAACTG 900 
CGTGACTGTG CCCAACTCAG GGCAGGAGGA TGTGGACCGC GATGGCATCG GAGACGOCTG 960 
CGATCCGGAT GCCGACGGGG ACGGGGTCCC CAATGAAAAG GACAACTGCC CGCTGGTGCG 1020 
G AACCCAGAC CAGCGCAACA CGGACGAGGA CAAGTGGGGC GATGCGTGCG ACAACTGCCG 10B0 
GTCCCAGAAG AACGACGACC AAAAGGACAC AGACCAGGAC GGCCGGGGCG ATGCGTGCGA 1140 
CGACGACATC GACGGCGACC GGATCCGCAA CCAGGCCGAC AACTGCCCTA GGGTACCCAA 1200 
CTCAGACCAG AAGGACAGTG ATGGCGATGG TATAGGGG AT GCCTGTGACA ACTGTCCCCA 1260 
GAAGAGCAAC CCGGATCAGG CGGATGTGGA CCACGACTTT GTGGGAGATG CTTGTGACAG 1320 
CGATCAAGAC CAGGATGGAG ACGGACATCA GGACTCTCGG GACAACTGTC CCACGGTGCC 1380 
TAACAGTGCC CAGGAGGACT CAGACCACGA TGGCCAGGGT GATGCCTGCG ACGACGACGA 1440 
CGACAATGAC GGAGTCCCTG ACAGTCGGGA CAACTGCCGC CTOOTGCCTA ACCCCGGCCA 1500 
GGAGGACGCG G ACAGGG ACG GCGTGGGOGA OGTGTGCCAG GACGACTTTG ATGCAGACAA 1560 
GGTGGTAGAC AAGATCGACG TGTGTCCGGA GAACGCTGAA GTCACGCTCA CCGACTTCAG 1620 
GGCCTTCCAG ACAGTCGTGC TGGACCCGGA GGGTGACGCG CAGATTGACC CCAACTGGGT 1680 
GGTGCTCAAC CAGGGAAGGG AGATCGTGCA GACAATGAAC AGCGACCCAG GCCTGGCTGT 1740 
GGGTTACACT GCCTTCAATG GCGTGGACTT CGAGGGCACG TTCCATGTGA ACACGGTCAC 1800 
GGATGACGAC TATGCGGGCT TCATCTTTGG CTACCAGGAC AGCTCCAGCT TCTACGTGGT 1860
CATGTGGAAG CAGATGGAGC AAACGTATTG GCAGGCGAAC CCCTTCCGTG CTGTGGCCGA 1920 
GCCTGGCATC CAACTCAAGG CTGTGAAGTC TTCCACAGGC CCCGGGGAAC AGCTGCGGAA 1980 
CGCTCTGTGG CATACAOGAG ACACAGAGTC CCAGGTGCGG CTGCTGTGGA AGGACCCGCG 2040 
AAACGTGGGT TGGAAGGACA AGAAGTCCTA TCGTTGGTTC CTGCAGCACC GGCCCCAAGT 2100 
GGGCTACATC AGGGTGCGAT TCTATGAGGG CCCTGAGCTG GTGGCCGACA GCAACGTGGT 2160 
CTTGG ACACA ACCATGCGGG GTGGCCGCCT GGGGGTCTTC TGCTTCTOCC AGGAG AACAT 2220 
CATCTGGGCC AACCTGCGTT ACCGCTGCAA TGACACCATC CCAGAGGACT ATGAGACCCA 2280 
TCAGCTGCGG CAAGCCTAGG GACCAGGGTG AGGACCCGCC GG ATG ACAGC CACCCTCACC 2340 
GCGGCTGGAT GGGGGCTCTG CACCCAGCCC AAGGGGTGGC CGTCCTGAGG GGGAAGTGAG 2400 
AAGGGCTCAG AGAGGACAAA ATAAAGTGTG TGTGCAGGG 

Seq ID NO: 118 Protein sequence:
Protein Accession #: NP_000086.1

1 11 21 31 41 51
I I I I I I 

MVPDTACVLL LTLAALGASG QGQSPLGSDL GPQMLRELQE TNAALQDVRD WLRQQVREIT 60 
FLKNTVMECD ACGMQQSVRT GLPSVRPLLH CAPGFCFPGV ACIQTESGGR CGPCPAGFTG 120 
NGSHCTDVNE CNAHPCFPRV RCINTSPGFR CEACPPGYSG PTHQGVGLAF AKANKQVCTD 180
INECETGQHN CVPNSVCINT RGSFQCGPCQ PGFVGDQASG CQRGAQRFCP DGSPSECHEH 240
ADCVLERDGS RSCVCRVGWA GNGILCGRDT DLDGFPDEKL RCPEPQCRKD NCVTVPNSGQ 300
EDVDRDGIGD ACDPDADGDG VPNEKDNCPL VRNPDQRNTD EDKWGDACDN CRSQKNDDQK 360
DTDQDGRGDA CDDDIDGDRI RNQADNCPRV PNSDQKDSDG DGIGDACDNC PQKSNPDQAD 420
VDHDFVGDAC DSDQDQDGDG HQDSRDNCPT VPNSAQEDSD HDGQGDACDD DDDNDGVPDS 480
RDNCRLVPNP GQEDADRDGV GDVCQDDFDA DKVVDKIDVC PENAEVTLTD FRAFQTVVLD 540
PEGDAQIDPN WVVLNQGREI VQTMNSDPGL AVGYTAFNGV DFEGTFHVNT VTDDDYAGFI 600
FGYQDSSSFY VVMWKQMEQT YWQANPFRAV AEPGIQLKAV KSSTGPGEQL RNALWHTGDT 660
ESQVRLLWKD PRNVGWKDKK SYRWFLQHRP QVGYIRVRFY EGPELVADSN VVLDTTMRGG 720
RLGVFCFSQE NIIWANLRYR CNDTIPEDYE THQLRQA

Seq ID NO: 119 DNA sequence 

Nucleic Acid Accession #: NM_014211

Coding sequence: 157-1479 (underlined sequences correspond to start and stop codons)
1 11 21 31 41 51

GGGACAGGGC TGAGGATGAG GAGAACCCTG GGGACCCAGA AGACCGTGCC TTGCCCGGAA 60 
GTCCTGCCTG TAGGCCTGAA GGACTTGCCC TAACAG AGCC TCAACAACTA CCTGGTGATT 1 20 
CCTACTTCAG CCCCTTGGTG TGAGCAGCTT CTCAACATGA ACTACAGCCT CCACTTGGCC 1 80 
TTCGTGTGTC TGAGTCTCTT CACTGAGAGG ATGTGCATCC AGGGGAGTCA GTTCAACGTC 240 
GAGGTCGGCA GAAGTGACAA GCTTTCCCTG CCTGGCTTTG AGAACCTCAC AGCAGGATAT 300 
AACAAATTTC TCAGGCCCAA TTTTGGTGGA G AACCCGTAC AGATAGCGCT GACTCTGGAC 360 
ATTGCAAGTA TCTCTAGCAT TTCAGAGAGT AACATGGACT ACACAGCCAC CATATACCTC 420 
CGACAGCGCT GGATGGACCA GCGGCTGGTG TTTGAAGGCA ACAAGAGCTT CACTCTGGAT 480 
GCCCGCCTCG TGG AGTTCCT CTGGGTGCCA GATACTTACA TTGTGGAGTC CAAGAAGTCC 540 
TTCCTCCATG AAGTCACTGT GGGAAACAGG CTCATCCGCC TCTTCTCCA A TGGCACGGTC 600 
CTGTATGCCC TCAGAATCAC GACAACTGTT GCATGTAACA TGGATCTGTC TAAATACCCC 660 
ATGGACACAC AGACATGCAA GTTGCAGCTG GAAAGCTGGG GCTATGATGG AAATGATGTG 720 
G AGTTCACCT GGCTG AG AGG GAACGACTCT GTGCGTGG AC TGGAACACCT GCGGCTTGCT 780 
CAGTACACCA TAG AGCGGTA TTTCACCTTA GTCACCAGAT CGCAGCAGGA GACAG GAAAT 840 
TACACTAGAT TGGTCTTACA GTTTGAGCTT CGGAGG AATG TTCT GTAT TT CATTTTGGAA 900 
ACCTACGTTC CTTCCACTTT CCTGGTGGTG TTGTCCTGGG TTTCATTTTG GATCTCTCTC 960 
GATTCAGTCC CTGCAAGAAC CTGCATTGGA GTGACGACCG TGTTATCAAT GACCACACTG 1020 
ATGATCGGGT CCCGCACTTC TCTTCCCAAC ACCAACTGCT TCATCAAGGC CATCG ATGTG 1080 
TACCTGGGGA TCTGCTTTAG CTTTGTGTTT GGGGCCTTGC TAGAATATGC AGTTGCTCAC 1 140 
TACAGTTCCT TACAGCAG AT GGCAGCCAAA GATAGGGGGA CAACAAAGG A AGTAGAAG AA 1200 
GTCAGTATTA CTAATATCAT CAACAGCTCC ATCTCCAGCT TTAAACGGAA GATCAGCTTT 1260 
GCCAGCATTG AAATTTCCAG CGACAACGTT GACTACAGTG ACTTGACAAT GAAAACCAGC 1320 
GACAAGTTCA AQTTTOTCTT CCGAGAAAAG ATGGGCAGGA TTGTTGATTA TTTCACAATT 1380 
CAAAACCCCA GTA ATGTTG A TCACTATTCC AAACTACTGT TTCCTTTGAT TTTT ATGC TA 1440 
GCCAATGTAT TTTACTGGGC ATACTACATG TATTTTTGAG TCAA TGTT AA ATTTCTTGCA 1500 
TGCCATAGGT CTTCAACAGG ACAAGATAAT GATGTAAATG GTATTTTAGG CCAAGTGTGC 1560 
ACCCACATCC AATGGTGCTA CAAGTGACTG AAATAATATT TGAGTCTTTC TGCTCAAAGA 1620 
ATGAAGCTCC AACCATTGTT CTAAGCTGTG TAGAAGTCCT AGCATTATAG GATCTTGTAA 1680 
TAGAAACATC AGTCCATTCC TCTTTCATCT TAATCAAGGA CATTCCCATG GAGCCCAAGA 1740 
TTACA AATGT ACTCAGGGCT GTTTATTCGG TGGCTCCCTG GTTTGCATTT ACCTCATATA 1 800 
AAGAATGGGA AGGAGACCAT TGGGTAACCC TCAAGTGTCA GAAGTTGTTT CTAAAGTAAC I860 
TATACATGTT TTTTACTAAA TCTCTGCAGT GCTTATAAAA TACATTGTTG CCTATTTAGG 1920 
GAGTAACATT TTCTAGTTTT TGTTTCTGGT TAAAATO AAA T ATGGGCTTA TGTCAATTCA 1980 
TTGGAAGTCA ATGCACTAAC TCAATACCAA G ATGAGTTTT TAAATAATGA ATATTATTTA 2040 
ATACCACAAC AG AATTATCC CCAATTTCCA ATAAGTCCTA TCATTGAAAA TTCAAATATA 2100 
AGTG AAG AAA AAATTAGTAG ATCAACAATC TAAACAAATC CCTCGGTTCT AAG ATACAAT 21 60 
GGATTCCCCA TACTGGAAGO ACTCTGAGGC TTTATTCCCC C ACTATG CAT ATCTTATCAT 2220 
TTTATTATTA TACACACATC CATCCTAAAC TATACTAAAG CCCTTTTCCC ATGCATGGAT 2280 
GGAAATGGAA GATTTTTTTG TAACTTGTTC TAGAAGTCTT AATATGGGCT GTTGCCATGA 2340 
AGGCTTGCAG AATTGAGTCC ATTTTCTAGC TGCCTTTATT CACATAGTGA TGGGGTACTA 2400 
AAAGTACTGG GTTGACTCAG AGAGTCGCTG TCATTCTGTC ATTGCTGCTA CTCTAACACT 2460
GAGCAACACT CTCCCAGTGG CAGATCCCCT GTATCATTCC AAGAGGAGCA TTCATCCCTT 2520 
TGCTCTAATG ATCAGGAATG ATGCTTATTA GAAAACAAAC TGCTTGACCC AGGAACAAGT 2580 
GGCTTAGCTT AAGTAAACTT GGCTTTGCTC AGATCCCTG A TOCTTCCAGC TGGTCTGCTC 2640 
TG AGTGGCTT ATCCCGCATG AGCAGG AGCG TGCTGGCCCT G AGTACTGAA CTTTCTGAGT 2700 
AACAATGAGA CACGTTACAG AACCTATGTT CAGGTTGCGG GTGAGCTGCC CTCTCCAAAT 2760 
CCAGCCAGAG ATGCACATTC CTCGGCCAGT CTCAGCCAAC AGTACCAAAA GTGATTTTTG 2820 
AGTGTGCCAG GGTAAAGGCT TCCAGTTCAG GCTCAGTTAT TTTAGACAAT CTCGCCATCT 2880 
TTAAT TTC TT AGCTTCCTGT TCTAATAAAT GCACGGCTTT ACCTTTCCTG TCAG AAATAA 2940 
ACCAAGGCTC TAAAAGATGA TTTCCCTTCT GTAACTCCCT AG AGCCACAG GTTCTCATTC 3000 
CTTTTCCCAT TATACTTCTC ACAATTCAGT TTCTATG AGT TTGATCACCT G ATTTTTTTA 3060 
ACAAAATATT TCTAACGGGA ATGGGTGGGA GTGCTGGTGA AAAGAOATGA AATGTGGTTG 3120 
TATGAGCCAA TCATATTTGT GATTTTTTAA AAAAAGTTTA AAAGGAAATA TCTGTTCTGA 3 180 
AACCCCACTT AAGCATTGTT TTTATATAAA AACAATGATA AAGATGTGAA CTGTGAAATA 3240 
AATATACCAT ATTAGCTACC CACC 

Seq ID NO: 120 Protein sequence:
Protein Accession #: NP_055026.1

1 11 21 31 41 51
I I I I I I

MNYSLHLAFV CLSLFTERMC IQGSQFNVEV GRSDKLSLPG FENLTAGYNK FLRPNFGGEP 60 
VQIALTLDIA SISSISESNM DYTATIYLRQ RWMDQRLVFE GNKSFTLDAR LVEFLWVPDT 120 
YIVESKKSFL HEVTVGNRLI RLFSNGTVLY ALRITTTVAC NMDLSKYPMD TQTCKLQLES 180
WGYDGNDVEF TWLRGNDSVR GLEHLRLAQY TIERYFTLVT RSQQETGNYT RLVLQFELRR 240
NVLYFILETY VPSTFLVVLS WVSFWISLDS VPARTCIGVT TVLSMTTLMI GSRTSLPNTN 300
CFIKAIDVYL GICFSFVFGA LLEYAVAHYS SLQQMAAKDR GTTKEVEEVS ITNIINSSIS 360
SFKRKISFAS IEISSDNVDY SDLTMKTSDK FKFVFREKMG RIVDYFTIQN PSNVDHYSKL 420
LFPLIFMLAN VFYWAYYMYF

Seq ID NO: 121 DNA sequence 

Nucleic Acid Accession #: NM_001854

Coding sequence: 163-5582 (underlined sequences correspond to start and stop codons)

1 11 21 31 41 51 

I I I I I I 

AACCATCAAA TTTAGAAGAA AAAGCCCTTT G ACTTTTTCC CCCTCTCCCT CCCCAATGGC 60 
TGTGTAGCAA ACATCCCTGG CGATACCTTG GAAAGGAOGA AGTTGGTCTG CAGTCGCAAT 120 
TTCGTGGGTT GAGTTCACAG TTGTG AGTGC GGGGCTCGGA G ATGG AGCCG TGGTCCTCTA 1 80 
GGTGGAAAAC GAAAOGGTGG CTCTGGGATT TCACCGTAAC AACCCTCGCA TTGACCTTCC 240 
TCTTCCAAGC TAGAGAGGTC AGAGGAGCTG CTCCAGTTGA TGTACTAAAA GCACTAGATT 300 
TTCACAATTC TCCAG AGGGA ATATCAAAAA CAACGGGATT TTGCACAAAC AG AAAGAATT 360 
CTAAAGGCTC AGATACTGCT T AC AG AGTTT CAAAGCAAGC ACAACTCAGT GCCCCAACAA 420 
AACAGTTATT TCCAGGTGGA ACTTTCCCAG AAGACTTTTC AATACTATTT ACAGTAAAAC 480 
CAAAAAAAGG AATTCAGTCT TTCCTTTTAT CTATATATAA TGAGCATGGT ATTCAGCAAA 540 
TTGGTGTTGA GGTTGGGAGA TCACCTGTTT TTCTGTTTGA AGACCACACT GGAAAACCTG 600 
CCCCAGAAG A CTATCCCCTC TTCAG AACTG TTAACATCGC TGACGGGAAG TGGCATCQGG 660 
TAGCAATCAG CGTGG AGAAG AAAACTGTGA CAATGATTGT TGATTGTAAG AAG AAAACC A 720 
CGAAACCACT TGATAGAAGT GAG AG AGCAA TTGTTGATAC CAATGGAATC ACGGTTTTTG 780 
GAACAAGGAT TTTGGATGAA GAAGTTTTTG AGGGGG ACAT TCAGCAGTTT TTGATCACAG 840 
GTGATCCCAA GGCAGCATAT GACTACTGTG AG C ATT AT AG TCCAGACTGT GACTCTTCAG 900 
CACCCAAGGC TGCTCAAGCT CAGGAACCTC AG AT AG ATG A GTATGCACC A GAGGATATAA 960 
TCGAATATGA CTATGAGTAT GGGGAAGCAG AGTATAAAGA GGCTGAAAGT GTAAC AGAG G 1020 
GACCCACTGT AACTGAGGAG ACAATAGCAC AGACGGAGGC AAACATCGTT GATGATTTTC 1080 
AAGAATACAA CTATGGAACA ATGGAAAGTT ACCAGACAGA AGCTCCTAGG CATGTTTCTG 1 140 
GGACAAATG A GCCAAATCCA GTTGAAGAAA TATTTACTGA AGAATATCTA ACGGGAGAGG 1200 
ATTATG ATTC CCAG AGGAAA AATTCTGAGG ATACACTATA TOAAA ACAAA GAAATAGACG 1260 
GCAGGGATTC TGATCTTCTG GTAGATGGAG ATTTAGGCGA ATATGATTTT TATGAATATA 1320 
AAGAATATGA AGATAAACCA ACAAGCCCCC CTAATGAAGA ATTTGGTCCA GGTGTACCAG 1380 
CAGAAACTGA TATTACAGAA ACAAGCATAA ATGGCCATGG TGCATATGGA GAGAAAGGAC 1440 
AGAAAGGAGA ACCAGCAGTG GTTGAGCCTG GTATGCTTGT CGAAGGACCA CCAGGACCAG 1500 
CAGGACCTGC AGGTATTATG GGTCCTCCAG GTCTACAAGG CCCCACTGGA CCCCCTGGTG 1560 
ACCCTGGCGA TAGGGGCCCC CCAGGACGTC CTGGCTTACC AGGGGCTGAT GGTCTACCTG 1620 
GTCCTCCTGG TACTATGTTG ATGTTACCGT TCCGTTATGG TGGTGATGGT TCCAAAGGAC 1680 
CAACCATCTC TGCTCAGGAA GCTCAGGCTC AAGCTATTCT TCAGCAGGCT CGGATTGCTC 1740 
TGAGAGGCCC ACCTGGCCCA ATGGGTCTAA CTGGAAGACC AGGTCCTGTG GGGGGGCCTG 1800 
GTTCATCTGG GGCCAAAGGT GAGAGTGGTG ATCCAGGTCC TCAGGGCCCT CGAGGCGTCC 1860 
AGGGTCCCCC TGGTCCAACG GGAAAACCTG GAAAAAGGGG TCGTCCAGGT GCAGATGGAG 1920 
GAAGAGGAAT GCCAGGAGAA CCTGGGGCAA AGGGAGATCG AGGGTTTGAT GGACTTCCGG 1980 
GTCTGCCAGG TGACAAAGGT CACAGGGGTG AACGAGGTCC TCAAGGTCCT CCAGGTCCTC 2040 
CTGGTGATGA TGGAATGAGG GGAGAAGATG GAGAAATTGG ACCAAGAGGT CTTCCAGGTG 2100 
AAGCTGGCCC ACGAGGTTTG CTGGGTCCAA GGGGAACTCC AGGAGCTCCA GGGCAGCCTG 2160 
GTATGGCAGG TGTAGATGGC CCCCCAGGAC CAAAAGGGAA CATGGGTCCC CAAGGGGAGC 2220 
CTGGGCCTCC AGGTGAACAA GGGAATCCAG GACCTCAGGG TCTTCCTGGT OCACAAGGTC 2280 
CAATTGGTCC TCCTGGTGAA AAAGGACCAC AAGGAAAACC AGGACTTGCT GG ACTTCCTG 2340 
GTGCTGATGG GCCTCCTGGT CATCCTGGGA AAGAAGGCCA GTCTGGAGAA AAGGGGGCTC 2400 
TGGGTCCCCC TGGTCCACAA GGTCCTATTG GATNNCCGGG CCCCCGGGGA GTAAAGGGAG 2460 
CAGATGGTGT CAGAGGTCTC AAGGGATCTA AAGGTGAAAA GGGTGAAGAT GGTTTTOCAG 2520
GATTCAAAGG TGACATGGGT CTAAAAGGTG ACAGAGGAGA AGTTGGTCAA ATTGGCCCAA 2580 
GAGGGNAAGA TGGCCCTGAA GGACCCAAAG GTCGAGCAGG CCCAACTGGA GACCCAGGTC 2640 
CTTCAGGTCA AGCAGGAGAA AAGGGAAAAC TTGGAGTTCC AGGATTACCA GGATATCCAG 2700 
GAAGACAAGG TCCAAAGGGT TCCACTGGAT TCCCTGGGTT TCCAGGTGCC AATGGAG AG A 2760 
AAGGTGCACG GGG AGTAGCT GGCAAACCAG GCCCTCGGGG TCAGCGTGGT CGAACGGGTC 2820 
CTCGAGGTTC AAGAGGTGCA AGAGGTCCCA CTGGGAAACC TGGGCCAAAG GGCACTTCAG 2880 
GTGGCGATGG CCCTCCTGGC CCTCCAGGTG AAAGAGGTCC TCAAGGACCT CAGGGTCCAG 2940 
TTGGATTCCC TGGACCAAAA GGCCCTCCTG GACCACCAGG AAGGATGGGC TGCCCAGGAC 3000 
ACCCTGGGCA ACGTGGGGAG ACTGGATTTC AAGGCAAGAC CGGCCCTCCT GGGCCAGGGG 3060 
GAGTGGTTGG ACCACAGGGA CCAACCGGTG AGACTGGTCC AATAGGGGAA CGTGGGTATC 3120 
CTGGTCCTCC TGGCCCTCCT GGTGAGCAAG GTCTTCCTGG TGCTGCAGG A AAAGAAGGTG 3180 
CAAAGGGTGA TCCAGGTCCT CAAGGTATCT CAGGGAAAGA TGGACCAGCA GGATTACGTG 3240 
GTTTCCCAGG GGAAAGAGGT CTTCCTGGAG CTCAGGGTGC ACCTGGACTG AAAGGAGGGG 3300 
AAGGTCCCCA GGGCCCACCA GGTCCAGTTG GCTCACCAGG AGAACGTGGG TCAGCAGGTA 3360 
CAGCTGGCCC AATTGGTTTA CGAGGGCGCC CGGGACCTCA GGGTCCTCCT GGTCCAGCTG 3420 
GAGAGAAAGG TGCTCCTGGA GAAAAAGGTC CCCAAGGGCC TGCAGGGAGA GATGGAGTTC 3480 
AAGGTCCTGT TGGTCTCCCA GGGCCAGCTG GTCCTGCCGG CTCCCCTGGG GAAGACGGAG 3540 
ACAAGGGTGA AATTGGTGAG CCGGGACAAA AAGGCAGCAA GGGTGGCAAG GGAGAAAATG 3600 
GCCCTCCCGG TCCCCCAGGT CTTCAAGGAC CAGTTGGTGC CCCTGGAATT GCTGG AGGTG 3660 
ATGGTGAACC AGGTCCTAGA GGACAGCAGG GGATGTTTGG GCAAAAAGGT GATGAGGGTG 3720 
CCAGAGGCTT CCCTGGACCT CCTGGTCCAA TAGGTCTTCA GGGTCTGCCA GGCCCACCTG 3780 
GTGAAAAAGG TGAAAATGGG GATGTTGGTC CATGGGGGCC ACCTGGTCCT CCAGGCCCAA 3840 
GAGGCCCTCA AGGTCCCAAT GGAGCTGATG GACCACAAGG ACCCCCAGGT TCTGTTGGTT 3900 
CAGTTGGTGG TGTTGGAGAA AAGGGTGAAC CTGGAGAAGC AGGAAACCCA GGGCCTCCTG 3960 
GGGAAGCAGG TGTAGGCGGT CCCAAAGGAG AAAGAGGAGA GAAAGGGGAA GCTGGTCCAC 4020 
CTGGAGCTGC TGGACCTCCA GGTGCCAAGG GGCCGCCAGG TGATGATGGC CCTAAGGGTA 4080 
ACCCGGGTCC TGTTGGTTTT CCTGGAGATC CTGGTCCTCC TGGGGAACTT GGCCCTGCAG 4140 
GTCAAGATGG TGTTGGTGGT GACAAGGGTG AAGATGGAGA TCCTGGTCAA CCGGGTCCTC 4200 
CTGGCCCATC TGGTGAGGCT GGCCCACCAG GTCCTCCTGG AAAACGAGGT CCTCCTGGAG 4260 
CTGCAGGTGC AGAGGGAAGA CAAGGTGAAA AAGGTGCTAA GGGGGAAGCA GGTGCAGAAG 4320 
GTCCTCCTGG AAAAACCGGC CCAGTCGGTC CTCAGGGACC TGCAGGAAAG CCTGGTCCAG 4380 
AAGGTCTTCG GGGCATCCCT GGTCCTGTGG GAGAACAAGG TCTCCCTGGA GCTGCAGGCC 4440 
AAGATGGACC ACCTGGTCCT ATGGGACCTC CTGGCTTACC TGGTCTCAAA GGTGACCCTG 4500 
GCTCCAAGGG TGAAAAGGGA CATCCTGGTT TAATTGGCCT GATTGGTCCT CCAGGAGAAC 4560 
AAGGGGAAAA AGGTGACCGA GGGCTCCCTG GAACTCAAGG ATCTCCAGGA GCAAAAGGGG 4620 
ATGGGGGAAT TCCTGGTCCT GCTGGTCCCT TAGGTCCACC TGGTCCTCCA GGCTTACCAG 4680 
GTCCTCAAGG CCCAAAGGGT AACAAAGGCT CTACTGGACC CGCTGGCCAG AAAGGTGACA 4740 
GTGGTCTTCC AGGGCCTCCT GGGCCTCCAG GTCCACCTGG TGAAGTCATT CAGCCTTTAC 4800 
CAATCTTGTC CTCCAAAAAA ACGAGAAGAC ATACTGAAGG CATGCAAGCA GATGCAGATG 4860 
ATAATATTCT TGATTACTCG GATGGAATGG AAGAAATATT TGGTTCCCTC AATTCCCTGA 4920 
AACAAGACAT CGAGCATATG AAATTTCCAA TGGGTACTCA GACCAATCCA GCCCGAACTT 4980 
GTAAAGACCT GCAACTCAGC CATCCTGACT TCCCAGATGG TGAATATTGG ATTGATCCTA 5040 
ACCAAGGTTG CTCAGGAGAT TCCTTCAAAG TTTACTGTAA TTTCACATCT GGTGGTGAGA 5100 
CTTGCATTTA TCCAGACAAA AAATCTGAGG GAGTAAGAAT TTCATCATGG CCAAAGGAGA 5160 
AACCAGGAAG TTGGTTTAGT GAATTTAAGA GGGGAAAACT GCTTTCATAC TTAGATGTTG 5220 
AAGGAAATTC CATCAATATG GTGCAAATGA CATTCCTGAA ACTTCTGACT GCCTCTGCTC 5280 
GGCAAAATTT CACCTACCAC TGTCATCAGT CAGCAGCCTG GTATGATGTG TCATCAGGAA 5340 
GTTATGACAA AGCACTTCGC TTCCTGGGAT CAAATGATGA GGAGATGTCC TATGACAATA 5400 
ATCCTTTTAT CAAAACACTG TATGATGGTT GTACGTCCAG AAAAGGCTAT GAAAAAACTG 5460 
TCATTGAAAT CAATACACCA AAAATTGATC AAGTACCTAT TGTTGATGTC ATGATCAGTG 5520 
ACTTTGGTGA TCAGAATCAG AAGTTCGGAT TTGAAGTTGG TCCTGTTTGT TTTCTTGGQL 5580 
AAGATTAAGA CAAAGAACAT ATCAAATCAA CAGAAAATGT ACCTTGGTGC CACCAACCCA 5640 
TTTTGTGCCA CATGCAAGTT TTGAATAAGG ATGTATGGAA AACAACGCTG CATATACAGG 5700 
TACCATTTAG GAAATACCGA TGCCTTTGTG GGGGCAGAAT CACAGACAAA AGCTTTGAAA 5760 
ATCATAAAGA TATAAGTTGG TGTGGCTAAG ATGGAAACAG GGCTGATTCT TGATTCCCAA 5820 
TTCTCAACTC TCCTTTTCCT ATTTGAATTT CTTTGGTGCT GTAGAAAACA AAAAAAGAAA 5880 
AATATATATT CATAAAAAAT ATGGTGCTCA TTCTCATCCA TCCAGGATGT ACTAAAACAG 5940 
TGTGTTTAAT AAATTGTAAT TATTTTGTGT ACAGTTCTAT ACTGTTATCT GTGTCCATTT 6000 
CCAAAACTTG CACGTGTCCC TGAATTCCGC TGACTCTAAT TTATGAGGAT GCCGAACTCT 6060 
GATGGCAATA ATATATGTAT TATGAAAATG AAGTTATGAT TTCCGATGAC CCTAAGTCCC 6120 
TTTCTTTGGT TAATGATGAA ATTCCTTTGT GTGTGTTT 

Seq ID NO: 122 Protein sequence: 
Protein Accession #: NP_001845 



1 11 21 31 41 51 

MEPWSSRWKT KRWLWDFTVT TLALTFLFQA REVRGAAPVD VLKALDFHNS PEGISKTTGF 60 
CTNRKNSKGS DTAYRVSKQA QLSAPTKQLF PGGTFPEDFS ILFTVKPKKG IQSFLLSIYN 120 
EHGIQQIGVE VGRSPVFLFE DHTGKPAPED YPLFRTVNIA DGKWHRVAIS VEKKTVTMIV 180 
DCKKKTTKPL DRSERAIVDT NGITVFGTRI LDEEVFEGDI QQFLITGDPK AAYDYCEHYS 240 
PDCDSSAPKA AQAQEPQIDE YAPEDIIEYD YEYGEAEYKE AESVTEGPTV TEETIAQTEA 300 
NIVDDFQEYN YGTMESYQTE APRHVSGTNE PNPVEEIFTE EYLTGEDYDS QRKNSEDTLY 360 
ENKEIDGRDS DLLVDGDLGE YDFYEYKEYE DKPTSPPNEE FGPGVPAETD ITETSINGHG 420 
AYGEKGQKGE PAWEPGMLV EGPPGPAGPA GIMGPPGLQG PTGPPGDPGD RGPPGRPGLP 480 
GADGLPGPPG TMLMLPFRYG GDGSKGPTIS AQEAQAQAIL QQARIALRG? PGPMGLTGRP 540 
GPVGGPGSSG AKGESGDPGP QGPRGVQGPP GPTGKPGKRG RPGADGGRGM PGEPGAKGDR 600 
GFDGLPGLPG DKGHRGERGP QGPPGPPGDD GMRGEDGEIG PRGLPGEAGP RGLLGPRGTP 660 
GAPGQPGMAG VDGPPGPKGN MGPQGEPGPP GQQGNPGPQG LPGPQGPIGP PGEKGPQGKP 720 
GLAGLPGADG PPGHPGKEGQ SGEKGALGPP GPQGPIGXPG PRGVKGADGV RGLKGSKGEK 780 
GEDGFPGFKG DMGLKGDRGE VGQIGPRGXD GPEGPKGRAG PTGDPGPSGQ AGEKGKLGVP 840 
GLPGYPGRQG PKGSTGFPGF PGANGEKGAR GVAGKPGPRG QRGPTGPRGS RGARGPTGKP 900 
GPKGTSGGDG PPGPPGERGP QGPQGPVGFP GPKGPPGPPG RMGCPGHPGQ RGETGFQGKT 960 
GPPGPGGVVG PQGPTGETGP IGERGYPGPP GPPGEQGLPG AAGKEGAKGD PGPQGISGKD 1020 
GPAGLRGFPG ERGLPGAQGA PGLKGGEGPQ GPPGPVGSPG ERGSAGTAGP IGLRGRPGPQ 1080 
GPPGPAGEKG APGEKGPQGP AGRDGVQGPV GLPGPAGPAG SPGEDGDKGE IGEPGQKGSK 1140 
GGKGENGPPG PPGLQGPVGA PGIAGGDGEP GPRGQQGMFG QKGDEGARGF PGPPGPIGLQ 1200 
GLPGPPGEKG ENGDVGPWGP PGPPGPRGPQ GPNGADGPQG PPGSVGSVGG VGEKGEPGEA 1260 
GNPGPPGEAG VGGPKGERGE KGEAGPPGAA GPPGAKGPPG DDGPKGNPGP VGFPGDPGPP 1320 
GELGPAGQDG VGGDKGEDGD PGQPGPPGPS GEAGPPGPPG KRGPPGAAGA EGRQGEKGAK 1380 
GEAGAEGPPG KTGPVGPQGP AGKPGPEGLR GIPGPVGEQG LPGAAGQDGP PGPMGPPGLP 1440 
GLKGDPGSKG EKGHPGLIGL IGPPGEQGEK GDRGLPGTQG SPGAKGDGGI PGPAGPLGPP 1500 
GPPGLPGPQG PKGNKGSTGP AGQKGDSGLP GPPGPPGPPG EVIQPLPILS SKKTRRHTEG 1560 
MQADADDNIL DYSDGMEEIF GSLNSLKQDI EHMKFPMGTQ TNPARTCKDL QLSHPDFPDG 1620 
EYWIDPNQGC SGDSFKVYCN FTSGGETCIY PDKKSEGVRI SSWPKEKPGS WFSEFKRGKL 1680 
LSYLDVEGNS INMVQMTFLK LLTASARQNF TYHCHQSAAW YDVSSGSYDK ALRFLGSNDE 1740 
EMSYDNNPFI KTLYDGCTSR KGYEKTVIEI NTPKIDQVPI VDVMISDFGD QNQKFGFEVG 1800 
PVCFLG 

Seq ID NO: 123 DNA sequence 

Nucleic Acid Accession #: NM_015886 

Coding sequence: 485-1261 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GAATTCCCCC CCCCCCCCCC TCACTTGGTG TGTCTATATG TCTGGCAGAC ATTATCAGCA 60 
CATTCTCTGT TGTTACCTGT GATTCATTTT TTCTTCACTC TCCAGGTGAA TTTCAATTGC 120 
TGAAAATTTC CCACTGAAAA TATGCAGTAA TATATTTTGT GGTTCAGACA TTTGGGGCAA 180 
ATGGTTCACA TTCATTTTAG GGTTAGTGGT CATGCTGTTT ATTTTTCTCT GCTATACAAA 240 
GTTCCTCTTA GGGGTCTGCC TCATGACACT AAAAAATGAA TAGAGATTCT ACTGTAGGTT 300 
ATCTCCTAGG CTTGAGTTCA ACATTTGTTT GGATTTTTGA AGAAAGTCAA ATCAAGCAAT 360 
GCTCCCAAAT GATGTCTTTG TAAATTCATA CCCTCTGGCC CTATTTTTTT TCATAGACCC 420 
TAACTCTACC TTTCTGCTTT AAAGCAAAGT AAACTCGGTG GCCTCTTCTT CTCCACCCCT 480 
CAAAATGATA GCAATCTCTG CCGTCAGCAG TGCACTCCTG TTCTCCCTTC TCTGTGAAGC 540 
AAGTACCGTC GTCCTACTCA ATTCCACTGA CTCATCCCCG CCAACCAATA ATTTCACTGA 600 
TATTGAAGCA GCTCTGAAAG CACAATTAGA TTCAGCGGAT ATCCCCAAAG CCAGGCGGAA 660 
GCGCTACATT TCGCAGAATG ACATGATCGC CATTCTTGAT TATCATAATC AAGTTCGGGG 720 
CAAAGTGTTC CCACCGGCAG CAAATATGGA ATATATGGTT TGGGATGAAA ATCTTGCAAA 780 
ATCGGCAGAG GCTTGGGCGG CTACTTGCAT TTGGGACCAT GGACCTTCTT ACTTACTGAG 840 
ATTTTTGGGC CAAAATCTAT CTGTACGCAC TGGAAGATAT CGCTCTATTC TCCAGTTGGT 900 
CAAGCCATGG TATGATGAAG TGAAAGATTA TGCTTTTCCA TATCCCCAGG ATTGCAACCC 960 
CAGATGTCCT ATGAGATGTT TTGGTCCCAT GTGCACACAT TATACGCAGA TGGTTTGGGC 1020 
CACTTCCAAT CGGATAGGAT GCGCAATTCA TGCTTGCCAA AACATGAATG TTTGGGGATC 1080 
TGTGTGGCGA CGTGCAGTTT ACTTGGTATG CAACTATGCC CCAAAGGGCA ATTGGATTGG 1140 
AGAAGCACCA TATAAAGTAG GGGTACCATG TTCATCTTGT CCTCCAAGTT ATGGGGGATC 1200 
TTGTACTGAC AATCTGTGTT TTCCAGGAGT TACGTCAAAC TACCTGTACT GGTTTAAATA 1260 
AGTTTACCTT TTCCTCCAGG AAATATAATG ATTTCTGGGA ACATGGGCAT GTATATATAT 1320 
ATATGGAGAG AGAATTTTGC ACATATTATA CATATTTTGT GCTAATCTTG TTTTCCTCTT 1380 
AGTATTCCTT TGTATAAATT AGTGTTTGTC TAGCATGTTT GTTTAATCCT TTGGGAATTC 

Seq ID NO: 124 Protein sequence: 
Protein Accession #: NP_056970.1 

1 11 21 31 41 51 

I I I I I I 

MIAISAVSSA LLFSLLCEAS TVVLLNSTDS SPPTNNFTDI EAALKAQLDS ADIPKARRKR 60 
YISQNDMIAI LDYHNQVRGK VFPPAANMEY MVWDENLAKS AEAWAATCIW DHGPSYLLRF 120 
LGQNLSVRTG RYRSILQLVK PWYDEVKDYA FPYPQDCNPR CPMRCFGPMC THYTQMVWAT 180 
SNRIGCAIHA CQNMNVWGSV WRRAVYLVCN YAPKGNWIGE APYKVGVPCS SCPPSYGGSC 240 
TDNLCFPGVT SNYLYWFK 



Seq ID NO: 125 DNA sequence 

Nucleic Acid Accession #: NM_001793 

Coding sequence: 54-2543 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

GCGGAACACC GGCCCGCCGT CGCGGCAGCT GCTTCACCCC TCTCTCTGCA GCCATGGGGC 60 
TCCCTCGTGG ACCTCTCGCG TCTCTCCTCC TTCTCCAGGT TTGCTGGCTG CAGTGCGCGG 120 
CCTCCGAGCC GTGCCGGGCG GTCTTCAGGG AGGCTGAAGT GACCTTGGAG GCGGGAGGCG 180 
CGGAGCAGGA GCCCGGCCAG GCGCTGGGGA AAGTATTCAT GGGCTGCCCT GGGCAAGAGC 240 
CAGCTCTGTT TAGCACTGAT AATGATGACT TCACTGTGCG GAATGGCGAG ACAGTCCAGG 300 
AAAGAAGGTC ACTGAAGGAA AGGAATCCAT TGAAGATCTT CCCATCCAAA CGTATCTTAC 360 
GAAGACACAA GAGAGATTGG GTGGTTGCTC CAATATCTGT CCCTGAAAAT GGCAAGGGTC 420 
CCTTCCCCCA GAGACTGAAT CAGCTCAAGT CTAATAAAGA TAGAGACACC AAGATTTTCT 480 
ACAGCATCAC GGGGCCGGGG GCAGACAGCC CCCCTGAGGG TGTCTTCGCT GTAGAGAAGG 540 
AGACAGGCTG GTTGTTGTTG AATAAGCCAC TGGACCGGGA GGAGATTGCC AAGTATGAGC 600 
TCTTTGGCCA CGCTGTGTCA GAGAATGGTG CCTCAGTGGA GGACCCCATG AACATCTCCA 660 
TCATCGTGAC CGACCAGAAT GACCACAAGC CCAAGTTTAC CCAGGACACC TTCCGAGGGA 720 
GTGTCTTAGA GGGAGTCCTA CCAGGTACTT CTGTGATGCA GGTGACAGCC ACAGATGAGG 780 
ATGATGCCAT CTACACCTAC AATGGGGTGG TTGCTTACTC CATCCATAGC CAAGAACCAA 840 
AGGACCCACA CGACCTCATG TTCACAATTC ACCGGAGCAC AGGCACCATC AGCGTCATCT 900 
CCAGTGGCCT GGACCGGGAA AAAGTCCCTG AGTACACACT GACCATCCAG GCCACAGACA 960 
TGGATGGGGA CGGCTCCACC ACCACGGCAG TGGCAGTAGT GGAGATCCTT GATGCCAATG 1020 
ACAATGCTCC CATGTTTGAC CCCCAGAAGT ACGAGGCCCA TGTGCCTGAG AATGCAGTGG 1080 
GCCATGAGGT GCAGAGGCTG ACGGTCACTG ATCTGGACGC CCCCAACTCA CCAGCGTGGC 1140 
GTGCCACCTA CCTTATCATG GGCGGTGACG ACGGGGACCA TTTTACCATC ACCACCCACC 1200 
CTGAGAGCAA CCAGGGCATC CTGACAACCA GGAAGGGTTT GGATTTTGAG GCCAAAAACC 1260 
AGCACACCCT GTACGTTGAA GTGACCAACG AGGCCCCTTT TGTGCTGAAG CTCCCAACCT 1320 
CCACAGCCAC CATAGTGGTC CACGTGGAGG ATGTGAATGA GGCACCTGTG TTTGTCCCAC 1380 
CCTCCAAAGT CGTTGAGGTC CAGGAGGGCA TCCCCACTGG GGAGCCTGTG TGTGTCTACA 1440 
CTGCAGAAGA CCCTGACAAG GAGAATCAAA AGATCAGCTA CCGCATCCTG AGAGACCCAG 1500 
CAGGGTGGCT AGCCATGGAC CCAGACAGTG GGCAGGTCAC AGCTGTGGGC ACCCTCGACC 1560 
GTGAGGATGA GCAGTTTGTG AGGAACAACA TCTATGAAGT CATGGTCTTG GCCATGGACA 1620 
ATGGAAGCCC TCCCACCACT GGCACGGGAA CCCTTCTGCT AACACTGATT GATGTCAACG 1680 
ACCATGGCCC AGTCCCTGAG CCCCGTCAGA TCACCATCTG CAACCAAAGC CCTGTGCGCC 1740 
ACGTGCTGAA CATCACGGAC AAGGACCTGT CTCCCCACAC CTCCCCTTTC CAGGCCCAGC 1800 
TCACAGATGA CTCAGACATC TACTGGACGG CAGAGGTCAA CGAGGAAGGT GACACAGTGG 1860 
TXTTGTCCCT GAAGAAGTTC CTGAAGCAGG ATACATATGA CGTGCACCTT TCTCTGTCTG 1920 
ACCATGGCAA CAAAGAGCAG CTGACGGTGA TCAGGGCCAC TGTGTGCGAC TGCCATGGCC 1980 
ATGTCGAAAC CTGCCCTGGA CCCTGGAAAG GAGGTTTCAT CCTCCCTGTG CTGGGGGCTG 2040 
TCCTGGCTCT GCTGTTCCTC CTGCTGGTGC TGCTTTTGTT GGTGAGAAAG AAGCGGAAGA 2100 
TCAAGGAGCC CCTCCTACTC CCAGAAGATG ACACCCGTGA CAACGTCTTC TACTATGGCG 2160 
AAGAGGGGGG TGGCGAAGAG GACCAGGACT ATGACATCAC CCAGCTCCAC CGAGGTCTGG 2220 
AGGCCAGGCC GGAGGTGGTT CTCCGCAATG ACGTGGCACC AACCATCATC CCGACACCCA 2280 
TGTACCGTCC TAGGCCAGCC AACCCAGATG AAATCGGCAA CTTTATAATT GAGAACCTGA 2340 
AGGCGGCTAA CACAGACCCC ACAGCCCCGC CCTACGACAC CCTCTTGGTG TTCGACTATG 2400 
AGGGCAGCGG CTCCGACGCC GCGTCCCTGA GCTCCCTCAC CTCCTCCGCG TCCGACCAAG 2460 
ACCAAGATTA CGATTATCTG AACGAGTGGG GCAGCCGCTT CAAGAAGCTG GCAGACATGT 2520 
ACGGTGGCGG GGAGGACGAC TAGGCGGCCT GCCTGCAGGG CTGGGGACCA AACGTCAGGC 2580 
CACAGAGCAT CTCCAAGGGG TCTCAGTTCC CCCTTCAGCT GAGGACTTCG GAGCTTGTCA 2640 
GGAAGTGGCC GTAGCAACTT GGCGGAGACA GGCTATGAGT CTGACGTTAG AOTGGTTGCT 2700 
TCCTTAGCCT TTCAGGATGG AGGAATGTGG GCAGTTTGAC TTCAGCACTG AAAACCTCTC 2760 
CACCTGGGCC AGGGTTGCCT CAGAGGCCAA GTTTCCAGAA GCCTCTTACC TGCCGTAAAA 2820 
TGCTCAACCC TGTGTCCTGG GCCTGGGCCT GCTGTGACTG ACCTACAGTG GACTTTCTCT 2880 
CTGOAATGGA AO CTTCTTA O GCCTCCTGGT GCAACTTAAT TTTTTTTTTT AATGCTATCT 2940 
TCAAAACGTT AGAGAAAGTT CTTCAAAAGT GCAGCCCAGA GCTGCTGGGC CCACTGGCCG 3000 
TCCTGCATTT CTGGTTTCCA GACCCCAATG CCTCCCATTC GGATGGATCT CTGCGTTTTT 3060 
ATACTGAGTG TGCCTAGGTT GCCCCTTATT TTTTATTTTC CCTGTTGCGT TGCTATAGAT 3120 
GAAGGGTGAG GACAATCGTG TATATGTACT AGAACTTTTT TATTAAAGAA A 

Seq ID NO: 126 Protein sequence: 
Protein Accession #: NP_001784 

1 11 21 31 41 51 

I I I I I I 

MGLPRGPLAS LLLLQVCWLQ CAASEPCRAV FREAEVTLEA GGAEQEPGQA LGKVFMGCPG 60 
QEPALFSTDN DDFTVRNGET VQERRSLKER NPLKIFPSKR ILRRHKRDWV VAPISVPENG 120 
KGPFPQRLNQ LKSNKDRDTK IFYSITGPGA DSPPEGVFAV EKETGWLLLN KPLDREEIAK 180 
YELFGHAVSE NGASVEDPMN ISIIVTDQND HKPKFTQDTF RGSVLEGVLP GTSVMQVTAT 240 
DEDDAIYTYN GVVAYSIHSQ EPKDPHDLMF TIHRSTGTIS VISSGLDREK VPEYTLTIQA 300 
TDMDGDGSTT TAVAVVEILD ANDNAPMFDP QKYEAHVPEN AVGHEVQRLT VTDLDAPNSP 360 
AWRATYLIMG GDDGDHFTIT THPESNQGIL TTRKGLDFEA KNQHTLYVEV TNEAPFVLKL 420 
PTSTATIVVH VEDVNEAPVF VPPSKVVEVQ EGIPTGEPVC VYTAEDPDKE NQKISYRILR 480 
DPAGWLAMDP DSGQVTAVGT LDREDEQFVR NNIYEVMVLA MDNGSPPTTG TGTLLLTLID 540 
VNDHGPVPEP RQITICNQSP VRHVLNITDK DLSPHTSPFQ AQLTDDSDIY WTAEVNEEGD 600 
TVVLSLKKFL KQDTYDVHLS LSDHGNKEQL TVIRATVCDC HGHVETCPGP WKGGFILPVL 660 
GAVLALLFLL LVLLLLVRKK RKIKEPLLLP EDDTRDNVFY YGEEGGGEED QDYDITQLHR 720 
GLEARPEVVL RNDVAPTIIP TPMYRPRPAN PDEIGNFIIE NLKAANTDPT APPYDTLLVF 780 
DYEGSGSDAA SLSSLTSSAS DQDQDYDYLN EWGSRFKKLA DMYGGGEDD 

Seq ID NO: 127 DNA sequence 

Nucleic Acid Accession #: NM_003256.1 

Coding sequence: 60-734 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 
I I I I I I 

CCTGCTGGGG CCGTCCAGTC CCCCAGACCT CACAGGCTCA GTCGCGGATC TGCAGTGTCA 60 
TGCCTGGGAG CCCTCGGCCC GCGCCAAGCT GGGTGCTGTT GCTGCGGCTG CTGGCGTTGC 120 
TGCGGCCCCC GGGGCTGGGT GAGGCATGCA GCTGCGCCCC GGCGCACCCT CAGCAGCACA 180 
TCTGCCACTC GGCACTTGTG ATTCGGGCCA AAATCTCCAG TGAGAAGGTA GTTCCGGCCA 240 
GTGCAGACCC TGCTGACACT GAAAAAATGC TCCGGTATGA AATCAAACAG ATAAAGATGT 300 
TCAAAGGGTT TGAGAAAGTC AAGGATGTTC AGTATATCTA TACGCCTTTT GACTCTTCCC 360 
TCTGTGGTGT GAAACTAGAA GCCAACAGCC AGAAGCAGTA TCTCTTGACT GGTCAGGTCC 420 
TCAGTGATGG AAAAGTCTTC ATCCATCTGT GCAACTACAT CGAGCCCTGG GAGGACCTGT 480 
CCTTGGTGCA GAGGGAAAGT CTGAATCATC ACTACCATCT GAACTGTGGC TGCCAAATCA 540 
CCACCTGCTA CACAGTACCC TGTACCATCT CGGCCCCTAA CGAGTGCCTC TGGACAGACT 600 
GGCTGTTGGA ACGAAAGCTC TATGGTTACC AGGCTCAGCA TTATGTCTGT ATGAAGCATG 660 
TTGACGGCAC CTGCAGCTGG TACCGGGGCC ACCTGCCTCT CAGGAAGGAG TTTGTTGACA 720 
TCGTTCAGCC CTAGTAGGGA CCAGTGACCA TCACATCCCT TCAAGAGTCC TGAAGATCAA 780 
GCCAGTTCTC CTTCCCTGCA GAGCTTTGGC CATTACCACC TGACCTCTTG CTGCCAGCTA 840 
ATAAGAAGTG CCAAGTGGAC AGTCTGGCCA CTGTCAAGGC AGGGAAGGGG CCATGACTTT 900 
TCTGCCCTGC CCTCAGCCTG TTGCCCTGCC TCCCAAACCC CATTAGTCTA GCCTTGTAGC 960 
TGTTACTGCA AGTGTTTCTT CTGGCTTAGT CTGTTTTCTA AAGCCAGGAC TATTCCCTTT 1020 
CCTCCCCAGG AATATGTGTT TTCCTTTGTC TTAATCGATC TGGTAGGGGA GAAATGGCGA 1080 
ATGTCATACA CATGAGATGG TATATCCTTG CGATGTACAG AATCAGAAGG TGGTTTGACA 1140 
GCATCATAAA CAGGCTGACT GGCAGGAATG AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 128 Protein sequence: 
Protein Accession #: NP_003247.1 

1 11 21 31 41 51 
I I I I I I 

MPGSPRPAPS WVLLLRLLAL LRPPGLGEAC SCAPAHPQQH ICHSALVIRA KISSEKVVPA 60 
SADPADTEKM LRYEIKQIKM FKGFEKVKDV QYIYTPFDSS LCGVKLEANS QKQYLLTGQV 120 
LSDGKVFIHL CNYIEPWEDL SLVQRESLNH HYHLNCGCQI TTCYTVPCTI SAPNECLWTD 180 
WLLERKLYGY QAQHYVCMKH VDGTCSWYRG HLPLRKEFVD IVQP 

Seq ID NO: 129 DNA sequence 

Nucleic Acid Accession #: NM_007207.2 

Coding sequence: 143-1591 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I I I I 
CCACGCGTCC GCAATGAAGC CGAGTGAATG GGGGCTGAAT GTGCGAGTCC ATAGCTGAAG 60 

AGGAGCGCCA GATGGTGGAG GAATACACTT ATTTATGAAA CTGTCTTGAG TTCTTCTTGA 120 

ATTGCCAGTT TTCAGCCTCC TCATGCCTCC GTCTCCTTTA GACGACAGGG TAGTAGTGGC 180 

ACTATCTAGG CCCGTCCGAC CTCAGGATCT CAACCTTTGT TTAGACTCTA GTTACCTTGG 240 

CTCTGCCAAC CCAGGCAGTA ACAGCCACCC TCCTGTCATC GCCACCACCG TTGTGTCCCT 300 

CAAGGCTGCG AATCTGACGT ATATGCCCTC ATCCAGCGGC TCTGCCCGCT CGCTGAATTG 360 

TGGATGCAGC AGTGCCAGCT GCTGCACTGT GGCAACCTAC GACAAGGACA ATCAGGCCCA 420 

AACCCAAGCC ATTGCCGCTG GCACCACCAC CACTGCCATC GGAACCTCTA CCACCTGCCC 480 

TGCTAACCAG ATGGTCAACA ATAATGAGAA TACAGGCTCT CTAAGTCCAT CAAGTGGGGT 540 

GGGCAGCCCT GTGTCAGGGA CCCCCAAGCA GCTAGCCAGC ATCAAAATAA TCTACCCCAA 600 

TGACTTGGCA AAGAAGATGA CCAAATGCAG CAAGAGTCAC CTGCCGAGTC AGGGCCCTGT 660 

CATCATTGAC TGCAGGCCCT TCATGGAGTA CAACAAGAGT CACATCCAAG GAGCTGTCCA 720 

CATTAACTGT GCCGATAAGA TCAGCCGGCG GAGACTGCAG CAGGGCAAGA TCACTGTCCT 780 

AGACTTGATT TCCTGTAGGG AAGGCAAGGA CTCTTTCAAG AGGATCTTTT CCAAAGAAAT 840 

TATAGTTTAT GATGAGAATA CCAATGAACC AAGCCGAGTG ATGCCCTCCC AGCCACTTCA 900 

CATAGTCCTC GAGTCCCTGA AGAGAGAAGG CAAAGAACCT CTGGTGTTGA AAGGTGGACT 960 

TAGTAGTTTT AAGCAGAACC ATGAAAACCT CTGTGACAAC TCCCTCCAGC TCCAAGAGTG 1020 

CCGGGAGGTG GGGGGCGGCG CATCCGCGGC CTCGAGCTTG CTACCTCAGC CCATCCCCAC 1080 

CACCCCTGAC ATCGAGAACG CTGAGCTCAC CCCCATCTTG CCCTTCCTGT TCCTTGGCAA 1140 

TGAGCAGGAT GCTCAGGACC TGGACACCAT GCAGCGGCTG AACATCGGCT ACGTCATCAA 1200 

CGTCACCACT CATCTTCCCC TCTACCACTA TGAGAAAGGC CTGTTCAACT ACAAGCGGCT 1260 

GCCAGCCACT GACAGCAACA AGCAGAACCT GCGGCAGTAC TTTGAAGAGG CTTTTGAGTT 1320 

CATTGAGGAA GCTCACCAGT GTGGGAAGGG GCTTCTCATC CACTGCCAGG CTGGGGTGTC 1380 

CCGCTCCGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA CCATGACTGA 1440 

TGCTTATAAA TTTGTCAAAG GCAAACGACC AATTATCTCC CCAAACCTTA ACTTCATGGG 1500 

GCAGTTGCTA GAGTTCGAGG AAGACCTAAA CAACGGTGTG ACACCGAGAA TCCTTACACC 1560 

AAAGCTGATG GGCGTGGAGA CGGTTGTGTG AC AATGOTCT GOATGGA AAG GATTGCTGCT 1620 

CTCCATTAGG AGACAATGAG GAAGGAGGAT GGATTCTGGT TTTTTTTCTT TCTTTTTTTT 1680 

TTGTAGTTGG GAGTAAAGTT TGTGAATGGA AACAAACTTG GTTAAACACT TTATTTTTAA 1740 

CAAGTGTAAG AAGACTATAC TTTTGATGCC ATTGAGATTC ACCTTCCACA AACTGGCCAA 1800 

ATTAAGGAGG TTAAAGAAGT AATTTTTTTT AAGCCCAACC ATTAAAAATT TAATACAACT 1860 

TGGTTTCTCC CCCTTTTTCC TTTAAAGCTA NTTTGTAAAA GTTTATGAG 

Seq ID NO: 130 Protein sequence: 
Protein Accession #: NP_009138.1 

1 11 21 31 41 51 
I I I I I I 

MPPSPLDDRV VVALSRPVRP QDLNLCLDSS YLGSANPGSN SHPPVIATTV VSLKAANLTY 60 
MPSSSGSARS LNCGCSSASC CTVATYDKDN QAQTQAIAAG TTTTAIGTST TCPANQMVNN 120 
NENTGSLSPS SGVGSPVSGT PKQLASIKII YPNDLAKKMT KCSKSHLPSQ GPVIIDCRPF 180 

MEYNKSHIQG AVHINCADKI SRRRLQQGKI TVLDLISCRE GKDSFKRIFS KEIIVYDENT 240 
NEPSRVMPSQ PLHIVLESLK REGKEPLVLK GGLSSFKQNH ENLCDNSLQL QECREVGGGA 300 
SAASSLLPQP IPTTPDIENA ELTPILPFLF LGNEQDAQDL DTMQRLNIGY VINVTTHLPL 360 
YHYEKGLFNY KRLPATDSNK QNLRQYFEEA FEFIEEAHQC GKGLLIHCQA GVSRSATIVI 420 
AYLMKHTRMT MTDAYKFVKG KRPIISPNLN FMGQLLEFEE DLNNGVTPRI LTPKLMGVET 480 
VV 

Seq ID NO: 131 DNA sequence 

Nucleic Acid Accession #: NM_005409.1 

Coding sequence: 94-378 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I I I I 

TTCCTTTCAT GTTCAGCATT TCTACTCCTT CCAAGAAGAG CAGCAAAGCT GAAGTAGCAG 60 

CAACAGCACC AGGAGCAACA GCAAAAAACA AACATGAGTG TGAAGGGCAT GGCTATAGCC 120 

TTGGCTGTGA TATTGTGTGC TACAGTTGTT CAAGGCTTCC CCATGTTCAA AAGAGGACGC 180 

TGTCTTTGCA TAGGCCCTGG GGTAAAAGCA GTGAAAGTGG CAGATATTGA GAAAGCCTCC 240 

ATAATGTACC CAAGTAACAA CTGTGACAAA ATAGAAGTGA TTATTACCCT GAAAGAAAAT 300 

AAAGGACAAC GATGCCTAAA TCCCAAATCG AAGCAAGCAA GGCTTATAAT CAAAAAAGTT 360 

GAAAGAAAGA ATTTTTAAAA ATATCAAAAC ATATGAAGTC CTGGAAAAGG GCATCTGAAA 420 

AACCTAGAAC AAGTTTAACT GTGACTACTG AAATGACAAG AATTCTACAG TAGGAAACTG 480 

AGACTTTTGT ATGGTTTTGT GACTTTCAAC TTTTGTACAG TTATGTGAAG GATGAAAGGT 540 

GGGTGAAAGG ACCAAAAACA GAAATACAGT CTTCCTGAAT GAATGACAAT CAGAATTCCA 600 

CTGCCCAAAG GAGTCCAGCA ATTAAATGGA TTTCTAGGAA AAGCTACCTT AAGAAAGGCT 660 

GGTTACCATC GGAGTTTACA AAGTGCTTTC ACGTTCTTAC TTGTTGTATT ATACATTCAT 720 

GCATTTCTAG GCTAGAGAAC CTTCTAGATT TGATGCTTAC AACTATTCTG TTGTGACTAT 780 

GAGAACATTT CTGTCTCTAG AAGTTATCTG TCTGTATTGA TCTTTATGCT ATATTACTAT 840 

CTGTGGTTAC AGTGG AGACA TTG ACATTAT TACTGG AGTC AAGCOCTTAT AAG TCAAA AO 900 

CATCTATGTG TCGTAAAGCA TTCCTCAAAC All ITU CAT GCAAATACAC ACTTCTTTCC 960 

CCAAATATCA TGTAGCACAT CAATATGTAG GGAAACATTC TTATGCATCA TTTGGTTTGT 1020 

TTTATAACCA ATTCATTAAA TGTAATTCAT AAAATGTACT ATGAAAAAAA TTATACGCTA 1080 

TGGGATACTG GCAACAGTGC ACATATTTCA TAACCAAATT AGCAGCACCG GTCTTAATTT 1140 

GATGTTTTTC AACTTTTATT CATTGAGATG TTTTGAAGCA ATTAGGATAT GTGTGTTTAC 1200 

TGTACTTTTT GTTTTGATCC GTTTGTATAA ATGATAGCAA TATCTTGGAC ACATTTGAAA 1260 

TACAAAATGT TTTTGTCTAC CAAAGAAAAA TGTTGAAAAA TAAGCAAATG TATACCTAGC 1320 

AATCACTTTT ACTTTTTGTA ATTCTGTCTC TTAGAAAAAT ACATAATCTA ATCAATTTCT 1380 

TTGTTCATGC CTATATACTG TAAAATTTAG GTATACTCAA GACTAGTTTA AAGAATCAAA 1440 

GTCATTTTTT TCTCTAATAA ACTACCACAA CCTTTCTTTT TTAAAAAAAA AAA 

Seq ID NO: 132 Protein sequence: 
Protein Accession #: NP_005400.1 

1 11 21 31 41 51 
I I I I I I 

MSVKGMAIAL AVILCATVVQ GFPMFKRGRC LCIGPGVKAV KVADIEKASI MYPSNNCDKI 60 
EVIITLKENK GQRCLNPKSK QARLIIKKVE RKNF 

Seq ID NO: 133 DNA sequence 

Nucleic Acid Accession #: NM_012342 

Coding sequence: 373-1155 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CTGGCGCGGG CGGGAGCTGC GGCGGATACC CTTGCGTGCT GTGGAGACCC TACTCTCTTC 60 
GCTGAGAACG GCCGCTAGCG GGGACTGAAG GCCGGGAGCC CACTCCCGAC CCGGGGCTAG 120 
CGTGCGTCCC TAGAGTCGAG CGGGGCAAGG GAGCCAGTGG CCGCCGACGG GGGACCGGGA 180 
AACTTTTCTG GGCTCCTGGA GAGCCCTGTA GCCGCGCTCC ATGCTCCGGC AGCGGCCCGA 240 
AACCCAGCCC CGCCGCTGAC GGAGCCCGCC GCTCCGGGCA GGGCCCATGC CCTGCGCGCT 300 
CCGGGGGTCG TAGCTGCCGC CGAGCCGGGG CTCCGGAAGC CGGCGGGGGC GCCGCGGCCG 360 
TGCGGGGCGT CAATGGATCG CCACTCCAGC TACATCTTCA TCTGGCTGCA GCTGGAGCTC 420 
TGCGCCATGG CCGTGCTGCT CACCAAAGGT GAAATTCGAT GCTACTGTGA TGCTGCCCAC 480 
TGTGTAGCCA CTGGTTATAT GTGTAAATCT GAGCTCAGCG CCTGCTTCTC TAGACTTCTT 540 
GATCCTCAGA ACTCAAATTC CCCACTCACC CATGGCTGCC TGGACTCTCT TGCAAGCACG 600 
ACAGACATCT GCCAAGCCAA ACAGGCCCGA AACCACTCTG GCACCACCAT ACCCACATTG 660 
GAATGCTGTC ATGAAGACAT GTGCAATTAC AGAGGGCTGC ACGATGTTCT CTCTCCTCCC 720 
AGGGGTGAGG CCTCAGGACA AGGAAACAGG TATCAGCATG ATGGTAGCAG AAACCTTATC 780 
ACCAAGGTGC AGGAGCTGAC TTCTTCCAAA GAGTTGTGGT TCCGGGCAGC GGTCATTGCC 840 
GTGCCCATTG CTGGAGGGCT GATTTTAGTG TTGCTTATTA TGTTGGCCCT GAGGATGCTT 900 
CGAAGTGAAA ATAAGAGGCT GCAGGATCAG CGGCAACAGA TGCTCTCCCG TTTGCACTAC 960 
AGCTTTCACG GACACCATTC CAAAAAGGGG CAGGTTGCAA AGTTAGACTT GGAATGCATG 1020 
GTGCCGGTCA GTGGGCACGA GAACTGCTGT CTGACCTGTG ATAAAATGAG ACAAGCAGAC 1080 
CTCAGCAACG ATAAGATCCT CTCGCTTGTT CACTGGGGCA TGTACAGTGG GCACGGGAAG 1140 
CTGGAATTCG TATGACGGAG TCTTATCTGA ACTACACTTA CTGAACAGCT TGAAGGCCTT 1200 
TTGAGTTCTG CTGGACAGGA GCACTTTATC TGAAGACAAA CTCATTTAAT CATCTTTGAG 1260 
AGACAAAATG ACCTCTGCAA ACAGAATCTT GGATATTTCT TCTGAAGGAT TATTTGCACA 1320 
GACTTAAATA CAGTTAAATG TGTTATTTGC TTTTAAAATT ATAAAAAGCA AAGAGAAGAC 1380 
TTTGTACACA CTGTCACCAG GGTTATTTGC ATCCAAGGGA GCTGGAATTG AGTACCTAAA 1440 
TAAACAAAAA TGTGCCCTAT GTAAGCTTCT ACATCTTGAT TTATTGTAAA GATTTAAAAG 1500 
AAATATATAT ATTTTGTCTG A 

Seq ID NO: 134 Protein sequence: 
Protein Accession #: NP_036474.1 

1 11 21 31 41 51 

MDRHSSYIFI WLQLELCAMA VLLTKGEIRC YCDAAHCVAT GYMCKSELSA CFSRLLDPQN 60 
SNSPLTHGCL DSLASTTDIC QAKQARNHSG TTIPTLECCH EDMCNYRGLH DVLSPPRGEA 120 
SGQGNRYQHD GSRNLITKVQ ELTSSKELWF RAAVIAVPIA GGLILVLLIM LALRMLRSEN 180 
KRLQDQRQQM LSRLHYSFHG HHSKKGQVAK LDLECMVPVS GHENCCLTCD KMRQADLSND 240 
KILSLVHWGM YSGHGKLEFV 

Seq ID NO: 135 DNA sequence 

Nucleic Acid Accession #: NM_001627.1 

Coding sequence: 64-1815 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

CGGGACGACG CCCCCTCCTG CGGCGTGGAC TCCGTCAGTG GCCCACCAAG AAGGAGGAGG 60 
AATATGGAAT CCAAGGGGGC CAGTTCCTGC CGTCTGCTCT TCTGCCTCTT GATCTCCGCC 120 
ACCGTCTTCA GGCCAGGCCT TGGATGGTAT ACTGTAAATT CAGCATATGG AGATACCATT 180 
ATCATACCTT GCCGACTTGA CGTACCTCAG AATCTCATGT TTGGCAAATG GAAATATGAA 240 
AAGCCCGATG GCTCCCCAGT ATTTATTGCC TTCAGATCCT CTACAAAGAA AAGTGTGCAG 300 
TACGACGATG TACCAGAATA CAAAGACAGA TTGAACCTCT CAGAAAACTA CACTTTGTCT 360 
ATCAGTAATG CAAGGATCAG TGATGAAAAG AGATTTGTGT GCATGCTAGT AACTGAGGAC 420 
AACGTGTTTG AGGCACCTAC AATAGTCAAG GTGTTCAAGC AACCATCTAA ACCTGAAATT 480 
GTAAGCAAAG CACTGTTTCT CGAAACAGAG CAGCTAAAAA AGTTGGGTGA CTGCATTTCA 540 
GAAGACAGTT ATCCAGATGG CAATATCACA TGGTACAGGA ATGGAAAAGT GCTACATCCC 600 
CTTGAAGGAG CGGTGGTCAT AATTTTTAAA AAGGAAATGG ACCCAGTGAC TCAGCTCTAT 660 
ACCATGACTT CCACCCTGGA GTACAAGACA ACCAAGGCTG ACATACAAAT GCCATTCACC 720 
TGCTCGGTGA CATATTATGG ACCATCTGGC CAGAAAACAA TTCATTCTGA ACAGGCAGTA 780 
TTTGATATTT ACTATCCTAC AGAGCAGGTG ACAATACAAG TGCTGCCACC AAAAAATGCC 840 
ATCAAAGAAG GGGATAACAT CACTCTTAAA TGCTTAGGGA ATGGCAACCC TCCCCCAGAG 900 
GAATTTTTGT TTTACTTACC AGGACAGCCC GAAGGAATAA GAAGCTCAAA TACTTACACA 960 
CTGATGGATG TGAGGCGCAA TGCAACAGGA GACTACAAGT GTTCCCTGAT AGACAAAAAA 1020 
AGCATGATTG CTTCAACAGC CATCACAGTT CACTATTTGG ATTTGTCCTT AAACCCAAGT 1080 
GGAGAAGTGA CTAGACAGAT TGGTGATGCC CTACCCGTGT CATGCACAAT ATCTGCTAGC 1140 
AGGAATGCAA CTGTGGTATG GATGAAAGAT AACATCAGGC TTCGATCTAG CCCGTCATTT 1200 
TCTAGTCTTC ATTATCAGGA TGCTGGAAAC TATGTCTGCG AAACTGCTCT GCAGGAGGTT 1260 
GAAGGACTAA AGAAAAGAGA GTCATTGACT CTCATTGTAG AAGGCAAACC TCAAATAAAA 1320 
ATGACAAAGA AAACTGATCC CAGTGGACTA TCTAAAACAA TAATCTGCCA TGTGGAAGGT 1380 
TTTCCAAAGC CAGCCATTCA GTGGACAATT ACTGGCAGTG GAAGCGTCAT AAACCAAACA 1440 
GAGGAATCTC CTTATATTAA TGGCAGGTAT TATAGTAAAA TTATCATTTC CCCTGAAGAG 1500 
AATGTTACAT TAACTTGCAC AGCAGAAAAC CAACTGGAGA GAACAGTAAA CTCCTTGAAT 1560 
GTCTCTGCTA TAAGTATTCC AGAACACGAT GAGGCAGACG AGATAAGTGA TGAAAACAGA 1620 
GAAAAGGTGA ATGACCAGGC AAAACTAATT GTGGGAATCG TTGTTGGTCT CCTCCTTGCT 1680 
GCCCTTGTTG CTGGTGTCGT CTACTGGCTG TACATGAAGA AGTCAAAGAC TGCATCAAAA 1740 
CATGTAAACA AGGACCTCGG TAATATGGAA GAAAACAAAA AGTTAGAAGA AAACAATCAC 1800 
AAAACTGAAG CCTAAGAGAG AAACTGTCCT AGTTGTCCAG AGATAAAAAT CATATAGACC 1860 
AATTGAAGCA TGAACGTGGA TTGTATTTAA GACATAAACA AAGACATTGA CAGCAATTCA 1920 
TGGTTCAAGT ATTAAGCAGT TCATTCTACC AAGCTGTCAC AGGTTTTCAG AGAATTATCT 1980 
CAAGTAAAAC AAATGAAATT TAATTACAAA CAATAAGAAC AAGTTTTGGC AGCCATGATA 2040 
ATAGGTCATA TGTTGTGTTT GGTTCAATTT TTTTTCCGTA AATGTCTGCA CTGAGGATTT 2100 
CTTTTTGOTT TGC CTTT TAT GTAAATTTTT TACGTAGCTA TTTTTATACA CTGTAAGCTT 2 1 60 
TGTTCTGGGA GTTGCTGTTA ATCTGATGTA TAATGTAATG TTTTTATTTC AATTGTTTAT 2220 
ATGGATAATC TGAGCAGGTA CATTTCTGAT TCTGATTGCT ATCAGCAATG CCCCAAACTT 2280 
TCTCATAAGC ACCTAAAACC CAAAGGTGGC AGCTTGTGAA GATTGGGGAC ACTCATATTG 2340 
CCCTAATTAA AAACTGTGAT TTTTATCACA AGGGAGGGGA GGCCGAGAGT CAGACTGATA 2400 
GACACCATAG GAGCCGACTC TTTGATATGC CACCAGCGAA CTCTCAGAAA TAAATCACAG 2460 
ATGCATATAG ACACACATAC ATAATGGTAC TCCCAAACTG ACAATTTTAC CTATTCTGAA 2520 
AAAGACATAA AACAGAATT 

Seq ID NO: 136 Protein sequence: 
Protein Accession #: NP_001618.1 

1 11 21 31 41 51 

MESKGASSCR LLFCLLISAT VFRPGLGWYT VNSAYGDTII IPCRLDVPQN LMFGKWKYEK 60 
PDGSPVFIAF RSSTKKSVQY DDVPEYKDRL NLSENYTLSI SNARISDEKR FVCMLVTEDN 120 
VFEAPTIVKV FKQPSKPEIV SKALFLETEQ LKKLGDCISE DSYPDGNITW YRNGKVLHPL 180 
EGAVVIIFKK EMDPVTQLYT MTSTLEYKTT KADIQMPFTC SVTYYGPSGQ KTIHSEQAVF 240 
DIYYPTEQVT IQVLPPKNAI KEGDNITLKC LGNGNPPPEE FLFYLPGQPE GIRSSNTYTL 300 
MDVRRNATGD YKCSLIDKKS MIASTAITVH YLDLSLNPSG EVTRQIGDAL PVSCTISASR 360 
NATVVWMKDN IRLRSSPSFS SLHYQDAGNY VCETALQEVE GLKKRESLTL IVEGKPQIKM 420 
TKKTDPSGLS KTIICHVEGF PKPAIQWTIT GSGSVINQTE ESPYINGRYY SKIIISPEEN 480 
VTLTCTAENQ LERTVNSLNV SAISIPEHDE ADEISDENRE KVNDQAKLIV GIVVGLLLAA 540 
LVAGVVYWLY MKKSKTASKH VNKDLGNMEE NKKLEENNHK TEA 



Seq ID NO: 137 DNA sequence 

Nucleic Acid Accession #: XM_030559 

Coding sequence: 1-1119 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

I I I I I I 
ATGAACCGCA GCCACCGGCA CGGGGCGGGC AGCGGCTGCC TGGGCACTAT GGAGGTGAAG 60 
AGCAAGTTTG GAGCTGAATT TCGTCGGTTT TCGCTGGAAA GATCAAAACC TGGAAAATTT 120 
GAGGAGTTTT ATGGATTACT ACAACATGTT CATAAGATCC CCAATGTTGA CGTTTTGGTA 180 
GGCTATGCAG ACATCCATGG AGACTTACTA CCTATAAATA ATGATGATAA TTATCACAAA 240 
GCTGTTTCAA CGGCCAATCC ACTGCTTAGG ATATTTATAC AAAAGAAGGA AGAAGCAGAC 300 
TACAGTGCCT TTGGTACAGA CACGCTAATA AAGAAGAAGA ATGTTTTAAC CAACGTATTG 360 
CGTCCTGACA ACCATAGAAA AAAGCCACAT ATAGTCATTA GTATGCCCCA AGACTTTAGA 420 
CCTGTGTCTT CTATTATAGA CGTGGATATT CTCCCAGAAA CGCATCGTAG GGTACGTCTT 480 
TACAAATACG GCACGGAGAA ACCCCTAGGA TTCTACATCC GGGATGGCTC CAGTGTCAGG 540 
GTAACACCAC ATGGCTTAGA AAAGGTTCCA GGGATCTTTA TATCCAGGCT TGTCCCAGGA 600 
GGTCTGGCTC AAAGTACAGG ACTATTAGCT GTTAATGATG AAGTTTTAGA AGTTAATGGC 660 
ATAGAAGTTT CAGGGAAGAG CCTTGATCAA GTAACAGACA TGATGATTGC AAATAGCCGT 720 
AACCTCATCA TAACAGTGAG ACCGGCAAAC CAGAGGAATA ATGTTGTGAG GAACAGTCGG 780 
ACTTCTGGCA GTTCCGGTCA GTCTACTGAT AACAGCCTTC TTGGCTACCC ACAGCAGATT 840 
GAACCAAGCT TTGAGCCAGA GGATGAAGAC AGCGAAGAAG ATGACATTAT CATTGAAGAC 900 
AATGGAGTGC CACAGCAGAT TCCAAAAGCT GTTCCTAATA CTGAGAGCCT GGAGTCATTA 960 
ACACAGATAG AGCTAAGCTT TGAGTCTGGA CAGAATGGCT TTATTCCCTC TAATGAAGTG 1020 
AGCTTAGCAG CCATAGCAAG CAGCTCAAAC ACGGAATTTG AAACACATGC TCCAGATCAA 1080 
AAACTCTTAG AAGAAGATGG AACAATCATA ACATTATGA 

Seq ID NO: 138 Protein sequence: 
Protein Accession #: XP_030559 

1 11 21 31 41 51 
I I I I I I 

MNRSHRHGAG SGCLGTMEVK SKFGAEFRRF SLERSKPGKF EEFYGLLQHV HKIPNVDVLV 60 
GYADIHGDLL PINNDDNYHK AVSTANPLLR IFIQKKEEAD YSAFGTDTLI KKKNVLTNVL 120 
RPDNHRKKPH IVISMPQDFR PVSSIIDVDI LPETHRRVRL YKYGTEKPLG FYIRDGSSVR 180 
VTPHGLEKVP GIFISRLVPG GLAQSTGLLA VNDEVLEVNG IEVSGKSLDQ VTDMMIANSR 240 
NLIITVRPAN QRNNVVRNSR TSGSSGQSTD NSLLGYPQQI EPSFEPEDED SEEDDIIIED 300 
NGVPQQIPKA VPNTESLESL TQIELSFESG QNGFIPSNEV SLAAIASSSN TEFETHAPDQ 360 
KLLEEDGTII TL 
It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 

WHAT IS CLAIMED IS: 

1. A method of detecting a breast cancer-associated transcript in a cell 
from a patient, the method comprising contacting a biological sample from the patient with a 
polynucleotide that selectively hybridizes to a sequence at least 80% identical to a sequence 
as shown in Tables 1-25. 

2. The method of claim 1, wherein the biological sample comprises 
isolated nucleic acids. 

3. The method of claim 2, wherein the nucleic acids are mRNA. 

4. The method of claim 2, further comprising the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 

5. The method of claim 1, wherein the polynucleotide comprises a 
sequence as shown in Tables 1-25. 

6. The method of claim 1, wherein the polynucleotide is immobilized on 
a solid surface. 

7. The method of claim 1, wherein the patient is undergoing a therapeutic 
regimen to treat breast cancer. 

8. The method of claim 1, wherein the patient is suspected of having 
breast cancer. 

9. An isolated nucleic acid molecule consisting of a polynucleotide 
sequence as shown in Tables 1-25. 

10. The nucleic acid molecule of claim 9, which is labeled. 

11. An expression vector comprising the nucleic acid of claim 9. 

12. A host cell comprising the expression vector of claim 11. 

13. An isolated polypeptide which is encoded by a nucleic acid molecule 
having polynucleotide sequence as shown in Tables 1-25. 

14. An antibody that specifically binds a polypeptide of claim 13. 

15. The antibody of claim 14, further conjugated to an effector component. 

16. The antibody of claim 15, wherein the effector component is a 
fluorescent label. 

17. The antibody of claim 15, wherein the effector component is a 
radioisotope or a cytotoxic chemical. 

18. The antibody of claim 15, which is an antibody fragment. 

19. The antibody of claim 15, which is a humanized antibody. 

20. A method of detecting a breast cancer cell in a biological sample from 
a patient, the method comprising contacting the biological sample with an antibody of claim 
14. 

21. The method of claim 20, wherein the antibody is further conjugated to 
an effector component. 

22. The method of claim 21, wherein the effector component is a 
fluorescent label. 

23. A method for identifying a compound that modulates a breast cancer-associated 
polypeptide, the method comprising the steps of: 
(i) contacting the compound with a breast cancer-associated polypeptide, the 
polypeptide encoded by a polynucleotide that selectively hybridizes to a sequence at least 
80% identical to a sequence as shown in Tables 1-25; and 
(ii) determining the functional effect of the compound upon the polypeptide. 

24. A drug screening assay comprising the steps of: 
(i) administering a test compound to a mammal having breast cancer or a cell 
isolated therefrom; 
(ii) comparing the level of gene expression of a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-25 in a 
treated cell or mammal with the level of gene expression of the polynucleotide in a control 
cell or mammal, wherein a test compound that modulates the level of expression of the 
polynucleotide is a candidate for the treatment of breast cancer. 